A library for sharing GPU memory objects across processes using IPC mechanisms

These details have not been verified by PyPI

Project links

Project description

Shared Tensor

A high-performance library for sharing GPU memory objects across processes using IPC mechanisms with JSON-RPC 2.0 protocol, enabling model and inference engine separation architecture.

🚀 Project Overview

Shared Tensor is a cross-process communication library designed specifically for deep learning and AI applications, utilizing IPC mechanisms and JSON-RPC protocol to achieve:

Efficient GPU Memory Sharing: Cross-process sharing of PyTorch tensors and models
Remote Function Execution: Easy remote function calls through decorators
Async/Sync Support: Flexible execution modes for different scenarios
Model Serving: Deploy machine learning models as independent services
Distributed Inference: Support for distributed computing in multi-GPU environments

📋 Core Features

🔄 Cross-Process Communication

JSON-RPC 2.0 Protocol: Standardized remote procedure calls
HTTP Transport: Reliable HTTP-based communication mechanism
Serialization Optimization: Efficient PyTorch object serialization/deserialization

🎯 Function Sharing

Decorator Pattern: Easy function sharing using @provider.share
Auto Discovery: Smart function path resolution and import
Parameter Passing: Support for complex data type parameters

⚡ Async Support

Async Execution: AsyncSharedTensorProvider supports non-blocking calls
Task Management: Complete async task status tracking
Concurrent Processing: Efficient concurrent request handling

🖥️ GPU Compatibility

CUDA Support: Native CUDA tensor sharing support
Device Management: Smart data migration between devices
Memory Optimization: Efficient GPU memory usage

🛠️ Installation Guide

Requirements

Python: 3.8+
Operating System: Linux (recommended)
PyTorch: 1.12.0+
CUDA: Optional, for GPU support

Installation Methods

Install from Source

# Clone the repository
git clone https://github.com/world-sim-dev/shared-tensor.git
cd shared-tensor

# Install dependencies
pip install -r requirements.txt

# Install the package
pip install -e .

Development Installation

# Install with development dependencies
pip install -e ".[dev]"

# Install with test dependencies
pip install -e ".[test]"

Verify Installation

# Check core functionality
python -c "import shared_tensor; print('✓ Shared Tensor installed successfully')"

🎯 Quick Start

1. Basic Function Sharing

from shared_tensor.async_provider import AsyncSharedTensorProvider

# Create provider
provider = AsyncSharedTensorProvider()

# Share simple function
@provider.share()
def add_numbers(a, b):
    return a + b

# Share PyTorch function
@provider.share()
def create_tensor(shape):
    import torch
    return torch.zeros(shape)

# Load PyTorch model
@provider.share()
def load_model():
    ...

2. Start Server

# Method 1: Use command line tool, single server
shared-tensor-server

# Method 2: Use torchrun
torchrun --nproc_per_node=4 --no-python shared-tensor-server

# Method 3: Custom configuration
python shared_tensor/server.py

📖 Detailed Usage

Model Sharing Example

import torch
import torch.nn as nn

from shared_tensor.async_provider import AsyncSharedTensorProvider

# Create provider
provider = AsyncSharedTensorProvider()

# Define model
class SimpleNet(nn.Module):
    def __init__(self, input_size, hidden_size, output_size):
        super().__init__()
        self.fc1 = nn.Linear(input_size, hidden_size)
        self.relu = nn.ReLU()
        self.fc2 = nn.Linear(hidden_size, output_size)
    
    def forward(self, x):
        x = self.fc1(x)
        x = self.relu(x)
        x = self.fc2(x)
        return x

# Share model creation function
@provider.share(name="create_model")
def create_model(input_size=784, hidden_size=128, output_size=10):
    model = SimpleNet(input_size, hidden_size, output_size)
    return model

# Share inference function
model = create_model()
with torch.no_grad():
    model(input_data)

🔧 Configuration Options

Server Configuration

from shared_tensor.server import SharedTensorServer

server = SharedTensorServer(
    host="0.0.0.0",           # Listen address
    port=2537,                # Port number
    timeout=30,               # Request timeout
    max_workers=4,            # Maximum worker threads
    enable_cache=True,        # Enable result caching
    debug=False               # Debug mode
)

🧪 Testing

Run Test Suite

# Run all tests
python tests/run_tests.py

# Run specific category tests
python tests/run_tests.py --category unit
python tests/run_tests.py --category integration
python tests/run_tests.py --category pytorch

# Run only PyTorch related tests
python tests/run_tests.py --torch-only

# Verbose output
python tests/run_tests.py --verbose

Test Environment Info

# Check test environment
python tests/run_tests.py --env-info

Individual Test Files

# Test tensor serialization
python tests/pytorch_tests/test_tensor_serialization.py

# Test async system
python tests/integration/test_async_system.py

# Test client
python tests/integration/test_client.py

🏗️ Architecture Design

Core Components

shared-tensor/
├── shared_tensor/              # Core modules
│   ├── server.py              # JSON-RPC server
│   ├── client.py              # Sync client
│   ├── provider.py            # Sync provider
│   ├── async_client.py        # Async client
│   ├── async_provider.py      # Async provider
│   ├── async_task.py          # Async task management
│   ├── jsonrpc.py            # JSON-RPC protocol implementation
│   ├── utils.py              # Utility functions
│   └── errors.py             # Exception definitions
├── examples/                  # Usage examples
└── tests/                     # Test suite

Communication Flow

sequenceDiagram
    participant CA as Client App
    participant SC as SharedTensorClient
    participant SS as SharedTensorServer
    participant FE as Function Executor
    
    Note over CA, FE: Client-Server Communication Flow
    
    CA->>SC: call_function("model_inference", args)
    SC->>SC: Serialize parameters
    SC->>SS: HTTP POST /jsonrpc<br/>JSON-RPC Request
    
    Note over SS: Server Processing
    SS->>SS: Parse JSON-RPC request
    SS->>SS: Resolve function path
    SS->>FE: Import & execute function
    FE->>FE: Deserialize parameters
    FE->>FE: Execute function logic
    FE->>SS: Return execution result
    
    Note over SS: Response Preparation
    SS->>SS: Serialize result
    SS->>SS: Create JSON-RPC response
    SS->>SC: HTTP Response<br/>JSON-RPC Result
    
    Note over SC: Client Processing
    SC->>SC: Parse response
    SC->>SC: Deserialize result
    SC->>CA: Return final result
    
    Note over CA, FE: End-to-End Process Complete

Debug Tips

Enable verbose logging:

import logging
logging.basicConfig(level=logging.DEBUG)

Use debug mode:

provider = SharedTensorProvider(verbose_debug=True)

Check function paths:

provider = SharedTensorProvider()
print(provider._registered_functions)

🤝 Contributing

We welcome community contributions! Please follow these steps:

Development Environment Setup

# Clone repository
git clone https://github.com/world-sim-dev/shared-tensor.git
cd shared-tensor

# Create virtual environment
python -m venv venv
source venv/bin/activate

# Install development dependencies
pip install -e ".[dev]"

# Install pre-commit hooks
pre-commit install

# Package & Publish
python setup.py sdist bdist_wheel
python -m twine upload --repository testpypi dist/*
python -m twine upload dist/*

Code Standards

# Code formatting
black shared_tensor/ tests/ examples/

# Import sorting
isort shared_tensor/ tests/ examples/

# Static checking
flake8 shared_tensor/
mypy shared_tensor/

Submission Process

Fork the project and create a feature branch
Write code and tests
Run the complete test suite
Submit a Pull Request

Test Requirements

New features must include tests
Maintain test coverage > 90%
All tests must pass

📄 License

This project is licensed under the Apache 2.0 License - see the LICENSE file for details

🙏 Acknowledgments

PyTorch - Deep learning framework
JSON-RPC 2.0 - Remote procedure call protocol

📞 Contact Us

Issues: GitHub Issues
Documentation: Shared Tensor Documentation
Source: GitHub Repository

Shared Tensor - Making GPU memory sharing simple and efficient 🚀

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.16

Mar 28, 2026

0.2.15

Mar 28, 2026

0.2.13

Mar 27, 2026

0.2.12

Mar 27, 2026

0.2.11

Mar 27, 2026

0.2.10

Mar 26, 2026

0.2.9

Mar 26, 2026

0.2.8

Mar 26, 2026

0.2.7

Mar 26, 2026

0.2.6

Mar 25, 2026

0.2.5

Mar 25, 2026

0.2.4

Mar 25, 2026

0.2.2

Mar 25, 2026

0.2.1

Mar 25, 2026

0.1.2

Sep 4, 2025

0.1.1

Sep 4, 2025

This version

0.1.0

Sep 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shared_tensor-0.1.0.tar.gz (25.5 kB view details)

Uploaded Sep 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

shared_tensor-0.1.0-py3-none-any.whl (29.4 kB view details)

Uploaded Sep 4, 2025 Python 3

File details

Details for the file shared_tensor-0.1.0.tar.gz.

File metadata

Download URL: shared_tensor-0.1.0.tar.gz
Upload date: Sep 4, 2025
Size: 25.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for shared_tensor-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`9432eda8a08b9084c8a0346d9ac07fd7e8d754187e73d834db611beba038b955`
MD5	`765c9d4ce3e3b68b2f27b42c42d6f14e`
BLAKE2b-256	`a001022dcd2ac5b048ac83ddf08c481c45b6c91963ece901145b94ddd96f488f`

See more details on using hashes here.

File details

Details for the file shared_tensor-0.1.0-py3-none-any.whl.

File metadata

Download URL: shared_tensor-0.1.0-py3-none-any.whl
Upload date: Sep 4, 2025
Size: 29.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.12

File hashes

Hashes for shared_tensor-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`93bdaec4d97e4126c4710b3e9f3dbedd35b11f8ddeca885c7db7a1a5de743c9b`
MD5	`ab307f780c49b443d2a6c4dc22dcddbc`
BLAKE2b-256	`a6171eb902a50b3f872c35646a8f6cdd9c66620bf41b78e76eb659ad2dd797ee`

See more details on using hashes here.

shared-tensor 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Shared Tensor

🚀 Project Overview

📋 Core Features

🔄 Cross-Process Communication

🎯 Function Sharing

⚡ Async Support

🖥️ GPU Compatibility

🛠️ Installation Guide

Requirements

Installation Methods

Install from Source

Development Installation

Verify Installation

🎯 Quick Start

1. Basic Function Sharing

2. Start Server

📖 Detailed Usage

Model Sharing Example

🔧 Configuration Options

Server Configuration

🧪 Testing

Run Test Suite

Test Environment Info

Individual Test Files

🏗️ Architecture Design

Core Components

Communication Flow

Debug Tips

🤝 Contributing

Development Environment Setup

Code Standards

Submission Process

Test Requirements

📄 License

🙏 Acknowledgments

📞 Contact Us

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes