High-performance zero-copy tensor protocol

Project description

Tenso

Up to 12.6x faster than Apache Arrow. 32x less CPU than SafeTensors.

Zero-copy, SIMD-aligned tensor protocol for high-performance ML infrastructure.

Why Tenso?

Most serialization formats are designed for general data or disk storage. Tenso is focused on network tensor transmission where every microsecond matters.

The Problem

Traditional formats waste CPU cycles during deserialization:

SafeTensors: 41.3% CPU usage (great for disk, overkill for network)
Pickle: 43.3% CPU usage + security vulnerabilities
Arrow: Fast, but 12.6x slower than Tenso for large tensors

The Solution

Tenso achieves true zero-copy with:

Minimalist Header: Fixed 8-byte header eliminates JSON parsing overhead.
64-byte Alignment: SIMD-ready padding ensures the data body is cache-line aligned.
Direct Memory Mapping: The deserialized array is a view over the received buffer, so no tensor bytes are copied.

Result: ~1.3% CPU usage vs >40% for SafeTensors/Pickle.
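The zero-copy idea can be illustrated with plain NumPy. This is a minimal sketch, not Tenso's implementation: the buffer and variable names are hypothetical, and the only library calls used are `np.frombuffer` and `np.shares_memory`. It contrasts a view over existing bytes with an explicit copy.

```python
import numpy as np

# Hypothetical packet body: raw float32 bytes held in a mutable buffer.
source = np.arange(6, dtype=np.float32)
buffer = bytearray(source.tobytes())

# Zero-copy path: frombuffer wraps the existing memory instead of duplicating it.
view = np.frombuffer(buffer, dtype=np.float32).reshape(2, 3)

# Copying path, for comparison: a fresh allocation plus a full copy of the data.
copied = view.copy()

# Overwrite the first element in the underlying buffer (same length, no resize).
buffer[0:4] = np.float32(42.0).tobytes()

# The view sees the change because it shares memory; the copy does not.
shares_view = np.shares_memory(view, np.frombuffer(buffer, dtype=np.float32))
first_in_view = float(view[0, 0])
first_in_copy = float(copied[0, 0])
```

Because the view aliases the buffer, the buffer must stay alive and unmodified for as long as the array is in use.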

Benchmarks

System: Python 3.12.9, NumPy 2.3.5, 12 CPU cores, macOS

Deserialization Speed (8192×8192 Float32 Matrix)

Format	Time	CPU Usage	Relative to Tenso
Tenso	0.064ms	1.3%	1x (baseline)
Arrow	0.810ms	1.2%	12.6x slower
SafeTensors	2.792ms	41.3%	43x slower
Pickle	3.031ms	43.3%	47x slower

Stream Reading Performance (95MB Packet)

Method	Time	Throughput	Relative to Tenso
Tenso read_stream	7.05ms	13,534 MB/s	1x (baseline)
Naive loop	7,399.7ms	12.8 MB/s	1,050x slower
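The benchmark does not say what the naive loop looks like. One common pattern, assumed here, is growing a `bytes` object chunk by chunk, which may re-copy everything read so far on each iteration. The sketch below contrasts that with filling a single preallocated buffer via `readinto()`. Both helper functions are hypothetical illustrations and are not Tenso's `read_stream`.

```python
import io

def read_exact_naive(stream, size, chunk=4096):
    """Grow an immutable bytes object chunk by chunk (repeated copying)."""
    data = b""
    while len(data) < size:
        part = stream.read(min(chunk, size - len(data)))
        if not part:
            raise EOFError("stream ended early")
        data += part  # allocates a new bytes object each time
    return data

def read_exact_prealloc(stream, size):
    """Fill one preallocated buffer in place with readinto()."""
    buf = bytearray(size)
    view = memoryview(buf)
    filled = 0
    while filled < size:
        n = stream.readinto(view[filled:])  # writes directly into buf
        if not n:
            raise EOFError("stream ended early")
        filled += n
    return buf

payload = bytes(range(256)) * 4096  # 1 MiB of sample data
naive = read_exact_naive(io.BytesIO(payload), len(payload))
fast = read_exact_prealloc(io.BytesIO(payload), len(payload))
```

Both functions return the same bytes; the difference is how many intermediate allocations and copies happen along the way.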

Installation

pip install tenso

Quick Start (v0.10.1)

Basic Serialization

import numpy as np
import tenso

# Create tensor
data = np.random.rand(1024, 1024).astype(np.float32)

# Serialize
packet = tenso.dumps(data)

# Deserialize (Zero-copy view)
restored = tenso.loads(packet)

Async I/O

import asyncio
import tenso

async def handle_client(reader, writer):
    # Asynchronously read a tensor from the stream
    data = await tenso.aread_stream(reader)
    
    # Process and write back
    await tenso.awrite_stream(data * 2, writer)

FastAPI Integration

from fastapi import FastAPI
import numpy as np
from tenso.fastapi import TensoResponse

app = FastAPI()

@app.get("/tensor")
async def get_tensor():
    data = np.ones((1024, 1024), dtype=np.float32)
    return TensoResponse(data) # Zero-copy streaming response

Advanced Features

GPU Acceleration (Direct Transfer)

Supports fast transfers between Tenso streams and device memory for CuPy, PyTorch, and JAX using pinned host memory.

import tenso.gpu as tgpu

# `stream` is assumed to be an already-open readable byte stream carrying a Tenso packet
# Read directly from a stream into a GPU tensor
torch_tensor = tgpu.read_to_device(stream, device_id=0)

Sparse Formats & Bundling

Tenso natively supports complex data structures beyond simple dense arrays:

Sparse Matrices: Direct serialization for COO, CSR, and CSC formats.
Dictionary Bundling: Pack multiple tensors into a single nested dictionary packet.
LZ4 Compression: Optional high-speed compression for sparse or redundant data.
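To make the sparse and bundling ideas concrete, the sketch below shows how a CSR matrix reduces to three dense arrays that fit naturally into one dictionary. It uses NumPy only. The dictionary keys are hypothetical and say nothing about Tenso's actual wire layout.

```python
import numpy as np

dense = np.array([[0, 0, 3],
                  [4, 0, 0],
                  [0, 5, 6]], dtype=np.float32)

# np.nonzero returns coordinates in row-major order, which is what CSR needs.
rows, cols = np.nonzero(dense)
counts = np.bincount(rows, minlength=dense.shape[0])  # nonzeros per row

# Hypothetical bundle: a CSR matrix expressed as plain dense arrays.
bundle = {
    "data": dense[rows, cols],                      # nonzero values
    "indices": cols.astype(np.int32),               # column index of each value
    "indptr": np.concatenate(([0], np.cumsum(counts))).astype(np.int32),
    "shape": np.array(dense.shape, dtype=np.int64),
}

# Rebuild the dense matrix from the bundle to confirm nothing was lost.
row_of_value = np.repeat(np.arange(dense.shape[0]), np.diff(bundle["indptr"]))
rebuilt = np.zeros(tuple(bundle["shape"]), dtype=np.float32)
rebuilt[row_of_value, bundle["indices"]] = bundle["data"]
```

Here four nonzero values replace nine dense entries; the saving grows with the sparsity of the matrix.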

Data Integrity (XXH3)

Protect your tensors against network corruption with ultra-fast 64-bit checksums:

# Serialize with 64-bit checksum footer
packet = tenso.dumps(data, check_integrity=True)

# Verification is automatic during loads()
restored = tenso.loads(packet)
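The footer mechanism can be sketched with the standard library alone. Note the substitution: this example uses `zlib.crc32` as a stand-in for XXH3, which lives in a third-party package, and stores the 32-bit result in an assumed 8-byte little-endian footer. The function names are hypothetical and this is not Tenso's code.

```python
import struct
import zlib

FOOTER = struct.Struct("<Q")  # assumed layout: 8-byte little-endian footer

def seal(body: bytes) -> bytes:
    """Append a checksum of the body as a fixed-size footer."""
    return body + FOOTER.pack(zlib.crc32(body))

def open_sealed(packet: bytes) -> bytes:
    """Split off the footer, recompute the checksum, and compare."""
    body, footer = packet[:-FOOTER.size], packet[-FOOTER.size:]
    (expected,) = FOOTER.unpack(footer)
    if zlib.crc32(body) != expected:
        raise ValueError("checksum mismatch: packet corrupted")
    return body

def is_intact(packet: bytes) -> bool:
    """Convenience wrapper that reports corruption as a boolean."""
    try:
        open_sealed(packet)
        return True
    except ValueError:
        return False

packet = seal(b"tensor-bytes")
corrupted = bytes([packet[0] ^ 0x01]) + packet[1:]  # flip one bit in the body
```

A fixed-size footer lets the receiver locate the checksum without parsing anything, at the cost of 8 extra bytes per packet.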

gRPC Integration

Tenso provides built-in support for gRPC, allowing you to pass tensors between services with minimal overhead.

from tenso.grpc import tenso_msg_pb2, tenso_msg_pb2_grpc
import tenso

# In your Servicer
def Predict(self, request, context):
    data = tenso.loads(request.tensor_packet)
    result = data * 2
    return tenso_msg_pb2.PredictResponse(
        result_packet=bytes(tenso.dumps(result))
    )

Protocol Design

Tenso uses a minimalist structure designed for direct memory access:

┌─────────────┬──────────────┬──────────────┬────────────────────────┬──────────────┐
│   HEADER    │    SHAPE     │   PADDING    │    BODY (Raw Data)     │    FOOTER    │
│   8 bytes   │  Variable    │   0-63 bytes │   C-Contiguous Array   │   8 bytes*   │
└─────────────┴──────────────┴──────────────┴────────────────────────┴──────────────┘
                                                                        (*Optional)

The padding ensures the body starts at a 64-byte boundary, enabling AVX-512 vectorization and zero-copy memory mapping.
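The padding arithmetic implied by the diagram can be sketched as follows. Only the 8-byte header and the 64-byte boundary come from the layout above. The function names are hypothetical, and the size of the shape block is left as a parameter because its encoding is not specified here.

```python
ALIGNMENT = 64    # body must start on a 64-byte boundary
HEADER_SIZE = 8   # fixed header size from the layout above

def body_padding(shape_bytes: int) -> int:
    """Bytes of padding needed after header + shape block (0 to 63)."""
    offset = HEADER_SIZE + shape_bytes
    return (-offset) % ALIGNMENT

def body_offset(shape_bytes: int) -> int:
    """Offset of the first body byte, counted from the packet start."""
    return HEADER_SIZE + shape_bytes + body_padding(shape_bytes)

# Example values for a few hypothetical shape-block sizes.
padding_small = body_padding(8)    # 8 + 8 = 16, so 48 bytes of padding
padding_exact = body_padding(56)   # 8 + 56 = 64, already aligned
padding_worst = body_padding(57)   # 8 + 57 = 65, so 63 bytes of padding
```

These offsets are relative to the start of the packet. The body's memory address is 64-byte aligned only if the buffer holding the packet also starts on a 64-byte boundary.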

Use Cases

Model Serving APIs: Up to 12.6x faster deserialization than Arrow reduces per-request CPU overhead on inference nodes.
Distributed Training: Efficiently pass gradients or activations between nodes (Ray, Spark).
GPU-Direct Pipelines: Stream data from network cards to GPU memory with minimal host intervention.
Real-time Robotics: Sub-millisecond latency for high-frequency sensor fusion (LIDAR, Radar).

Contributing

Contributions are welcome! We are currently looking for help with:

Rust Core: Porting serialization logic to Rust for even lower overhead.
C++ / JavaScript Clients: Extending the protocol to other ecosystems.

License

Apache License 2.0 - see LICENSE file.

Citation

@software{tenso2025,
  author = {Khushiyant},
  title = {Tenso: High-Performance Zero-Copy Tensor Protocol},
  year = {2025},
  version = {0.10.1},
  url = {https://github.com/Khushiyant/tenso}
}

Project details

Release history

Version	Released
0.20.1	Mar 28, 2026
0.20.0	Mar 28, 2026
0.19.4	Feb 14, 2026
0.19.3	Feb 14, 2026
0.19.2	Feb 14, 2026
0.12.1	Dec 29, 2025
0.12.0	Dec 29, 2025 (this version)
0.11.0	Dec 29, 2025
0.10.1	Dec 27, 2025
0.10.0	Dec 26, 2025
0.9.1	Dec 22, 2025
0.9.0	Dec 22, 2025
0.8.1	Dec 21, 2025
0.8.0	Dec 21, 2025
0.7.0	Dec 21, 2025
0.6.1	Dec 18, 2025
0.6.0	Dec 16, 2025
0.5.1	Dec 15, 2025
0.5.0	Dec 15, 2025
0.4.7	Dec 14, 2025
0.4.3	Dec 14, 2025
0.3.2	Dec 10, 2025
0.2.2	Dec 4, 2025
0.2.1	Dec 4, 2025
0.2.0	Dec 4, 2025
0.1.1	Dec 3, 2025
0.1.0	Dec 3, 2025

Download files

Source Distribution

tenso-0.12.0.tar.gz (12.9 kB view details)

Uploaded Dec 29, 2025 Source

Built Distribution

tenso-0.12.0-py3-none-any.whl (16.9 kB view details)

Uploaded Dec 29, 2025 Python 3

File details

Details for the file tenso-0.12.0.tar.gz.

File metadata

Download URL: tenso-0.12.0.tar.gz
Upload date: Dec 29, 2025
Size: 12.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tenso-0.12.0.tar.gz
Algorithm	Hash digest
SHA256	`9bc0776420f02a292f9c2892d3b2cd61b42aec614af2d354310d280519e2cfe1`
MD5	`02af266faaf62c8858d076464975198b`
BLAKE2b-256	`e54af42bb21ebd8b6e0b56bc87f32c96a510fafcd22977c9d73335a260d14686`

Provenance

The following attestation bundles were made for tenso-0.12.0.tar.gz:

Publisher: release.yml on Khushiyant/tenso

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: tenso-0.12.0.tar.gz
- Subject digest: 9bc0776420f02a292f9c2892d3b2cd61b42aec614af2d354310d280519e2cfe1
- Sigstore transparency entry: 781583641
- Sigstore integration time: Dec 29, 2025
Source repository:
- Permalink: Khushiyant/tenso@d4d8674ec7267704f307c936cb92a6b70882e132
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Khushiyant
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@d4d8674ec7267704f307c936cb92a6b70882e132
- Trigger Event: push

File details

Details for the file tenso-0.12.0-py3-none-any.whl.

File metadata

Download URL: tenso-0.12.0-py3-none-any.whl
Upload date: Dec 29, 2025
Size: 16.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for tenso-0.12.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1e572a3ea250854e32a3e4be354dcdccc93f6fa077fb3741a9af3b4b0c100699`
MD5	`c8708d2432d2bd03a448fac2247fce3a`
BLAKE2b-256	`5e6ed14fb90ecb6715596da3e8e2dd783da9142235a4386fa00b8f77cad9b64f`

Provenance

The following attestation bundles were made for tenso-0.12.0-py3-none-any.whl:

Publisher: release.yml on Khushiyant/tenso

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: tenso-0.12.0-py3-none-any.whl
- Subject digest: 1e572a3ea250854e32a3e4be354dcdccc93f6fa077fb3741a9af3b4b0c100699
- Sigstore transparency entry: 781583651
- Sigstore integration time: Dec 29, 2025
Source repository:
- Permalink: Khushiyant/tenso@d4d8674ec7267704f307c936cb92a6b70882e132
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Khushiyant
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@d4d8674ec7267704f307c936cb92a6b70882e132
- Trigger Event: push
