Skip to main content

A fast zero-copy PyTorch tensor streamer

Project description

Vajra

A fast zero-copy PyTorch tensor streamer powered by Dlang.

Usage

Once installed, you can use Vajra in any Python script or Jupyter Notebook:

import torch
from vajra import VajraStreamer, StreamConfig

# Configure the streamer (optional, uses defaults otherwise)
config = StreamConfig(
    auth_token="hf_YOUR_TOKEN", # Required for gated models like Llama 3
    chunk_size_mb=64,
    chunk_workers=16,
    gpu_workers=3,
    disable_cache=False
)

# Context manager ensures VRAM is freed when done
with VajraStreamer(config) as streamer:
    # Pass the URL or a Hugging Face repo ID
    # (e.g. "meta-llama/Meta-Llama-3-8B")
    tensors = streamer.load("meta-llama/Meta-Llama-3-8B")

    # 'tensors' is a dictionary mapping tensor names to zero-copy PyTorch tensors
    # that are backed directly by the downloaded GPU memory.
    for name, tensor in tensors.items():
        print(f"Tensor: {name}, Shape: {tensor.shape}, Dtype: {tensor.dtype}")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vajra_streamer-0.0.14.tar.gz (5.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vajra_streamer-0.0.14-cp310-cp310-manylinux_2_34_x86_64.whl (7.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.34+ x86-64

File details

Details for the file vajra_streamer-0.0.14.tar.gz.

File metadata

  • Download URL: vajra_streamer-0.0.14.tar.gz
  • Upload date:
  • Size: 5.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.15

File hashes

Hashes for vajra_streamer-0.0.14.tar.gz
Algorithm Hash digest
SHA256 644e0e1534e0f3da40972840bf53e95a310c561cef93c93ee8deb70eb9539b3e
MD5 639fe8ae1ee0a7d419d88a9299601637
BLAKE2b-256 df1ef8a69921cc2eca71b1ac997a5c496b40f3e0d478da959b9fb1524e5c836b

See more details on using hashes here.

File details

Details for the file vajra_streamer-0.0.14-cp310-cp310-manylinux_2_34_x86_64.whl.

File metadata

File hashes

Hashes for vajra_streamer-0.0.14-cp310-cp310-manylinux_2_34_x86_64.whl
Algorithm Hash digest
SHA256 fd591e1dcc5098ad6f1a72b15a3fd0ef59d61ffdb3cfef08bb26e9512d0a4109
MD5 f23649d46617d1d387eb80d49b7c9970
BLAKE2b-256 b516b97c3639a7441cc100e0a1e26f1b82f2902834f46bbd7bccfc176c587cb9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page