Skip to main content

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

Project description

Unit Tests PyPI version PyPi downloads Contributions welcome

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices.

About

JetStream is a throughput and memory optimized engine for LLM inference on XLA devices, starting with TPUs (and GPUs in future -- PRs welcome).

JetStream Engine Implementation

Currently, there are two reference engine implementations available -- one for Jax models and another for Pytorch models.

Jax

Pytorch

Documentation

JetStream Standalone Local Setup

Getting Started

Setup

pip install -r requirements.txt

Run local server & Testing

Use the following commands to run a server locally:

# Start a server
python -m jetstream.core.implementations.mock.server

# Test local mock server
python -m jetstream.tools.requester

# Load test local mock server
python -m jetstream.tools.load_tester

Test core modules

# Test JetStream core orchestrator
python -m unittest -v jetstream.tests.core.test_orchestrator

# Test JetStream core server library
python -m unittest -v jetstream.tests.core.test_server

# Test mock JetStream engine implementation
python -m unittest -v jetstream.tests.engine.test_mock_engine

# Test mock JetStream token utils
python -m unittest -v jetstream.tests.engine.test_token_utils
python -m unittest -v jetstream.tests.engine.test_utils

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

google_jetstream-0.2.2.tar.gz (51.5 kB view details)

Uploaded Source

Built Distribution

google_jetstream-0.2.2-py3-none-any.whl (72.3 kB view details)

Uploaded Python 3

File details

Details for the file google_jetstream-0.2.2.tar.gz.

File metadata

  • Download URL: google_jetstream-0.2.2.tar.gz
  • Upload date:
  • Size: 51.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for google_jetstream-0.2.2.tar.gz
Algorithm Hash digest
SHA256 9ea3d238cbb2515cd21e2d2753453fdf505e2dc635b81cc159c08161fdad95ef
MD5 d66ddc697be003bab7f825a1bdb422b2
BLAKE2b-256 e91088c13224cdcabdd7e6a39352f9f345ccb0381aff252bee6341fd71dcc745

See more details on using hashes here.

File details

Details for the file google_jetstream-0.2.2-py3-none-any.whl.

File metadata

File hashes

Hashes for google_jetstream-0.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 d4372f6efbc9cfb7d88127d0c42f6efa89bdb754e2a943df2638fe077900606c
MD5 bbb1ee9717cb79e538c40cc848d4fa68
BLAKE2b-256 7550b7d5ccf7cb3863718dfebe6641973ffd7720a8a4ed22ae4db45b7a2c2954

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page