Skip to main content

A throughput and memory optimized engine for LLM inference on TPU and GPU.

Project description

JetStream - A throughput and memory optimized engine for LLM inference on TPU and GPU

About

JetStream is a fast library for LLM inference and serving on TPU and GPU.

Getting Started

Run local server & Testing

Use the following commands to run a server locally:

# Start a server
python -m jetstream.core.implementations.mock.server

# Test local mock server
python -m jetstream.core.tools.requester

# Load test local mock server
python -m jetstream.core.tools.load_tester

Test core modules

# Test JetStream core orchestrator
python -m jetstream.core.orchestrator_test

# Test JetStream core server library
python -m jetstream.core.server_test

# Test mock JET engine implementation
python -m jetstream.engine.mock_engine_test

# Test mock JET engine implementation
python -m jetstream.engine.utils_test

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

google-jetstream-0.1.1.tar.gz (27.0 kB view details)

Uploaded Source

Built Distribution

google_jetstream-0.1.1-py3-none-any.whl (41.7 kB view details)

Uploaded Python 3

File details

Details for the file google-jetstream-0.1.1.tar.gz.

File metadata

  • Download URL: google-jetstream-0.1.1.tar.gz
  • Upload date:
  • Size: 27.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.0

File hashes

Hashes for google-jetstream-0.1.1.tar.gz
Algorithm Hash digest
SHA256 353b597e509f1a73c1a62963e9a5e9f0f14e2562fce2030d51aa37ad2c2a8b15
MD5 6fc085c6db59a68d7aedf12e08e8f347
BLAKE2b-256 2ad2b515689cdf65a60e8b6645e9878b4802cc5e24852fe08757d0794f53bd35

See more details on using hashes here.

File details

Details for the file google_jetstream-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for google_jetstream-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c49d1f8bfc9cb4dcefc6fb5ec915f0d418cf23db71006524d04243bc51277959
MD5 25a3d3d1be1a1e8779030dc970b086f7
BLAKE2b-256 247664a64461a6ec2017b7bdd1e54a7edb48a4fcfd29f3727b4d8c710595ad1b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page