A readable LLM inference server implementing paged attention and continuous batching

Project description

The author of this package has not provided a project description.

Download files

Source Distribution

llm_infer-0.1.0.tar.gz (293.9 kB view details)

Uploaded Source

Built Distribution

llm_infer-0.1.0-py3-none-any.whl (231.8 kB view details)

Uploaded Python 3

File details

Details for the file llm_infer-0.1.0.tar.gz.

File metadata

  • Download URL: llm_infer-0.1.0.tar.gz
  • Upload date:
  • Size: 293.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_infer-0.1.0.tar.gz
Algorithm Hash digest
SHA256 4e22ca3f0a8c72634b163ff39de8c102d8269c90a896d5d14264b04daa7326da
MD5 9fb1dcf81a49ad44c0b7299a607d34e3
BLAKE2b-256 09899d913874a8d9f968fca649c67cadc59af9d875ddb1f66a8112bfd39eb426
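A downloaded copy of the source distribution can be checked against the SHA256 digest listed above. The following is a minimal sketch using only Python's standard `hashlib` module. The helper names `sha256_of` and `matches_published_hash` are hypothetical and not part of `llm_infer`, and the example assumes the archive has been saved locally under its published file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed on this page for llm_infer-0.1.0.tar.gz.
EXPECTED_SHA256 = "4e22ca3f0a8c72634b163ff39de8c102d8269c90a896d5d14264b04daa7326da"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`.

    The file is read in chunks so large archives are not loaded
    into memory all at once.
    """
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals `expected_hex`.

    Hypothetical helper for illustration; the comparison is
    case-insensitive on the expected value.
    """
    return sha256_of(path) == expected_hex.lower()
```

Calling `matches_published_hash("llm_infer-0.1.0.tar.gz")` on a local copy should return `True` only if the file is byte-for-byte identical to the one these hashes were computed from. The same function can check the wheel by passing its SHA256 digest as `expected_hex`.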

Provenance

The following attestation bundles were made for llm_infer-0.1.0.tar.gz:

Publisher: release.yml on serendip-ml/llm-infer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file llm_infer-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: llm_infer-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 231.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_infer-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cca8f506a88f0c85e15e1fe4decea74a603835048d27c27f302a3e68ef432bdb
MD5 b227bc15b1a9fa90c48e0b74c18ea9a6
BLAKE2b-256 2d0f31a499b92f148dc82623f0237d0e46ced8998b66a8223ae1cfc4f131594b

Provenance

The following attestation bundles were made for llm_infer-0.1.0-py3-none-any.whl:

Publisher: release.yml on serendip-ml/llm-infer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
