A high-throughput and memory-efficient inference and serving engine for LLMs

Project description

The author of this package has not provided a project description.

Download files

Download the file for your platform.

Source Distribution

npu_vllm-0.4.2.post2.tar.gz (23.6 kB)

Uploaded Source

Built Distribution

npu_vllm-0.4.2.post2-py3-none-any.whl (30.8 kB)

Uploaded Python 3
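
The wheel file name above encodes its compatibility tags. As a minimal sketch, assuming the standard wheel naming layout (distribution, version, optional build tag, Python tag, ABI tag, platform tag), the name can be split as follows. The helper `parse_wheel_filename` and the `WheelName` type are hypothetical names introduced for illustration; they are not part of this package.

```python
from typing import NamedTuple, Optional


class WheelName(NamedTuple):
    """Components of a wheel file name (illustrative helper type)."""
    distribution: str
    version: str
    build: Optional[str]
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelName:
    """Split a wheel file name into its dash-separated components.

    Assumes the layout
    {distribution}-{version}(-{build})?-{python}-{abi}-{platform}.whl
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) == 5:
        dist, version, py, abi, plat = parts
        build = None
    elif len(parts) == 6:
        dist, version, build, py, abi, plat = parts
    else:
        raise ValueError(f"unexpected number of components in {filename!r}")
    return WheelName(dist, version, build, py, abi, plat)


info = parse_wheel_filename("npu_vllm-0.4.2.post2-py3-none-any.whl")
print(info)
```

For this file the tags are `py3`, `none`, and `any`: the wheel targets Python 3, does not depend on a specific ABI, and is not tied to a platform.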

File details

Details for the file npu_vllm-0.4.2.post2.tar.gz.

File metadata

  • Download URL: npu_vllm-0.4.2.post2.tar.gz
  • Upload date:
  • Size: 23.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for npu_vllm-0.4.2.post2.tar.gz
Algorithm    Hash digest
SHA256       676170e4c614e7f867018cc5b331b17b49013716848f5c2abc7965353ae91e61
MD5          0b6ad994bf38308fe5c20132f38abb48
BLAKE2b-256  94ee62401e665ef3d47284af8581bddba8bd25a1b6a975c7541cd5eb87762e97

File details

Details for the file npu_vllm-0.4.2.post2-py3-none-any.whl.

File metadata

  • Download URL: npu_vllm-0.4.2.post2-py3-none-any.whl
  • Upload date:
  • Size: 30.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.10.15

File hashes

Hashes for npu_vllm-0.4.2.post2-py3-none-any.whl
Algorithm    Hash digest
SHA256       5a2a7ea93e438b7b3915a5e788d87c22df2a99c78111e1c4091553a9531991e5
MD5          9519cd1061689c269f0a48f01e75936d
BLAKE2b-256  8339ae52a9084d0e808cb93cecccb6a6a52279438e59aad0ecdf373f0bedff19
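
The digests listed on this page can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `sha256_of` and `verify` are hypothetical, and it assumes the file was saved locally under its original name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables on this page.
EXPECTED_SHA256 = {
    "npu_vllm-0.4.2.post2.tar.gz":
        "676170e4c614e7f867018cc5b331b17b49013716848f5c2abc7965353ae91e61",
    "npu_vllm-0.4.2.post2-py3-none-any.whl":
        "5a2a7ea93e438b7b3915a5e788d87c22df2a99c78111e1c4091553a9531991e5",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's SHA256 matches the digest listed above.

    Raises KeyError when the file name is not one of the two listed files.
    """
    path = Path(path)
    expected = EXPECTED_SHA256[path.name]
    return sha256_of(path) == expected
```

Calling `verify("npu_vllm-0.4.2.post2.tar.gz")` on a downloaded copy returns `True` only when the local file matches the published digest. SHA256 is used here because MD5 is not suitable for security checks.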
