A high-throughput and memory-efficient inference and serving engine for LLMs
Project description
The author of this package has not provided a project description
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
npu_vllm-0.4.2.post1.tar.gz
(23.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file npu_vllm-0.4.2.post1.tar.gz.
File metadata
- Download URL: npu_vllm-0.4.2.post1.tar.gz
- Upload date:
- Size: 23.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4cbb7839ba9c1fca672bce744597d6507b16ed774d5e845b4f2afe07dbffd01f
|
|
| MD5 |
781502d85fe519eb9f271a6549d96f7a
|
|
| BLAKE2b-256 |
4a4fa0c115bd676f78fb85001317c224883d8307295fdfd678d317b5ebfbc328
|
File details
Details for the file npu_vllm-0.4.2.post1-py3-none-any.whl.
File metadata
- Download URL: npu_vllm-0.4.2.post1-py3-none-any.whl
- Upload date:
- Size: 30.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
69e70dcfe07f32b660b382bbad703cce0adcefa4350daae63e2c4605861449bc
|
|
| MD5 |
123a83d164554cc79587122e703f12a2
|
|
| BLAKE2b-256 |
b40012dd78992ae3941aca436b2ce454aaa9e014d635f28df47f7bd548b36c3d
|