A unified library for creating, representing, and storing speculative decoding algorithms for LLM serving such as in vLLM.
Project description
A Unified Library for Speculative Decoding Algorithms for LLMs
Overview
Speculators is a unified library for creating, representing, and storing speculative decoding algorithms for large language model (LLM) serving, such as in vLLM. It provides a standard format and tools to facilitate the productization of decoding algorithms for inference servers.
Key Features
- Speculative Decoding: Simplify the creation and representation of speculative decoding algorithms for LLMs.
- Standardized Format: Ensure compatibility and ease of use with a standardized format for decoding algorithms.
- Integration Ready: Designed to integrate seamlessly with LLM inference servers like vLLM.
- Productization Tools: Streamline the process of deploying decoding algorithms in production environments.
Getting Started
Installation
Before installing, ensure you have the following prerequisites:
- OS: Linux or MacOS
- Python: 3.9 or higher
The latest Speculators release can be installed using pip:
pip install speculators
Or from source code using pip:
pip install git+https://github.com/neuralmagic/speculators.git
Quick Start
Coming soon
Resources
Documentation
Coming soon
Releases
Visit our GitHub Releases page and review the release notes to stay updated with the latest releases.
License
Speculators is licensed under the Apache License 2.0.
Cite
If you find Speculators helpful in your research or projects, please consider citing it:
@misc{speculators2025,
title={Speculators: A Unified Library for Speculative Decoding Algorithms in LLM Serving},
author={Red Hat},
year={2025},
howpublished={\url{https://github.com/neuralmagic/speculators}},
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file speculators-0.1.0a8.tar.gz.
File metadata
- Download URL: speculators-0.1.0a8.tar.gz
- Upload date:
- Size: 51.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
318bbab841d40637db7dca622ff664ba817d1e137858eea4ad14700f5b8cf445
|
|
| MD5 |
80b599f38913de51bc5850a3e9240a1e
|
|
| BLAKE2b-256 |
f61e7fe815f2bf601f77b5d7b5312fa110fb8f59c3423111759a4c3a45759a52
|
File details
Details for the file speculators-0.1.0a8-py3-none-any.whl.
File metadata
- Download URL: speculators-0.1.0a8-py3-none-any.whl
- Upload date:
- Size: 44.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.23
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
05bdad9f82485b929507d90adfa6a94b59aaec07b9cbc05e60fc6946634a35d0
|
|
| MD5 |
100c768293be4dbfcb502599a550fa15
|
|
| BLAKE2b-256 |
3a074ee17cbffc95b59eb258292fd2f77e9872f7f25fb6d6d315f577c70014d3
|