Skip to main content

A unified library for creating, representing, and storing speculative decoding algorithms for LLM serving such as in vLLM.

Project description

Speculators Logo

A Unified Library for Speculative Decoding Algorithms for LLMs

GitHub Release Documentation License Python Versions

Overview

Speculators is a unified library for creating, representing, and storing speculative decoding algorithms for large language model (LLM) serving, such as in vLLM. It provides a standard format and tools to facilitate the productization of decoding algorithms for inference servers.

Key Features

  • Speculative Decoding: Simplify the creation and representation of speculative decoding algorithms for LLMs.
  • Standardized Format: Ensure compatibility and ease of use with a standardized format for decoding algorithms.
  • Integration Ready: Designed to integrate seamlessly with LLM inference servers like vLLM.
  • Productization Tools: Streamline the process of deploying decoding algorithms in production environments.

Getting Started

Installation

Before installing, ensure you have the following prerequisites:

  • OS: Linux or MacOS
  • Python: 3.9 or higher

The latest Speculators release can be installed using pip:

pip install speculators

Or from source code using pip:

pip install git+https://github.com/neuralmagic/speculators.git

Quick Start

Coming soon

Resources

Documentation

Coming soon

Releases

Visit our GitHub Releases page and review the release notes to stay updated with the latest releases.

License

Speculators is licensed under the Apache License 2.0.

Cite

If you find Speculators helpful in your research or projects, please consider citing it:

@misc{speculators2025,
  title={Speculators: A Unified Library for Speculative Decoding Algorithms in LLM Serving},
  author={Red Hat},
  year={2025},
  howpublished={\url{https://github.com/neuralmagic/speculators}},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

speculators-0.1.0a9.tar.gz (51.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

speculators-0.1.0a9-py3-none-any.whl (44.8 kB view details)

Uploaded Python 3

File details

Details for the file speculators-0.1.0a9.tar.gz.

File metadata

  • Download URL: speculators-0.1.0a9.tar.gz
  • Upload date:
  • Size: 51.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for speculators-0.1.0a9.tar.gz
Algorithm Hash digest
SHA256 fecc8883085bd9560e8c0dbc1ee2c268f756e7e17dc0056158c7f485983b82e9
MD5 04135eee7e6c1db4acef2dd8cc263c6b
BLAKE2b-256 6e762461c5ab1944152df28f25a525a4db3491acd0fd8fa3609aa1c3d8f1385d

See more details on using hashes here.

File details

Details for the file speculators-0.1.0a9-py3-none-any.whl.

File metadata

  • Download URL: speculators-0.1.0a9-py3-none-any.whl
  • Upload date:
  • Size: 44.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.23

File hashes

Hashes for speculators-0.1.0a9-py3-none-any.whl
Algorithm Hash digest
SHA256 64bf46d17c3b94399858f5f6f5ece60ffd61c1a82de90cfd14b98299f92f2c24
MD5 ae3f700a28721d3d14624228882f6651
BLAKE2b-256 de0ee832d6b1c3455953b13571daf95600443791d374f5cb3135ec6b410a1973

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page