Speculative decoding and kernel implementations for inference acceleration

Project description

rotalabs-accel

Inference acceleration from Rotalabs.

Speculative decoding and kernel implementations for an 8.1x speedup.

This is a placeholder package. The full implementation is coming soon.

Download files

Source Distribution

rotalabs_accel-0.0.1.tar.gz (1.0 kB)

Uploaded Source

Built Distribution

rotalabs_accel-0.0.1-py3-none-any.whl (1.5 kB)

Uploaded Python 3

File details

Details for the file rotalabs_accel-0.0.1.tar.gz.

File metadata

  • Download URL: rotalabs_accel-0.0.1.tar.gz
  • Upload date:
  • Size: 1.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for rotalabs_accel-0.0.1.tar.gz

Algorithm    Hash digest
SHA256       c45c6a417555add5136d55c8100f7dbb432c7c4eee3911223548fe1e7cd29b59
MD5          0677cc5e20dced7375b4d169e0168de9
BLAKE2b-256  ccc008358082415dc7c74882f81970be565ee5dbfbc87319c16bb59709a52300

File details

Details for the file rotalabs_accel-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: rotalabs_accel-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 1.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.7

File hashes

Hashes for rotalabs_accel-0.0.1-py3-none-any.whl

Algorithm    Hash digest
SHA256       c9cd76309c6eac2e139aae6913042531fa563c9ade511e23ac1e630c62415fc6
MD5          2b17f5d729403e17216b7c66aecb95c7
BLAKE2b-256  05f0f3e10b6141cc2fac904b43ade286479ed304974af61fcfced96c7fe1bb3d
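
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using only the Python standard library; the helper names `sha256_of` and `verify_download` are hypothetical and not part of rotalabs-accel, and the expected values are the SHA256 digests listed above.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" sections of this page.
EXPECTED_SHA256 = {
    "rotalabs_accel-0.0.1.tar.gz":
        "c45c6a417555add5136d55c8100f7dbb432c7c4eee3911223548fe1e7cd29b59",
    "rotalabs_accel-0.0.1-py3-none-any.whl":
        "c9cd76309c6eac2e139aae6913042531fa563c9ade511e23ac1e630c62415fc6",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Hypothetical helper: True if the file matches its published digest.

    Raises KeyError when the file name has no digest in EXPECTED_SHA256.
    """
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no published digest for {path.name}")
    return sha256_of(path) == expected
```

Calling `verify_download("rotalabs_accel-0.0.1.tar.gz")` on a file in the current directory returns `True` only when its contents hash to the digest published here. SHA256 is used in this sketch because it is the strongest of the three listed algorithms that `hashlib` exposes under a fixed name.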
