Skip to main content

A collection of tricks to speed up LLMs, see our transformer-tricks papers on arXiv

Project description

Setup

pip3 install --quiet -r requirements.txt

To run llama and other LLMs that need an agreement (not SmolLM), you first have to type the following:

huggingface-cli login

Above will ask you for the hf_token, which is the same you use e.g. in colab

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transformer_tricks-0.1.1.tar.gz (2.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

transformer_tricks-0.1.1-py3-none-any.whl (2.7 kB view details)

Uploaded Python 3

File details

Details for the file transformer_tricks-0.1.1.tar.gz.

File metadata

  • Download URL: transformer_tricks-0.1.1.tar.gz
  • Upload date:
  • Size: 2.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.9

File hashes

Hashes for transformer_tricks-0.1.1.tar.gz
Algorithm Hash digest
SHA256 dedd4dbd92c58646568e58ab8199169d26e946993d9e697e71afc5a07bc24951
MD5 39d69ca5d513d50029b6aa2443cb5e21
BLAKE2b-256 ff17426d42d3a6043ef8b93451a40c615d327d464a67d3775c55d99431032840

See more details on using hashes here.

File details

Details for the file transformer_tricks-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for transformer_tricks-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 304ab172d937b5feef0e48a9ad1340ff5e22160f5f30436dbf49d4ad573b0600
MD5 61489b27d9e333f179a2afc1bbb57fd5
BLAKE2b-256 d8c484c855d46a220f364f35579656c4c9f80b113357e3f57fabb2cfd6d84a0a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page