Skip to main content

A collection of tricks to speed up LLMs, see our transformer-tricks papers on arXiv

Project description

Setup

pip3 install --quiet -r requirements.txt

To run llama and other LLMs that need an agreement (not SmolLM), you first have to type the following:

huggingface-cli login

Above will ask you for the hf_token, which is the same you use e.g. in colab

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transformer_tricks-0.1.0.tar.gz (2.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

transformer_tricks-0.1.0-py3-none-any.whl (2.7 kB view details)

Uploaded Python 3

File details

Details for the file transformer_tricks-0.1.0.tar.gz.

File metadata

  • Download URL: transformer_tricks-0.1.0.tar.gz
  • Upload date:
  • Size: 2.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.9

File hashes

Hashes for transformer_tricks-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8cf920c9a468f2626dc032c92217a33569bde25022834a2ca5c8bdd3d313e0cd
MD5 84d19342dd0aebe011a7fad40f277bb8
BLAKE2b-256 83b3261cb8b16a3128584bae8e7b23959a8cc375ff98bc650d36e4fe2ba465ac

See more details on using hashes here.

File details

Details for the file transformer_tricks-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for transformer_tricks-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 75015f0faf5dfbe019a4221ac70a9f2f0ab91a8023ed7d3ebf971c98d39f0c4e
MD5 2b0c48991f294a07c33004066bf125db
BLAKE2b-256 9d138c4cf41d6ef958c9eb02e58f918907f6c18a1f3c223131824a926d97a712

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page