Skip to main content

Two-stage neural architecture search for large language models

Project description

Whittle Logo

Whittle 🪒🧠

Whittle Visualization

PyPI - Python Version unit-tests license

Whittle is a Python library for compressing large language models (LLMs) by extracting sub-networks to balance performance and efficiency. It is based on LitGPT and allows to compress many state-of-the-art models.

  • Neural Architecture Search: Workflows for pre-training super-networks and multi-objective search to select sub-networks.
  • Evaluation: Easy extraction of sub-networks checkpoint and evaluation using LM-Eval-Harness
  • Efficiency: Different metrics to estimate efficiency of sub-networks, such as latency, FLOPs, or energy consumption.

Installation

Whittle supports and is tested for python 3.9 to 3.12.

You can install whittle with:

pip install whittle

Install from source

Install whittle from source to get the most recent version:

git clone git@github.com:whittle-org/whittle.git
cd whittle
pip install -e .

Getting started with whittle

To explore and understand different functionalities of whittle checkout this colab notebook and examples/

Projects that use whittle

How to get involved

We more than happy for any code contribution. If you are interested in contribution to whittle, please read our contribution guide.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whittle-0.4.0.tar.gz (72.8 kB view details)

Uploaded Source

File details

Details for the file whittle-0.4.0.tar.gz.

File metadata

  • Download URL: whittle-0.4.0.tar.gz
  • Upload date:
  • Size: 72.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for whittle-0.4.0.tar.gz
Algorithm Hash digest
SHA256 64743f4c207991c47f957c8a8687c7ffdd2eaa41232779b04eafe39740bb8343
MD5 ba7b5ebdc915c51b4be5bd22a85c7489
BLAKE2b-256 464f98a7479e9d62ff3c0b768659e8d68a438f50bd9a23b7ffc0620706a4d5fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for whittle-0.4.0.tar.gz:

Publisher: release.yml on whittle-org/whittle

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page