Two-stage neural architecture search for large language models
Project description
Whittle 🪒🧠
Whittle is a Python library for compressing large language models (LLMs) by extracting sub-networks to balance performance and efficiency. It is based on LitGPT and allows to compress many state-of-the-art models.
- Neural Architecture Search: Workflows for pre-training super-networks and multi-objective search to select sub-networks.
- Evaluation: Easy extraction of sub-networks checkpoint and evaluation using LM-Eval-Harness
- Efficiency: Different metrics to estimate efficiency of sub-networks, such as latency, FLOPs, or energy consumption.
Installation
Whittle supports and is tested for python 3.9 to 3.12.
You can install whittle with:
pip install whittle
Install from source
Install whittle from source to get the most recent version:
git clone git@github.com:whittle-org/whittle.git
cd whittle
pip install -e .
Getting started with whittle
To explore and understand different functionalities of whittle checkout this colab notebook and examples/
Projects that use whittle
How to get involved
We more than happy for any code contribution. If you are interested in contribution to whittle, please read our contribution guide.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file whittle-0.4.0.tar.gz.
File metadata
- Download URL: whittle-0.4.0.tar.gz
- Upload date:
- Size: 72.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.0.1 CPython/3.12.8
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
64743f4c207991c47f957c8a8687c7ffdd2eaa41232779b04eafe39740bb8343
|
|
| MD5 |
ba7b5ebdc915c51b4be5bd22a85c7489
|
|
| BLAKE2b-256 |
464f98a7479e9d62ff3c0b768659e8d68a438f50bd9a23b7ffc0620706a4d5fc
|
Provenance
The following attestation bundles were made for whittle-0.4.0.tar.gz:
Publisher:
release.yml on whittle-org/whittle
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
whittle-0.4.0.tar.gz -
Subject digest:
64743f4c207991c47f957c8a8687c7ffdd2eaa41232779b04eafe39740bb8343 - Sigstore transparency entry: 155024521
- Sigstore integration time:
-
Permalink:
whittle-org/whittle@c8ae1188298c9848d3d60943789a4ea578aab5ca -
Branch / Tag:
refs/heads/210-fix-artifact-upload-for-changelog - Owner: https://github.com/whittle-org
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@c8ae1188298c9848d3d60943789a4ea578aab5ca -
Trigger Event:
workflow_dispatch
-
Statement type: