Training and Analyzing Sparse Autoencoders (SAEs)
Project description
SAE Lens
SAELens exists to help researchers:
- Train sparse autoencoders.
- Analyse sparse autoencoders / research mechanistic interpretability.
- Generate insights which make it easier to create safe and aligned AI systems.
Please refer to the documentation for information on how to:
- Download and Analyse pre-trained sparse autoencoders.
- Train your own sparse autoencoders.
- Generate feature dashboards with the SAE-Vis Library.
SAE Lens is the result of many contributors working collectively to improve humanity's understanding of neural networks, many of whom are motivated by a desire to safeguard humanity from risks posed by artificial intelligence.
This library is maintained by Joseph Bloom and David Chanin.
Loading Pre-trained SAEs.
Pre-trained SAEs for various models can be imported via SAE Lens. See this page in the readme for a list of all SAEs.
Tutorials
- SAE Lens + Neuronpedia
- Loading and Analysing Pre-Trained Sparse Autoencoders
- Understanding SAE Features with the Logit Lens
- Training a Sparse Autoencoder
Join the Slack!
Feel free to join the Open Source Mechanistic Interpretability Slack for support!
Citation
Please cite the package as follows:
@misc{bloom2024saetrainingcodebase,
title = {SAELens},
author = {Joseph Bloom, Curt Tigges and David Chanin},
year = {2024},
howpublished = {\url{https://github.com/jbloomAus/SAELens}},
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file sae_lens-4.3.2.tar.gz
.
File metadata
- Download URL: sae_lens-4.3.2.tar.gz
- Upload date:
- Size: 128.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8f24f0c86492dd7e19a814e5285a4c5b1d791ac82097bbc885aab6a6ddfbdd0a |
|
MD5 | 89695f9eeb54abd4de86c2c863bb22f3 |
|
BLAKE2b-256 | 42fc255261a1c9ed2b91bfa1d56caa899833c6650d4971a0de8d8a093ff272aa |
Provenance
The following attestation bundles were made for sae_lens-4.3.2.tar.gz
:
Publisher:
build.yml
on jbloomAus/SAELens
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
sae_lens-4.3.2.tar.gz
- Subject digest:
8f24f0c86492dd7e19a814e5285a4c5b1d791ac82097bbc885aab6a6ddfbdd0a
- Sigstore transparency entry: 148260116
- Sigstore integration time:
- Predicate type:
File details
Details for the file sae_lens-4.3.2-py3-none-any.whl
.
File metadata
- Download URL: sae_lens-4.3.2-py3-none-any.whl
- Upload date:
- Size: 139.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3acbb506a2cbd18dda7cfbc4a187d4db936c7bc08385ff90a2a997dff684bd7 |
|
MD5 | a56b0241c3713443e407b1293960cbb1 |
|
BLAKE2b-256 | 2e73367b47179b3b88ca751f1b7565f6cc52f353bfb365d4dae0f72dea1f4b1f |
Provenance
The following attestation bundles were made for sae_lens-4.3.2-py3-none-any.whl
:
Publisher:
build.yml
on jbloomAus/SAELens
-
Statement type:
https://in-toto.io/Statement/v1
- Predicate type:
https://docs.pypi.org/attestations/publish/v1
- Subject name:
sae_lens-4.3.2-py3-none-any.whl
- Subject digest:
a3acbb506a2cbd18dda7cfbc4a187d4db936c7bc08385ff90a2a997dff684bd7
- Sigstore transparency entry: 148260117
- Sigstore integration time:
- Predicate type: