Training and Analyzing Sparse Autoencoders (SAEs)
SAE Lens
SAELens exists to help researchers:
- Train sparse autoencoders.
- Analyse sparse autoencoders and research mechanistic interpretability.
- Generate insights that make it easier to create safe and aligned AI systems.
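For readers new to the term, here is a minimal, dependency-free sketch of what a sparse autoencoder computes: an overcomplete ReLU encoder, a linear decoder, and a loss that combines reconstruction error with an L1 sparsity penalty. This is one common formulation, not SAELens's API. The class `TinySAE`, its parameters (`d_in`, `d_hidden`, `l1_coeff`), and the example input are hypothetical names chosen for illustration, and the sketch covers only the forward pass and loss, not the training loop.

```python
import random


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


class TinySAE:
    """Hypothetical toy sparse autoencoder; NOT part of SAELens."""

    def __init__(self, d_in, d_hidden, l1_coeff=1e-3, seed=0):
        rng = random.Random(seed)
        # Encoder maps d_in activations to d_hidden (usually larger) features.
        self.W_enc = [[rng.gauss(0, 0.1) for _ in range(d_in)]
                      for _ in range(d_hidden)]
        self.b_enc = [0.0] * d_hidden
        # Decoder maps features back to the original activation space.
        self.W_dec = [[rng.gauss(0, 0.1) for _ in range(d_hidden)]
                      for _ in range(d_in)]
        self.b_dec = [0.0] * d_in
        self.l1_coeff = l1_coeff

    def encode(self, x):
        # ReLU keeps feature activations non-negative.
        pre = [p + b for p, b in zip(matvec(self.W_enc, x), self.b_enc)]
        return [max(0.0, p) for p in pre]

    def decode(self, f):
        return [r + b for r, b in zip(matvec(self.W_dec, f), self.b_dec)]

    def loss(self, x):
        f = self.encode(x)
        x_hat = self.decode(f)
        # Mean squared reconstruction error plus L1 penalty on features.
        mse = sum((a - b) ** 2 for a, b in zip(x, x_hat)) / len(x)
        l1 = sum(abs(a) for a in f)
        return mse + self.l1_coeff * l1


sae = TinySAE(d_in=4, d_hidden=16)
x = [0.5, -1.0, 0.25, 2.0]  # stand-in for a model activation vector
features = sae.encode(x)
print(len(features), sae.loss(x))
```

In practice the weights would be fitted by gradient descent on this loss over many model activations, which is the part a training library handles.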
Please refer to the documentation for information on how to:
- Download and analyse pre-trained sparse autoencoders.
- Train your own sparse autoencoders.
- Generate feature dashboards with the SAE-Vis library.
Join the Slack!
Feel free to join the Open Source Mechanistic Interpretability Slack for support!
Download files
Source Distribution
sae_lens-0.2.2.tar.gz (44.7 kB)
Built Distribution
sae_lens-0.2.2-py3-none-any.whl (54.9 kB)