Skip to main content

Training and Analyzing Sparse Autoencoders (SAEs)

Project description

Screenshot 2024-03-21 at 3 08 28 pm

SAE Lens

PyPI License: MIT build Deploy Docs codecov

SAELens exists to help researchers:

  • Train sparse autoencoders.
  • Analyse sparse autoencoders / research mechanistic interpretability.
  • Generate insights which make it easier to create safe and aligned AI systems.

Please refer to the documentation for information on how to:

  • Download and Analyse pre-trained sparse autoencoders.
  • Train your own sparse autoencoders.
  • Generate feature dashboards with the SAE-Vis Library.

SAE Lens is the result of many contributors working collectively to improve humanity's understanding of neural networks, many of whom are motivated by a desire to safeguard humanity from risks posed by artificial intelligence.

This library is maintained by Joseph Bloom and David Chanin.

Loading Pre-trained SAEs.

Pre-trained SAEs for various models can be imported via SAE Lens. See this page in the readme for a list of all SAEs.

Tutorials

Join the Slack!

Feel free to join the Open Source Mechanistic Interpretability Slack for support!

Citation

Please cite the package as follows:

@misc{bloom2024saetrainingcodebase,
   title = {SAELens},
   author = {Joseph Bloom, Curt Tigges and David Chanin},
   year = {2024},
   howpublished = {\url{https://github.com/jbloomAus/SAELens}},
}

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sae_lens-4.0.5.tar.gz (121.6 kB view details)

Uploaded Source

Built Distribution

sae_lens-4.0.5-py3-none-any.whl (131.4 kB view details)

Uploaded Python 3

File details

Details for the file sae_lens-4.0.5.tar.gz.

File metadata

  • Download URL: sae_lens-4.0.5.tar.gz
  • Upload date:
  • Size: 121.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for sae_lens-4.0.5.tar.gz
Algorithm Hash digest
SHA256 8b4b659889f1a85b74a0c281f05c86ff1b72ea51ec9834027a25c3c1012e2010
MD5 b31a83e86af3dae4464e153ddaaee819
BLAKE2b-256 8d82cd99d38a321293b7d27086be95a087a0751b018fc3502c770babcb5e4894

See more details on using hashes here.

File details

Details for the file sae_lens-4.0.5-py3-none-any.whl.

File metadata

  • Download URL: sae_lens-4.0.5-py3-none-any.whl
  • Upload date:
  • Size: 131.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for sae_lens-4.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 6b616c21b41b3ecad8d8b412d72c0b0dfbf6b7417f3fa4fdb2502e57e11076e2
MD5 92cc6c633051836f3325bc1a10d47019
BLAKE2b-256 73c4b60e68715650839146f1bb6765e751f200a6a8d64b5757741cf1af4a6ad5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page