Skip to main content

Training and Analyzing Sparse Autoencoders (SAEs)

Project description

Screenshot 2024-03-21 at 3 08 28 pm

SAE Lens

PyPI License: MIT build Deploy Docs codecov

SAELens exists to help researchers:

  • Train sparse autoencoders.
  • Analyse sparse autoencoders / research mechanistic interpretability.
  • Generate insights which make it easier to create safe and aligned AI systems.

Please refer to the documentation for information on how to:

  • Download and Analyse pre-trained sparse autoencoders.
  • Train your own sparse autoencoders.
  • Generate feature dashboards with the SAE-Vis Library.

SAE Lens is the result of many contributors working collectively to improve humanity's understanding of neural networks, many of whom are motivated by a desire to safeguard humanity from risks posed by artificial intelligence.

This library is maintained by Joseph Bloom, Curt Tigges, Anthony Duong and David Chanin.

Loading Pre-trained SAEs.

Pre-trained SAEs for various models can be imported via SAE Lens. See this page in the readme for a list of all SAEs.

Tutorials

Join the Slack!

Feel free to join the Open Source Mechanistic Interpretability Slack for support!

Citation

Please cite the package as follows:

@misc{bloom2024saetrainingcodebase,
   title = {SAELens},
   author = {Bloom, Joseph and Tigges, Curt and Duong, Anthony and Chanin, David},
   year = {2024},
   howpublished = {\url{https://github.com/jbloomAus/SAELens}},
}

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sae_lens-5.11.0.tar.gz (124.3 kB view details)

Uploaded Source

Built Distribution

sae_lens-5.11.0-py3-none-any.whl (132.1 kB view details)

Uploaded Python 3

File details

Details for the file sae_lens-5.11.0.tar.gz.

File metadata

  • Download URL: sae_lens-5.11.0.tar.gz
  • Upload date:
  • Size: 124.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for sae_lens-5.11.0.tar.gz
Algorithm Hash digest
SHA256 b9e049316b8211f831d9285e85231aa295576cf78a8d0a7b98d1db03c904e067
MD5 85dba2ef1d9e00168aed627e45ec9e0c
BLAKE2b-256 13c7754e8e75290c433292203ea1dcef9e5e6fd0cb1a9c6699b60513b7a22509

See more details on using hashes here.

Provenance

The following attestation bundles were made for sae_lens-5.11.0.tar.gz:

Publisher: build.yml on jbloomAus/SAELens

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file sae_lens-5.11.0-py3-none-any.whl.

File metadata

  • Download URL: sae_lens-5.11.0-py3-none-any.whl
  • Upload date:
  • Size: 132.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for sae_lens-5.11.0-py3-none-any.whl
Algorithm Hash digest
SHA256 afa015c1153f8ff5a01100a46698413a8d03a646ad63cf4dba12c296fbf664bf
MD5 172098c448766e5c9e8d4acf17b3b40a
BLAKE2b-256 e320385bdbabbf299af6ce1c79aa702153a467588e11ef7f607877cdacc2c27a

See more details on using hashes here.

Provenance

The following attestation bundles were made for sae_lens-5.11.0-py3-none-any.whl:

Publisher: build.yml on jbloomAus/SAELens

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page