Skip to main content

Training and Analyzing Sparse Autoencoders (SAEs)

Project description

Screenshot 2024-03-21 at 3 08 28 pm

SAE Lens

PyPI License: MIT build Deploy Docs codecov

SAELens exists to help researchers:

  • Train sparse autoencoders.
  • Analyse sparse autoencoders / research mechanistic interpretability.
  • Generate insights which make it easier to create safe and aligned AI systems.

Please refer to the documentation for information on how to:

  • Download and Analyse pre-trained sparse autoencoders.
  • Train your own sparse autoencoders.
  • Generate feature dashboards with the SAE-Vis Library.

SAE Lens is the result of many contributors working collectively to improve humanity's understanding of neural networks, many of whom are motivated by a desire to safeguard humanity from risks posed by artificial intelligence.

This library is maintained by Joseph Bloom and David Chanin.

Loading Pre-trained SAEs.

Pre-trained SAEs for various models can be imported via SAE Lens. See this page in the readme for a list of all SAEs.

Tutorials

Join the Slack!

Feel free to join the Open Source Mechanistic Interpretability Slack for support!

Citation

Please cite the package as follows:

@misc{bloom2024saetrainingcodebase,
   title = {SAELens},
   author = {Joseph Bloom, Curt Tigges and David Chanin},
   year = {2024},
   howpublished = {\url{https://github.com/jbloomAus/SAELens}},
}

Project details


Release history Release notifications | RSS feed

This version

4.1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sae_lens-4.1.0.tar.gz (122.3 kB view details)

Uploaded Source

Built Distribution

sae_lens-4.1.0-py3-none-any.whl (132.1 kB view details)

Uploaded Python 3

File details

Details for the file sae_lens-4.1.0.tar.gz.

File metadata

  • Download URL: sae_lens-4.1.0.tar.gz
  • Upload date:
  • Size: 122.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for sae_lens-4.1.0.tar.gz
Algorithm Hash digest
SHA256 b3eea72c9fd79191411fef228b729f44c66fe9faf95bf8fff4686154ee85b2d9
MD5 9715ccc4ea9cd00540e4229d7f322c84
BLAKE2b-256 69d3b6d60688fa1703deb59b1d244f6ea0b1f2137da6f2a7e64bfa75f02b35cb

See more details on using hashes here.

Provenance

The following attestation bundles were made for sae_lens-4.1.0.tar.gz:

Publisher: build.yml on jbloomAus/SAELens

Attestations:

File details

Details for the file sae_lens-4.1.0-py3-none-any.whl.

File metadata

  • Download URL: sae_lens-4.1.0-py3-none-any.whl
  • Upload date:
  • Size: 132.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for sae_lens-4.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 dd81b382efffc63d398308a58206cbaa3d8d6b822629895cb84eafc7e55ef4e8
MD5 0412d43062aa8b1a1a1ddcbe9855b375
BLAKE2b-256 c28a0787c055981e5d0a0ce4759bc50eda228409da951e9371b42b68a4ccb74d

See more details on using hashes here.

Provenance

The following attestation bundles were made for sae_lens-4.1.0-py3-none-any.whl:

Publisher: build.yml on jbloomAus/SAELens

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page