Profile of EleutherAI

Last released Jul 20, 2023

Keeping language models honest by directly eliciting knowledge encoded in their activations

Last released Mar 12, 2026

Automated Interpretability

Last released Jul 22, 2026

Tracing the memory of neural nets with data attribution

Last released Jan 10, 2024

Erasing concepts from neural representations with provable guarantees

Last released Sep 2, 2024

Efficiently computing & storing token n-grams from large corpora

Last released Jul 16, 2026

Sparsify transformers with SAEs and transcoders

EleutherAI