Profile of EleutherAI

Last released Jul 20, 2023

Keeping language models honest by directly eliciting knowledge encoded in their activations

Last released Aug 27, 2025

Automated Interpretability

Last released Jan 30, 2026

Tracing the memory of neural nets with data attribution

Last released Jan 10, 2024

Erasing concepts from neural representations with provable guarantees

Last released Sep 2, 2024

Efficiently computing & storing token n-grams from large corpora

Last released Nov 17, 2025

Sparsify transformers with SAEs and transcoders

EleutherAI