3 projects
noethersolve
46 verified tools for AI agents, covering physics, math, genetics, pharmacogenomics, LLM science, biochemistry, organic chemistry, quantum mechanics, networking, OS, crypto, and finance: 30 calculators and 16 lookup tables. Install with pip, add it to Claude, and the agent can call the tools right away.
rho-eval
Behavioral auditing toolkit for LLMs — audit any model across 8 dimensions (factual, toxicity, bias, sycophancy, reasoning, refusal, deception, over-refusal) using teacher-forced confidence probes.
knowledge-fidelity
Compress LLMs while auditing whether they can still tell truth from myths. SVD compression and false-belief detection in one toolkit.
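For readers unfamiliar with SVD compression, here is a minimal sketch of the general idea using NumPy. It is not the knowledge-fidelity API: the function name `low_rank_approx`, the random matrix standing in for a layer's weights, and the chosen rank are all assumptions made for illustration.

```python
import numpy as np


def low_rank_approx(weight, rank):
    """Return the truncated-SVD approximation of `weight` with the given rank.

    Hypothetical helper for illustration; not part of any library.
    """
    # Thin SVD: weight = u @ diag(s) @ vt, singular values sorted descending.
    u, s, vt = np.linalg.svd(weight, full_matrices=False)
    # Keep only the top `rank` singular triplets.
    return (u[:, :rank] * s[:rank]) @ vt[:rank, :]


rng = np.random.default_rng(0)
# Assumed stand-in for one weight matrix of a model (64 x 32).
weight = rng.standard_normal((64, 32))
rank = 8

approx = low_rank_approx(weight, rank)

# Relative Frobenius error introduced by dropping the smaller singular values.
rel_error = np.linalg.norm(weight - approx) / np.linalg.norm(weight)

# Parameters stored if the two factors are kept instead of the full matrix.
params_full = weight.size
params_low_rank = rank * (weight.shape[0] + weight.shape[1])

print(f"relative error: {rel_error:.3f}")
print(f"parameters: {params_full} -> {params_low_rank}")
```

Storing the two rank-8 factors takes fewer parameters than the full matrix, at the cost of the approximation error printed above. Whether that error changes what a model "knows" is the kind of question an audit would need to answer separately.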