28 projects
cortexflowx
Brain-to-image/audio/text reconstruction using Diffusion Transformers and Flow Matching. Decode what someone saw, heard, or thought from fMRI.
lmscan
Detect AI-generated text and fingerprint which LLM wrote it. Open-source GPTZero alternative. Zero dependencies, works offline.
neuroattack
Adversarial robustness testing for neural encoding models — attack, analyze, and defend brain-AI interfaces.
vibescore
Grade your vibe-coded project. One command, instant letter grade across security, quality, dependencies, and testing.
datacruxai
Lightweight data quality toolkit for LLM instruction tuning. Deduplication, PII detection, contamination checking, and quality scoring — no GPU required.
injectionguard
Prompt injection detection for LLM applications and MCP servers.
modeldiffx
Behavioral regression testing for LLMs. Capture outputs, diff behavior, detect drift — pytest for model upgrades.
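The "diff behavior" idea can be sketched with the stdlib alone — this is a generic illustration of diffing two models' outputs, not modeldiffx's actual API:

```python
import difflib

def diff_outputs(baseline, candidate):
    """Unified diff between a baseline model's output and a candidate
    model's output for the same prompt. Illustrative sketch only."""
    return "\n".join(
        difflib.unified_diff(
            baseline.splitlines(),
            candidate.splitlines(),
            fromfile="baseline",
            tofile="candidate",
            lineterm="",
        )
    )
```

An empty result means the two models behaved identically on that prompt; anything else is a candidate regression to review.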
trainpulse
Lightweight training health monitor. Detect loss spikes, gradient explosions, and NaN — 2 lines of code, no server, no signup.
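The health checks named here (NaN detection, spike detection against a trailing average) fit in a few lines — a minimal sketch of the idea, with a hypothetical `check_loss` helper rather than trainpulse's real interface:

```python
import math

def check_loss(history, loss, spike_factor=3.0, window=20):
    """Flag non-finite losses and sudden spikes against a trailing mean.

    `history` is the list of accepted losses; returns "nan", "spike",
    or "ok". Hypothetical helper, not trainpulse's actual API.
    """
    if not math.isfinite(loss):
        return "nan"
    recent = history[-window:]
    if recent:
        baseline = sum(recent) / len(recent)
        if loss > spike_factor * baseline:
            return "spike"  # do not pollute the baseline with the spike
    history.append(loss)
    return "ok"
```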
quantbenchx
Quantization quality analyzer — pure-Python GGUF/safetensors parsing, layerwise analysis, quality prediction. Zero deps.
vibesafex
AI-generated code safety scanner for the vibe coding era.
ckptkit
Inspect, convert, diff, and merge model checkpoints. The missing Swiss Army knife for ML weights.
tokonomix
Universal LLM token counting and cost management. Track, compare, and optimize your LLM API spending.
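Cost tracking reduces to token counts times per-token prices — a sketch of that arithmetic with placeholder model names and rates (the prices below are made up, not real published pricing):

```python
# Per-1M-token prices in USD. Placeholder values for illustration only.
PRICES = {
    "model-a": {"input": 2.50, "output": 10.00},
    "model-b": {"input": 0.15, "output": 0.60},
}

def estimate_cost(model, input_tokens, output_tokens, prices=PRICES):
    """USD cost of one API call given token counts and per-1M prices."""
    p = prices[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000
```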
datamix
Dataset mixing & curriculum optimizer — profile, blend, schedule, and budget training data. Zero deps.
toksight
Tokenizer analysis toolkit. Compare vocabulary coverage, compression ratios, and token boundaries across GPT-4o, Llama 3, Mistral, and any HuggingFace tokenizer.
infermark
Benchmark any OpenAI-compatible LLM endpoint. TTFT, inter-token latency, throughput, P50-P99 — in one command.
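The P50–P99 summary comes down to percentiles over collected latency samples — a nearest-rank sketch of that computation, not infermark's implementation:

```python
def percentile(samples, pct):
    """Nearest-rank percentile of a list of samples, pct in [0, 100]."""
    ordered = sorted(samples)
    k = round(pct / 100 * (len(ordered) - 1))
    return ordered[max(0, min(len(ordered) - 1, k))]

# Example: inter-token latencies in milliseconds with one stall.
latencies_ms = [12, 15, 11, 300, 14, 13, 16, 12, 15, 14]
p50 = percentile(latencies_ms, 50)
p99 = percentile(latencies_ms, 99)
```

The tail percentile surfaces the 300 ms stall that the median hides, which is exactly why benchmarks report P99 alongside P50.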
castwright
Generate high-quality synthetic instruction-tuning data from seed examples. Simple API, built-in quality filtering, cost-aware.
llm-throttle-guard
Token-bucket rate limiter for LLM API calls with per-model limits.
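The token-bucket algorithm itself is standard: a bucket of `capacity` tokens refills at `rate` tokens per second, and a call proceeds only if it can pay its cost. A generic sketch of that algorithm, not the project's real interface:

```python
import time

class TokenBucket:
    """Classic token bucket: bursts up to `capacity`, refills at `rate`/sec."""

    def __init__(self, rate, capacity, clock=time.monotonic):
        self.rate = rate
        self.capacity = capacity
        self.tokens = capacity
        self.clock = clock  # injectable for testing
        self.last = clock()

    def allow(self, cost=1):
        """Consume `cost` tokens if available; return False otherwise."""
        now = self.clock()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        return False
```

Per-model limits would then be one bucket per model name in a dict.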
model-card-writer
Auto-generate model cards for ML models in Markdown format.
llm-cache-toolkit
Lightweight caching layer for LLM API responses with TTL and size limits.
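"TTL and size limits" is a well-known combination — a minimal sketch of the idea with stdlib `OrderedDict`, assuming a `TTLCache` class that is not the project's actual API:

```python
import time
from collections import OrderedDict

class TTLCache:
    """LRU-evicting cache where each entry also expires after `ttl` seconds."""

    def __init__(self, maxsize=128, ttl=60.0, clock=time.monotonic):
        self.maxsize = maxsize
        self.ttl = ttl
        self.clock = clock  # injectable for testing
        self._data = OrderedDict()  # key -> (expires_at, value)

    def set(self, key, value):
        self._data.pop(key, None)
        self._data[key] = (self.clock() + self.ttl, value)
        while len(self._data) > self.maxsize:
            self._data.popitem(last=False)  # evict least recently used

    def get(self, key, default=None):
        item = self._data.get(key)
        if item is None:
            return default
        expires_at, value = item
        if self.clock() >= expires_at:
            del self._data[key]  # lazy expiry on read
            return default
        self._data.move_to_end(key)  # refresh recency
        return value
```

Keying on a hash of (model, prompt, parameters) makes this a drop-in response cache.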
embedding-similarity
Compute and compare text embeddings with cosine similarity, no heavy dependencies.
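The comparison step is plain cosine similarity, which needs nothing beyond the stdlib — a dependency-free sketch of the formula, independent of the project's API:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors: dot(a, b) / (|a| * |b|)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0
```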
llm-ab-test
A/B testing framework for LLM prompts — compare prompt variants with statistical analysis.
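One standard way to compare two prompt variants' pass rates is a two-proportion z-test — a generic statistics sketch, not necessarily the method this project uses:

```python
import math

def two_proportion_z(successes_a, n_a, successes_b, n_b):
    """Two-proportion z-test on pass rates; returns (z, two-sided p-value)."""
    p_a, p_b = successes_a / n_a, successes_b / n_b
    pooled = (successes_a + successes_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    if se == 0:
        return 0.0, 1.0
    z = (p_a - p_b) / se
    # Normal CDF via erf; p-value is the two-sided tail probability.
    p_value = 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
    return z, p_value
```

For example, variant A passing 90/100 evaluations versus variant B's 70/100 gives a large z and a p-value well below 0.01, so the difference is unlikely to be noise.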
prompt-template-engine
Simple and safe templating engine for LLM prompts with variable substitution and guards.
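The stdlib already provides safe substitution with a guard against missing variables — a sketch of the idea using `string.Template`, not the project's own engine:

```python
from string import Template

def render_prompt(template, variables):
    """Substitute $-style variables into a prompt template.

    `Template.substitute` raises KeyError on a missing variable, which
    acts as a guard against silently shipping a broken prompt.
    Hypothetical helper name; not this project's API.
    """
    return Template(template).substitute(variables)
```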
pii-scrubber-lite
Scrub PII from text before sending to LLMs. Detect and redact emails, phones, SSNs, credit cards, names, and more.
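The detect-and-redact step is typically regex-driven — an illustrative sketch with three deliberately narrow patterns (a real scrubber, this project included, needs far broader coverage and name detection):

```python
import re

# Narrow, illustrative patterns only.
PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}[-.]\d{3}[-.]\d{4}\b"),
}

def scrub(text):
    """Replace each matched PII span with a [TYPE] placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text
```

Scrubbing before the API call means the raw PII never leaves the process.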
llm-spend-tracker
Track and limit LLM API spending in real-time. Drop-in middleware for OpenAI, Anthropic, Google, and any OpenAI-compatible API.
token-overflow
Prevent LLM context window overflow with token counting, smart truncation, and budget management.
llm-response-validator
Validate LLM responses against schemas, types, and constraints. Catch bad JSON, missing fields, and hallucinated formats before they crash your app.
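Schema validation at its simplest is parse-then-check — a minimal sketch where a schema maps field names to expected Python types, not this project's actual schema language:

```python
import json

def validate_response(raw, schema):
    """Parse a JSON string and check required fields and their types.

    Returns a list of error strings; an empty list means valid.
    Hypothetical helper, not this project's interface.
    """
    errors = []
    try:
        data = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"invalid JSON: {exc.msg}"]
    for field, expected in schema.items():
        if field not in data:
            errors.append(f"missing field: {field}")
        elif not isinstance(data[field], expected):
            errors.append(f"wrong type for {field}: {type(data[field]).__name__}")
    return errors
```

Running this before downstream code touches the response turns a crash into a retry.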
llm-fallback
Automatic failover between LLM providers. When OpenAI is down, seamlessly switch to Anthropic, Google, or any backup.
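Failover reduces to trying providers in order and surfacing all errors only if every one fails — a generic loop over provider callables, not the project's real API:

```python
def call_with_fallback(providers, prompt):
    """Try each (name, fn) provider in order; return the first success.

    Raises RuntimeError with all collected errors if every provider
    fails. Illustrative sketch only.
    """
    errors = []
    for name, fn in providers:
        try:
            return fn(prompt)
        except Exception as exc:
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))
```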
prompt-inject-detect
Detect and block prompt injection attacks before they reach your LLM. Zero dependencies.
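At its core, zero-dependency detection means matching known injection phrasings — a deliberately tiny illustration with three canonical patterns (real detectors, this project included, need far broader heuristics):

```python
import re

# A few canonical injection phrasings; purely illustrative.
INJECTION_PATTERNS = [
    re.compile(r"ignore\s+(all\s+)?previous\s+instructions", re.I),
    re.compile(r"disregard\s+your\s+system\s+prompt", re.I),
    re.compile(r"you\s+are\s+now\s+in\s+developer\s+mode", re.I),
]

def looks_like_injection(text):
    """True if the text matches any known injection pattern."""
    return any(p.search(text) for p in INJECTION_PATTERNS)
```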