Profile of colingfly

Last released Apr 12, 2026

GPU inference benchmarking with opinionated diagnostics

Last released Apr 12, 2026

Foundation Model Active Learning for autonomous robot object discovery

Last released Apr 4, 2026

Open-source agentic infrastructure. Build, evaluate, fine-tune, and deploy AI agents.

Last released Mar 30, 2026

Behavioral profiling benchmark for LLMs. Profile any model's personality, extract steering vectors, and generate DPO training pairs.

Last released Mar 17, 2026

Agent Reliability Layer. LLM-as-Judge evaluation, schema validation, latency tracking, and reliability scoring for AI agents.

Colin Gibbons-Fly