Profile of nahuelgiudizi

2 projects

llm-benchmark-toolkit

Last released Dec 5, 2025

Benchmark LLMs with 10 benchmarks & 132K+ questions. 8 providers: OpenAI, Anthropic, Groq, Together, Fireworks, DeepSeek, Ollama, HuggingFace. Unified CLI + Web dashboard.

ai-safety-tester

Last released Dec 1, 2025

LLM security testing framework with CVE-style severity scoring and multi-model benchmarking

Nahuel Giudizi
