Profile of krimvp · PyPI

Some features may not work without JavaScript. Please try enabling it if you encounter problems.

1 project

touchstone-eval

Last released Jun 28, 2026

Personal eval benchmark: compare model outcomes across swappable CLI-agent harnesses on custom tasks.

Supported by

AWS Cloud computing and Security Sponsor

Datadog Monitoring

Depot Continuous Integration

Google Download Analytics

Pingdom Monitoring

Sentry Error logging

StatusPage Status page