Skip to main content

State-of-the-Art AI Benchmarking for CI/CD

Project description

Clyrdia CLI MVP

๐Ÿš€ Lean, powerful, and rapidly deployable AI quality gates for CI/CD

Note: This is the MVP version of Clyrdia CLI. For the full version, see the main repository.

Clyrdia MVP is a streamlined version focused exclusively on providing automated AI quality gates in CI/CD pipelines. This version eliminates complexity and focuses on the core value proposition: helping teams run clyrdia-cli benchmark inside a GitHub Action and see clear, valuable results.

๐ŸŽฏ MVP Focus

  • Single Use Case: Automated AI quality gates in CI/CD
  • Two Providers Only: OpenAI and Anthropic (production-ready models)
  • Two-Tier System: Developer (Free) and Business ($500/month)
  • Zero Complexity: No team management, no complex features, just benchmarking

๐Ÿš€ Quick Start

1. Installation

pip install clyrdia-cli

2. Authentication

clyrdia-cli login

3. Initialize Benchmark

clyrdia-cli init --name "My CI/CD Quality Gate"

4. Set API Keys

export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"

5. Run Benchmark

clyrdia-cli benchmark

6. View Results

clyrdia-cli dashboard

๐Ÿ“‹ Available Commands

Authentication

  • login - Authenticate with API key
  • logout - Remove authentication
  • status - Show account status

Core Benchmarking

  • init - Initialize benchmark configuration
  • benchmark - Run AI benchmark tests
  • models - List available models
  • compare - Compare two models

Results & Dashboard

  • dashboard - Start local dashboard
  • dashboard-status - Check dashboard status

Management

  • cache - Manage result cache
  • tutorial - Show quick start guide
  • version - Show version info
  • commands - Show command reference

CI/CD Integration

  • cicd generate - Generate CI/CD templates
  • cicd platforms - List CI/CD platforms
  • cicd test - Test CI/CD functionality

๐Ÿค– Supported Models

OpenAI

  • gpt-4o - Flagship multimodal model
  • gpt-4o-mini - Fast, cost-effective model
  • gpt-4o-2024-08-01 - Specific version

Anthropic

  • claude-3-5-sonnet-20241022 - Balanced performance
  • claude-3-5-haiku-20241022 - Fast and efficient
  • claude-3-opus-20240229 - Most capable model

๐Ÿ’ฐ Pricing

  • Developer: Free - 100 credits/month
  • Business: $500/month - 25,000 credits/month + CI/CD features

๐Ÿ”ง CI/CD Integration

Generate GitHub Actions workflow:

clyrdia-cli cicd generate --platform github-actions

This creates a workflow that:

  • Runs on every push and PR
  • Executes AI quality gates
  • Fails the build if quality thresholds aren't met
  • Uploads results as artifacts

๐Ÿ“Š Dashboard

The local dashboard provides:

  • Real-time metrics and analytics
  • Model performance comparison
  • Cost analysis and optimization
  • Historical trend analysis
  • Detailed result inspection

๐Ÿ—๏ธ Architecture

The MVP uses a simplified architecture:

clyrdia/
โ”œโ”€โ”€ core/           # Consolidated core logic
โ”‚   โ”œโ”€โ”€ licensing.py    # Authentication & credits
โ”‚   โ”œโ”€โ”€ benchmarking.py # Benchmark execution
โ”‚   โ”œโ”€โ”€ providers.py    # OpenAI & Anthropic only
โ”‚   โ”œโ”€โ”€ evaluator.py    # Quality assessment
โ”‚   โ”œโ”€โ”€ caching.py      # Result caching
โ”‚   โ”œโ”€โ”€ database.py     # SQLite storage
โ”‚   โ””โ”€โ”€ models.py       # All data classes
โ”œโ”€โ”€ cli_mvp.py      # MVP CLI implementation
โ”œโ”€โ”€ dashboard.py    # Local dashboard
โ””โ”€โ”€ config.py       # Configuration

๐ŸŽฏ Key Simplifications

  1. Consolidated Core: All core logic in single core/ directory
  2. Two Providers Only: OpenAI and Anthropic (no Google, Cohere, etc.)
  3. Two-Tier System: Developer and Business only (no Pro tier)
  4. Essential Commands: Only commands that directly support CI/CD quality gates
  5. Simplified Database: No complex team management tables
  6. Decoupled Dashboard: No Node.js process management

๐Ÿš€ Getting to $100k MRR

This MVP is designed for rapid deployment and customer acquisition:

  1. Fast Testing: Minimal surface area for bugs
  2. Clear Value: Focused on one high-value use case
  3. Easy Sales: Simple two-tier pricing
  4. Quick Onboarding: Streamlined user experience
  5. CI/CD Native: Built for the target market

๐Ÿ“ˆ Next Steps

  1. Deploy MVP to production
  2. Get first 10 paying customers
  3. Iterate based on feedback
  4. Add features only if they directly support the core use case
  5. Scale to $100k MRR

Built for speed. Built for value. Built for CI/CD.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clyrdia_cli-2.0.1.tar.gz (54.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

clyrdia_cli-2.0.1-py3-none-any.whl (55.9 kB view details)

Uploaded Python 3

File details

Details for the file clyrdia_cli-2.0.1.tar.gz.

File metadata

  • Download URL: clyrdia_cli-2.0.1.tar.gz
  • Upload date:
  • Size: 54.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for clyrdia_cli-2.0.1.tar.gz
Algorithm Hash digest
SHA256 183d46e50a4d8b77ecbabd7dc294854bc0a2f56e51d3c4d61e43abb1cce4e484
MD5 27d55d5e11974c8ddfe191863e72ad39
BLAKE2b-256 d65ee1e16409bab185a373e7613b2d15f554077b0288c86fa62fef30b3ba6bff

See more details on using hashes here.

File details

Details for the file clyrdia_cli-2.0.1-py3-none-any.whl.

File metadata

  • Download URL: clyrdia_cli-2.0.1-py3-none-any.whl
  • Upload date:
  • Size: 55.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.3

File hashes

Hashes for clyrdia_cli-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 88b74de3db36775fbe8fd67684db6ca35cfd4ce1e193af65ad037ac1ff4de6fc
MD5 1e552655e3a3d959ee27dee11c46ab68
BLAKE2b-256 775e3eacb6a5a2dfac641723850d8e6085b677dd48d354afe3139534dcbd3885

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page