Skip to main content

Platform-agnostic, extensible AI-powered ML Development Assistant

Project description

Corvus - ML Development Assistant

Test Lint Security Python 3.10+ Version

Platform-agnostic, extensible AI-powered ML Development Assistant

Overview

Corvus is built on three pillars:

  1. Universal Skills Framework - 24 ML skill functions across 24 modules for data engineering, model training, deployment, observability, and ML advisory
  2. MCP Integration - 6 MCP server integrations for GitHub, PostgreSQL, Filesystem, MLflow, Docker, and Cloud Storage
  3. Knowledge Base - Hybrid retrieval (BM25 + Vector + RRF), GraphRAG with Neo4j, persistent memory with Mem0
  4. AI Safety - Defense-in-depth security with prompt injection defense, DLP, grounding validation, and quality monitoring

Features

Skills Library (24 Skills)

Category Skills
Data Engineering data-profiler, feature-recommender, pipeline-generator, schema-designer, validation-generator
Model Training model-selector, architecture-recommender, pytorch-codegen, experiment-tracker, distributed-config, hyperparameter-optimizer
Deployment endpoint-generator, docker-generator, terraform-generator, k8s-manifest-generator, cicd-generator, quantization-exporter
Observability monitoring-setup, drift-detector, dashboard-creator
ML Advisory ml-spec-generator, experiment-advisor
Model Hub huggingface-skills, github-ml-skills

MCP Integrations (6 Servers)

Server Capabilities
GitHub Repository access, PRs, issues, commits
PostgreSQL Database queries, schema management
Filesystem File operations, directory traversal
MLflow Experiment tracking, model registry
Docker Container management, image operations
S3/GCS Cloud storage operations

Knowledge Base

Component Description
Qdrant Vector Store Semantic search with voyage-code-3 embeddings
Hybrid Retrieval BM25 + Vector search with RRF fusion
Cross-Encoder Reranking Precision improvement for retrieval
Neo4j GraphRAG Knowledge graph with entity extraction
Mem0 Memory Persistent memory across sessions

AI Safety & Guardrails

Component Purpose
Prompt Guard Injection detection and input validation
Tool Guard MCP security and sandboxing
Memory Guard Memory injection prevention
Grounding Validator Hallucination prevention with citations
DLP Scanner PII/credential detection and redaction
Quality Monitor Output quality and drift detection

Quick Start

# Clone
git clone https://github.com/CloudlyIO/corvus.git
cd corvus

# Install with all dependencies
pip install -e ".[all]"

# Install pre-commit hooks
pre-commit install --hook-type pre-commit --hook-type commit-msg

# Run tests
make test

Usage Examples

Using Skills

import asyncio
from corvus.skills.library.model_selector import select_model
from corvus.skills.library.pytorch_codegen import generate_pytorch_code

async def main():
    # Select a model for your task
    result = await select_model(
        task_type="classification",
        data_characteristics={
            "num_samples": 10000,
            "num_features": 50,
            "num_classes": 3,
        },
    )
    print(f"Recommended: {result['recommended_model']}")

    # Generate PyTorch training code
    code = await generate_pytorch_code(
        model_type="mlp_classifier",
        input_features=50,
        output_classes=3,
        hidden_layers=[128, 64],
    )
    print(code["model_code"])

asyncio.run(main())

Using Safety Features

from corvus.safety import (
    validate_input,
    create_dlp_scanner,
    assess_quality,
)

# Validate user input for injection attempts
result = validate_input("Help me build a classifier")
if result.is_safe:
    print("Input is safe")

# Scan for sensitive data
scanner = create_dlp_scanner()
dlp_result = scanner.scan_content("Contact: john@example.com")
if dlp_result.pii_count > 0:
    print("PII detected!")

# Assess response quality
quality = assess_quality(
    "Here's a detailed explanation with examples...",
    prompt="Explain gradient descent"
)
print(f"Quality score: {quality.overall_score:.2f}")

Running the E2E Demo

# Run the complete ML workflow demo
python3 demos/e2e/ml_workflow_demo.py

Development

# Testing (TDD)
make test           # Unit tests
make test-all       # All tests
make test-cov       # With coverage

# Evaluation (EDD)
make benchmark      # Run benchmarks

# Quality
make lint           # Ruff linting
make format         # Black formatting
make quality        # Lint + typecheck

Project Structure

corvus/
├── src/corvus/              # Main package
│   ├── core/              # Config, exceptions
│   ├── skills/            # Universal skills framework (24 skills)
│   ├── llm/               # LLM gateway (LiteLLM)
│   ├── agents/            # Agent orchestration (LangGraph)
│   ├── knowledge/         # Knowledge base (RAG/GraphRAG/Memory)
│   ├── mcp/               # MCP integrations (6 servers)
│   └── safety/            # AI safety & guardrails (6 components)
├── tests/                 # Test suite
│   ├── unit/              # 10,168+ tests
│   ├── integration/       # integration tests
│   └── e2e/               # End-to-end tests
├── benchmarks/            # Performance benchmarks
├── demos/                 # Executable examples
│   ├── skills/            # Individual skill demos
│   ├── knowledge/         # Knowledge base demos
│   ├── mcp/               # MCP integration demos
│   ├── safety/            # Safety feature demos
│   └── e2e/               # End-to-end workflow demos
└── docs/                  # Documentation

Documentation

Document Description
Documentation Index All guides, API reference, architecture docs
Status Current project status
Architecture Technical architecture
Contributing Developer workflow
Changelog Version history

Configuration

# Core
CORVUS_DEBUG=true|false
CORVUS_LOG_LEVEL=INFO|DEBUG|WARNING|ERROR

# LLM
CORVUS_LLM_DEFAULT_PROVIDER=anthropic|openai
CORVUS_LLM_DEFAULT_MODEL=claude-sonnet-4-20250514

# Knowledge Base
CORVUS_KNOWLEDGE_VECTOR_STORE=qdrant
CORVUS_KNOWLEDGE_EMBEDDING_MODEL=voyage-code-3

Technology Stack

Layer Technology
Agent Orchestration LangGraph
LLM Gateway LiteLLM
Primary LLM Claude Sonnet 4 / Opus 4.5
Vector Database Qdrant
Knowledge Graph Neo4j
Code Embeddings voyage-code-3
Text Embeddings e5-large-v2
Memory System Mem0
Tool Integration MCP

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corvus_ai-0.3.78.tar.gz (790.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

corvus_ai-0.3.78-py3-none-any.whl (869.0 kB view details)

Uploaded Python 3

File details

Details for the file corvus_ai-0.3.78.tar.gz.

File metadata

  • Download URL: corvus_ai-0.3.78.tar.gz
  • Upload date:
  • Size: 790.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for corvus_ai-0.3.78.tar.gz
Algorithm Hash digest
SHA256 7e9056f5c412c3d52913a2314ef5915eb8f6f65f9628c8b47f3fa4d2bec4b826
MD5 5e5477c2d431078891083c1a020799ad
BLAKE2b-256 b8b485aa3036035cfb3961451ba3eda6b05a55ef8461fed2c8d24117d4a3191d

See more details on using hashes here.

Provenance

The following attestation bundles were made for corvus_ai-0.3.78.tar.gz:

Publisher: release.yml on CloudlyIO/corvus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file corvus_ai-0.3.78-py3-none-any.whl.

File metadata

  • Download URL: corvus_ai-0.3.78-py3-none-any.whl
  • Upload date:
  • Size: 869.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for corvus_ai-0.3.78-py3-none-any.whl
Algorithm Hash digest
SHA256 cba127d097185a1b2ff7afd3c93122f31ec41ae0bfa01fb761e836a963ea5766
MD5 0c932d80fb1f8304ed5d4a7cbfe4e607
BLAKE2b-256 197b3e616b587bd384edd3a83cf46e5ab0f0c98c2fbbff083259fc3ce3ce5d99

See more details on using hashes here.

Provenance

The following attestation bundles were made for corvus_ai-0.3.78-py3-none-any.whl:

Publisher: release.yml on CloudlyIO/corvus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page