Skip to main content

Platform-agnostic, extensible AI-powered ML Development Assistant

Project description

Corvus - ML Development Assistant

Test Lint Security Python 3.10+ Version

Platform-agnostic, extensible AI-powered ML Development Assistant

Overview

Corvus is built on three pillars:

  1. Universal Skills Framework - 24 ML skill functions across 24 modules for data engineering, model training, deployment, observability, and ML advisory
  2. MCP Integration - 6 MCP server integrations for GitHub, PostgreSQL, Filesystem, MLflow, Docker, and Cloud Storage
  3. Knowledge Base - Hybrid retrieval (BM25 + Vector + RRF), GraphRAG with Neo4j, persistent memory with Mem0
  4. AI Safety - Defense-in-depth security with prompt injection defense, DLP, grounding validation, and quality monitoring

Features

Skills Library (24 Skills)

Category Skills
Data Engineering data-profiler, feature-recommender, pipeline-generator, schema-designer, validation-generator
Model Training model-selector, architecture-recommender, pytorch-codegen, experiment-tracker, distributed-config, hyperparameter-optimizer
Deployment endpoint-generator, docker-generator, terraform-generator, k8s-manifest-generator, cicd-generator, quantization-exporter
Observability monitoring-setup, drift-detector, dashboard-creator
ML Advisory ml-spec-generator, experiment-advisor
Model Hub huggingface-skills, github-ml-skills

MCP Integrations (6 Servers)

Server Capabilities
GitHub Repository access, PRs, issues, commits
PostgreSQL Database queries, schema management
Filesystem File operations, directory traversal
MLflow Experiment tracking, model registry
Docker Container management, image operations
S3/GCS Cloud storage operations

Knowledge Base

Component Description
Qdrant Vector Store Semantic search with voyage-code-3 embeddings
Hybrid Retrieval BM25 + Vector search with RRF fusion
Cross-Encoder Reranking Precision improvement for retrieval
Neo4j GraphRAG Knowledge graph with entity extraction
Mem0 Memory Persistent memory across sessions

AI Safety & Guardrails

Component Purpose
Prompt Guard Injection detection and input validation
Tool Guard MCP security and sandboxing
Memory Guard Memory injection prevention
Grounding Validator Hallucination prevention with citations
DLP Scanner PII/credential detection and redaction
Quality Monitor Output quality and drift detection

Quick Start

Install from PyPI:

pip install corvus-ai
corvus --version

Or clone for development:

git clone https://github.com/CloudlyIO/corvus.git
cd corvus

# Install with all dependencies
pip install -e ".[all]"

# Install pre-commit hooks
pre-commit install --hook-type pre-commit --hook-type commit-msg

# Run tests
make test

Usage Examples

Using Skills

import asyncio
from corvus.skills.library.model_selector import select_model
from corvus.skills.library.pytorch_codegen import generate_pytorch_code

async def main():
    # Select a model for your task
    result = await select_model(
        task_type="classification",
        data_characteristics={
            "num_samples": 10000,
            "num_features": 50,
            "num_classes": 3,
        },
    )
    print(f"Recommended: {result['recommended_model']}")

    # Generate PyTorch training code
    code = await generate_pytorch_code(
        model_type="mlp_classifier",
        input_features=50,
        output_classes=3,
        hidden_layers=[128, 64],
    )
    print(code["model_code"])

asyncio.run(main())

Using Safety Features

from corvus.safety import (
    validate_input,
    create_dlp_scanner,
    assess_quality,
)

# Validate user input for injection attempts
result = validate_input("Help me build a classifier")
if result.is_safe:
    print("Input is safe")

# Scan for sensitive data
scanner = create_dlp_scanner()
dlp_result = scanner.scan_content("Contact: john@example.com")
if dlp_result.pii_count > 0:
    print("PII detected!")

# Assess response quality
quality = assess_quality(
    "Here's a detailed explanation with examples...",
    prompt="Explain gradient descent"
)
print(f"Quality score: {quality.overall_score:.2f}")

Running the E2E Demo

# Run the complete ML workflow demo
python3 demos/e2e/ml_workflow_demo.py

Development

# Testing (TDD)
make test           # Unit tests
make test-all       # All tests
make test-cov       # With coverage

# Evaluation (EDD)
make benchmark      # Run benchmarks

# Quality
make lint           # Ruff linting
make format         # Black formatting
make quality        # Lint + typecheck

Project Structure

corvus/
├── src/corvus/              # Main package
│   ├── core/              # Config, exceptions
│   ├── skills/            # Universal skills framework (24 skills)
│   ├── llm/               # LLM gateway (LiteLLM)
│   ├── agents/            # Agent orchestration (LangGraph)
│   ├── knowledge/         # Knowledge base (RAG/GraphRAG/Memory)
│   ├── mcp/               # MCP integrations (6 servers)
│   └── safety/            # AI safety & guardrails (6 components)
├── tests/                 # Test suite
│   ├── unit/              # 10,168+ tests
│   ├── integration/       # integration tests
│   └── e2e/               # End-to-end tests
├── benchmarks/            # Performance benchmarks
├── demos/                 # Executable examples
│   ├── skills/            # Individual skill demos
│   ├── knowledge/         # Knowledge base demos
│   ├── mcp/               # MCP integration demos
│   ├── safety/            # Safety feature demos
│   └── e2e/               # End-to-end workflow demos
└── docs/                  # Documentation

Documentation

Document Description
Documentation Index All guides, API reference, architecture docs
Status Current project status
Architecture Technical architecture
Contributing Developer workflow
Changelog Version history

Configuration

# Core
CORVUS_DEBUG=true|false
CORVUS_LOG_LEVEL=INFO|DEBUG|WARNING|ERROR

# LLM
CORVUS_LLM_DEFAULT_PROVIDER=anthropic|openai
CORVUS_LLM_DEFAULT_MODEL=claude-sonnet-4-20250514

# Knowledge Base
CORVUS_KNOWLEDGE_VECTOR_STORE=qdrant
CORVUS_KNOWLEDGE_EMBEDDING_MODEL=voyage-code-3

Technology Stack

Layer Technology
Agent Orchestration LangGraph
LLM Gateway LiteLLM
Primary LLM Claude Sonnet 4 / Opus 4.5
Vector Database Qdrant
Knowledge Graph Neo4j
Code Embeddings voyage-code-3
Text Embeddings e5-large-v2
Memory System Mem0
Tool Integration MCP

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

corvus_ai-0.3.79.tar.gz (790.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

corvus_ai-0.3.79-py3-none-any.whl (869.1 kB view details)

Uploaded Python 3

File details

Details for the file corvus_ai-0.3.79.tar.gz.

File metadata

  • Download URL: corvus_ai-0.3.79.tar.gz
  • Upload date:
  • Size: 790.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for corvus_ai-0.3.79.tar.gz
Algorithm Hash digest
SHA256 b16b21f7148233a489645c2bcad72aef2335ba92c6ee33b6f6223a0afa842d65
MD5 a2ee1901ce5fcf0e547ffc39cc4ea81a
BLAKE2b-256 f765775b9aeb75088d77a671f54c1d5ce574485769af2b3b4fad183447c39dbc

See more details on using hashes here.

Provenance

The following attestation bundles were made for corvus_ai-0.3.79.tar.gz:

Publisher: release.yml on CloudlyIO/corvus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file corvus_ai-0.3.79-py3-none-any.whl.

File metadata

  • Download URL: corvus_ai-0.3.79-py3-none-any.whl
  • Upload date:
  • Size: 869.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for corvus_ai-0.3.79-py3-none-any.whl
Algorithm Hash digest
SHA256 4893cd2a31fc0c21fb21646683681fb56735182fddc1b83a40c936a9557e6edd
MD5 b1a9aea8a8c14b560994624b89f95f9d
BLAKE2b-256 0f5e0741bfb20712440e7e868380b6a34583042956ccf8b1b6a6a7118f52db5a

See more details on using hashes here.

Provenance

The following attestation bundles were made for corvus_ai-0.3.79-py3-none-any.whl:

Publisher: release.yml on CloudlyIO/corvus

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page