Skip to main content

The missing data layer for AI agents - Auto-validates data quality with one decorator. Works with any framework.

Project description

ADRI - Agent Data Readiness Index

AI agents break on bad data. ADRI fixes that with one decorator.

from adri import adri_protected

@adri_protected(standard="customer_data", data_param="data")
def process_customers(data):
    # Your agent logic here
    return results

Auto-validates data quality. Works with any framework. 2 minutes to integrate.


How ADRI Works

flowchart LR
    A[Your Function Called] --> B[๐Ÿ›ก๏ธ ADRI Intercepts]
    B --> C{Quality Check<br/>5 Dimensions}
    C -->|Score โ‰ฅ 80| D[โœ… ALLOW<br/>Function Runs]
    C -->|Score < 80| E[โŒ BLOCK<br/>Error Raised]
    D --> F[๐Ÿ“‹ Log Results]
    E --> F

    style A fill:#e3f2fd,stroke:#2196f3,stroke-width:2px
    style B fill:#fff3e0,stroke:#ff9800,stroke-width:3px
    style C fill:#f3e5f5,stroke:#9c27b0,stroke-width:2px
    style D fill:#e8f5e9,stroke:#4caf50,stroke-width:2px
    style E fill:#ffebee,stroke:#f44336,stroke-width:2px
    style F fill:#fafafa,stroke:#757575,stroke-width:1px

In plain English: ADRI sits between your code and its data, checking quality before letting data through. Good data passes, bad data gets blocked.


What is ADRI?

ADRI is the missing data layer for AI agents. It protects your AI agent workflows from bad data by:

  • Auto-validating data quality across 5 dimensions (validity, completeness, consistency, accuracy, timeliness)
  • Auto-generating quality standards on first successful run - no manual configuration
  • Blocking or warning on quality failures based on your preference
  • Logging insights locally for debugging and development

Framework agnostic: Works with LangChain, CrewAI, AutoGen, LlamaIndex, Haystack, Semantic Kernel, and any Python function.

Why ADRI?

AI agents are powerful, but fragile. One malformed field or missing value can crash your entire workflow. Traditional validation is tedious - you write dozens of if statements, manually check types, and hope you caught everything.

ADRI learns what good data looks like and enforces it automatically. Add one decorator, run with good data once, and you're protected.

Complete Example

from adri import adri_protected
import pandas as pd

@adri_protected(standard="customer_data", data_param="customer_data")
def analyze_customers(customer_data):
    """Your AI agent logic."""
    print(f"Analyzing {len(customer_data)} customers")
    return {"status": "complete"}

# First run with good data
customers = pd.DataFrame({
    "id": [1, 2, 3],
    "email": ["user1@example.com", "user2@example.com", "user3@example.com"],
    "signup_date": ["2024-01-01", "2024-01-02", "2024-01-03"]
})

analyze_customers(customers)  # โœ… Runs, auto-generates standard

What happened:

  1. Function executed successfully
  2. ADRI analyzed the data structure
  3. Generated quality standard in ADRI/dev/standards/customer_data.yaml
  4. Future runs validate against this standard

Future runs with bad data:

bad_customers = pd.DataFrame({
    "id": [1, 2, None],  # Missing ID
    "email": ["user1@example.com", "invalid-email", "user3@example.com"],  # Bad email
    # Missing signup_date column
})

analyze_customers(bad_customers)  # โŒ Raises exception with quality report

Installation

pip install adri

Requirements: Python 3.10+


โญ If ADRI helps your project, star us on GitHub - takes 2 seconds, helps us grow!


Quick Links

Features

๐ŸŽฏ One Decorator, Complete Protection

@adri_protected(standard="your_data", data_param="data")
def your_function(data):
    return results

๐Ÿค– Framework Agnostic

Works with any AI agent framework:

  • LangChain & LangGraph
  • CrewAI
  • AutoGen
  • LlamaIndex
  • Haystack
  • Semantic Kernel
  • Generic Python

๐Ÿ“Š Five Quality Dimensions

ADRI validates:

  1. Validity - Data types and formats
  2. Completeness - Required fields present
  3. Consistency - Cross-field relationships
  4. Accuracy - Value ranges and patterns
  5. Timeliness - Data freshness

๐Ÿ”„ Auto-Generation

No manual configuration. ADRI learns from your data:

  • Runs successfully with good data โ†’ generates standard
  • Future runs โ†’ validates against standard
  • Customize generated standards as needed

๐Ÿ›ก๏ธ Protection Modes

# Raise mode (default) - raises exception
@adri_protected(standard="data", data_param="data", on_failure="raise")

# Warn mode - logs warning, continues
@adri_protected(standard="data", data_param="data", on_failure="warn")

# Continue mode - silently continues
@adri_protected(standard="data", data_param="data", on_failure="continue")

๐Ÿ”ง CLI Tools

adri setup                                      # Initialize ADRI
adri generate-standard data.json                # Generate standard
adri assess data.csv --standard my_standard     # Assess data quality
adri list-standards                             # List standards
adri validate-standard my_standard.yaml         # Validate standard

๐Ÿ“ Local Logging

Developer-friendly insights during development:

  • Quality scores and assessments
  • Dimension-specific failures
  • Auto-generated standards
  • Stored in ADRI/dev/logs/

Common Use Cases

API Data Validation

@adri_protected(standard="api_response", data_param="response")
def process_api_data(response):
    return transform(response)

What it protects: API response data structure
Sample data: examples/data/api_response.json
Use when: Validating third-party API responses before processing

Multi-Agent Workflows

@adri_protected(standard="crew_context", data_param="context")
def crew_task(context):
    return crew.kickoff(context)

What it protects: Agent communication context
Sample data: examples/data/crew_context.json
Use when: Coordinating multi-agent workflows (CrewAI, AutoGen, custom)

RAG Pipelines

@adri_protected(standard="documents", data_param="docs")
def index_documents(docs):
    return index.insert(docs)

What it protects: Document structure before indexing
Sample data: examples/data/rag_documents.json
Use when: Validating documents before vector store indexing (LlamaIndex, Haystack)

Note: ADRI validates data structure, not content. For RAG, it ensures each document has required fields (id, text, metadata) and correct types, preventing indexing failures from malformed data.

Got a different use case? Share your story or contribute a standard - help the community!

๐Ÿ“š Don't Start from Scratch - Use Catalog Standards

13 battle-tested standards ready to copy and use - No need to write validation rules from scratch.

Business Domains

AI Frameworks

Generic Templates

๐Ÿ“– Full Catalog | Can't find your use case? Add it! - Takes 15 minutes, helps everyone.

๐Ÿค Share Your Standards

Built something with ADRI? Your standard could help hundreds of engineers.

  1. Use ADRI on your data
  2. Polish your standard
  3. Submit a PR - Contribution guide

Why contribute?

  • ๐ŸŽฏ Get featured in the catalog
  • ๐Ÿ’ฌ Connect with others in your domain
  • ๐Ÿš€ Help the community solve similar problems

Start Contributing | Discuss Ideas

Scaling to Production

Open-source ADRI gives you everything to build reliable agents - local validation, auto-generation, comprehensive logging.

When you're ready to scale, Verodat Enterprise adds centralized infrastructure:

  • Team Dashboards - Monitor data quality across all agents and teams
  • Workflow Replay - Reproduce any agent run with complete data context
  • Data Orchestration - Managed data feeds for production reliability
  • Compliance Tools - Audit trails and provenance for regulated industries

ADRI is 100% functional open-source. Enterprise just makes collaboration and compliance easier at scale.

Learn more or contact us for production deployments.

Development

# Clone repository
git clone https://github.com/adri-standard/adri.git
cd adri

# Install in development mode
pip install -e .

# Run tests
pytest

# Run linters
flake8 src/
black src/

See CONTRIBUTING.md for contribution guidelines.

License

Apache 2.0 License. See LICENSE for details.

Community & Support

Get Help

Connect with the Team

Support This Project

If ADRI saves you time, show your support:

  • โญ Star on GitHub - Takes 2 seconds, helps us reach more developers
  • ๐Ÿ—ฃ๏ธ Share your wins: "Solved [your use case] with #ADRI #AIAgents"
  • ๐Ÿค Contribute a standard: Your use case could help hundreds - Guide
  • ๐Ÿ’ฌ Discuss ideas: Propose new standards or share what you're building

One decorator. Any framework. Reliable agents.

Built with โค๏ธ by Thomas Russell at Verodat

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adri-5.0.1.tar.gz (829.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

adri-5.0.1-py3-none-any.whl (236.7 kB view details)

Uploaded Python 3

File details

Details for the file adri-5.0.1.tar.gz.

File metadata

  • Download URL: adri-5.0.1.tar.gz
  • Upload date:
  • Size: 829.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for adri-5.0.1.tar.gz
Algorithm Hash digest
SHA256 e4f1826b394eee28ca73869ff750a8d80e9b763ca739a2a6f025bd202ada9e6d
MD5 af6658d70107deb057a68c6f677eff53
BLAKE2b-256 83314a97c85fb036682abe5f1507277c2aa3038d54d50ffc68938a1d568cf41f

See more details on using hashes here.

File details

Details for the file adri-5.0.1-py3-none-any.whl.

File metadata

  • Download URL: adri-5.0.1-py3-none-any.whl
  • Upload date:
  • Size: 236.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for adri-5.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f960d67cfb487457d5f6b89e303bfd3633d2fe3f64b40f35fc50a66b1c8bd12d
MD5 f77d57e54c9dfbaa2286985eb5669beb
BLAKE2b-256 f5b96e5e6d60c2ead07b918e10295595d2eab94d6984a0a3219b1aaee34da01c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page