Skip to main content

The missing data layer for AI agents - Auto-validates data quality with one decorator. Works with any framework.

Project description

ADRI - Agent Data Readiness Index

AI agents break on bad data. ADRI fixes that with one decorator.

from adri import adri_protected

@adri_protected(standard="customer_data", data_param="data")
def process_customers(data):
    # Your agent logic here
    return results

Auto-validates data quality. Works with any framework. 2 minutes to integrate.


How ADRI Works

flowchart LR
    A[Your Function Called] --> B[๐Ÿ›ก๏ธ ADRI Intercepts]
    B --> C{Quality Check<br/>5 Dimensions}
    C -->|Score โ‰ฅ 80| D[โœ… ALLOW<br/>Function Runs]
    C -->|Score < 80| E[โŒ BLOCK<br/>Error Raised]
    D --> F[๐Ÿ“‹ Log Results]
    E --> F

    style A fill:#e3f2fd,stroke:#2196f3,stroke-width:2px
    style B fill:#fff3e0,stroke:#ff9800,stroke-width:3px
    style C fill:#f3e5f5,stroke:#9c27b0,stroke-width:2px
    style D fill:#e8f5e9,stroke:#4caf50,stroke-width:2px
    style E fill:#ffebee,stroke:#f44336,stroke-width:2px
    style F fill:#fafafa,stroke:#757575,stroke-width:1px

In plain English: ADRI sits between your code and its data, checking quality before letting data through. Good data passes, bad data gets blocked.


What is ADRI?

ADRI is the missing data layer for AI agents. It protects your AI agent workflows from bad data by:

  • Auto-validating data quality across 5 dimensions (validity, completeness, consistency, accuracy, timeliness)
  • Auto-generating quality standards on first successful run - no manual configuration
  • Blocking or warning on quality failures based on your preference
  • Logging insights locally for debugging and development

Framework agnostic: Works with LangChain, CrewAI, AutoGen, LlamaIndex, Haystack, Semantic Kernel, and any Python function.

Why ADRI?

AI agents are powerful, but fragile. One malformed field or missing value can crash your entire workflow. Traditional validation is tedious - you write dozens of if statements, manually check types, and hope you caught everything.

ADRI learns what good data looks like and enforces it automatically. Add one decorator, run with good data once, and you're protected.

Complete Example

from adri import adri_protected
import pandas as pd

@adri_protected(standard="customer_data", data_param="customer_data")
def analyze_customers(customer_data):
    """Your AI agent logic."""
    print(f"Analyzing {len(customer_data)} customers")
    return {"status": "complete"}

# First run with good data
customers = pd.DataFrame({
    "id": [1, 2, 3],
    "email": ["user1@example.com", "user2@example.com", "user3@example.com"],
    "signup_date": ["2024-01-01", "2024-01-02", "2024-01-03"]
})

analyze_customers(customers)  # โœ… Runs, auto-generates standard

What happened:

  1. Function executed successfully
  2. ADRI analyzed the data structure
  3. Generated quality standard in ADRI/dev/standards/customer_data.yaml
  4. Future runs validate against this standard

Future runs with bad data:

bad_customers = pd.DataFrame({
    "id": [1, 2, None],  # Missing ID
    "email": ["user1@example.com", "invalid-email", "user3@example.com"],  # Bad email
    # Missing signup_date column
})

analyze_customers(bad_customers)  # โŒ Raises exception with quality report

Installation

pip install adri

Requirements: Python 3.10+


โญ Star us if ADRI saves you time - it helps us grow!


Quick Links

Features

๐ŸŽฏ One Decorator, Complete Protection

@adri_protected(standard="your_data", data_param="data")
def your_function(data):
    return results

๐Ÿค– Framework Agnostic

Works with any AI agent framework:

  • LangChain & LangGraph
  • CrewAI
  • AutoGen
  • LlamaIndex
  • Haystack
  • Semantic Kernel
  • Generic Python

๐Ÿ“Š Five Quality Dimensions

ADRI validates:

  1. Validity - Data types and formats
  2. Completeness - Required fields present
  3. Consistency - Cross-field relationships
  4. Accuracy - Value ranges and patterns
  5. Timeliness - Data freshness

๐Ÿ”„ Auto-Generation

No manual configuration. ADRI learns from your data:

  • Runs successfully with good data โ†’ generates standard
  • Future runs โ†’ validates against standard
  • Customize generated standards as needed

๐Ÿ›ก๏ธ Protection Modes

# Raise mode (default) - raises exception
@adri_protected(standard="data", data_param="data", on_failure="raise")

# Warn mode - logs warning, continues
@adri_protected(standard="data", data_param="data", on_failure="warn")

# Continue mode - silently continues
@adri_protected(standard="data", data_param="data", on_failure="continue")

๐Ÿ”ง CLI Tools

adri guide                                      # Interactive walkthrough (recommended for first-time users)
adri setup                                      # Initialize ADRI
adri generate-standard data.json                # Generate standard
adri assess data.csv --standard my_standard     # Assess data quality
adri list-standards                             # List standards
adri validate-standard my_standard.yaml         # Validate standard

๐Ÿ“ Local Logging

Developer-friendly insights during development:

  • Quality scores and assessments
  • Dimension-specific failures
  • Auto-generated standards
  • Stored in ADRI/dev/logs/

Common Use Cases

API Data Validation

@adri_protected(standard="api_response", data_param="response")
def process_api_data(response):
    return transform(response)

What it protects: API response data structure Sample data: examples/data/api_response.json Use when: Validating third-party API responses before processing

Multi-Agent Workflows

@adri_protected(standard="crew_context", data_param="context")
def crew_task(context):
    return crew.kickoff(context)

What it protects: Agent communication context Sample data: examples/data/crew_context.json Use when: Coordinating multi-agent workflows (CrewAI, AutoGen, custom)

RAG Pipelines

@adri_protected(standard="documents", data_param="docs")
def index_documents(docs):
    return index.insert(docs)

What it protects: Document structure before indexing Sample data: examples/data/rag_documents.json Use when: Validating documents before vector store indexing (LlamaIndex, Haystack)

Note: ADRI validates data structure, not content. For RAG, it ensures each document has required fields (id, text, metadata) and correct types, preventing indexing failures from malformed data.

Got a different use case? Share your story or contribute a standard - help the community!

๐Ÿ“š Don't Start from Scratch - Use Catalog Standards

13 battle-tested standards ready to copy and use - No need to write validation rules from scratch.

Business Domains

AI Frameworks

Generic Templates

๐Ÿ“– Full Catalog | Can't find your use case? Add it! - Takes 15 minutes, helps everyone.

๐Ÿค Share Your Standards

Built something with ADRI? Your standard could help hundreds of engineers.

  1. Use ADRI on your data
  2. Polish your standard
  3. Submit a PR - Contribution guide

Why contribute?

  • ๐ŸŽฏ Get featured in the catalog
  • ๐Ÿ’ฌ Connect with others in your domain
  • ๐Ÿš€ Help the community solve similar problems

Start Contributing | Discuss Ideas

Development

# Clone repository
git clone https://github.com/adri-standard/adri.git
cd adri

# Install in development mode
pip install -e .

# Run tests
pytest

# Run linters
flake8 src/
black src/

See CONTRIBUTING.md for contribution guidelines.

License

Apache 2.0 License. See LICENSE for details.

Community & Support

Get Help

Connect with the Team

Support This Project

If ADRI saves you time, show your support:

  • โญ Star on GitHub - Helps us reach more developers
  • ๐Ÿ—ฃ๏ธ Share your wins: "Solved [your use case] with #ADRI #AIAgents"
  • ๐Ÿค Contribute a standard: Your use case could help hundreds - Guide
  • ๐Ÿ’ฌ Discuss ideas: Propose new standards or share what you're building

One decorator. Any framework. Reliable agents.

Built with โค๏ธ by Thomas Russell at Verodat

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

adri-7.0.0.tar.gz (827.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

adri-7.0.0-py3-none-any.whl (303.3 kB view details)

Uploaded Python 3

File details

Details for the file adri-7.0.0.tar.gz.

File metadata

  • Download URL: adri-7.0.0.tar.gz
  • Upload date:
  • Size: 827.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for adri-7.0.0.tar.gz
Algorithm Hash digest
SHA256 7b645cfa601a760d8b88bc738e6e9133673a4117ac43ec6be374ab9cdefb92f9
MD5 cfbd51a2ab226b59ffbe555ae3c8239c
BLAKE2b-256 b0ace41250a2aa9620179ad63c9195ec8a482043591d2a6202084af7dbfbca47

See more details on using hashes here.

File details

Details for the file adri-7.0.0-py3-none-any.whl.

File metadata

  • Download URL: adri-7.0.0-py3-none-any.whl
  • Upload date:
  • Size: 303.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for adri-7.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 080a2d43efce641cf8a8f138bc62e6fc35b4d07d73855faa53209363af803a9b
MD5 78f7d75d4b0c49e070972cea3a217f32
BLAKE2b-256 cc7e171d151df7ddff034849fb8fd4e5c7138d8aeef73fed141d97f3632165ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page