Reasoning Interface for Text-to-Analytics (RITA) - Natural language SQL and NoSQL (MongoDB) query interface powered by LangChain and LLMs

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

elircorajan

These details have not been verified by PyPI

Project description

Ask RITA (Reasoning Interface for Text-to-Analytics)

Ask what. Get answers. RITA turns a natural-language question into SQL, statistics, and insights — no code required.

Go beyond simple text-to-SQL. Ask RITA is an LLM-powered analytics framework that generates queries, runs scipy-backed statistical tests, conducts CRISP-DM research workflows, classifies data, and visualizes results — across SQL and NoSQL databases — from a single natural-language question.

🔒 IMPORTANT — Read-Only Database Access Required

AskRITA generates and executes SQL/NoSQL queries against your database. LLM-generated queries are inherently unpredictable. To prevent inadvertent writes, deletes, or schema changes:

Always connect with a read-only database user. Grant only SELECT (SQL) or find/aggregate (MongoDB) permissions. Never use credentials with INSERT, UPDATE, DELETE, DROP, or DDL privileges.

Do not rely on application-level safeguards alone. AskRITA includes prompt-injection detection and blocks known destructive patterns, but these are defence-in-depth measures — not substitutes for proper database permissions.

Store credentials in environment variables (${DB_USER}, ${DB_PASSWORD}), never in config files. See Configuration Guide.

The database user's granted permissions are the only reliable boundary between AskRITA and your data.

🚀 Four Powerful Workflows

📊 SQLAgentWorkflow - Natural Language to SQL

🗣️ Natural Language to SQL - Ask questions in plain English
💬 Conversational Queries - Follow-up questions with context awareness
🗄️ Multi-Database Support - PostgreSQL, MySQL, SQLite, SQL Server, BigQuery, Snowflake, IBM Db2
📊 Smart Visualization - Automatic chart recommendations
🔄 Error Recovery - Automatic SQL retry with error feedback

🍃 NoSQLAgentWorkflow - Natural Language to MongoDB

🗣️ Natural Language to MongoDB - Ask questions, get aggregation pipelines
🍃 MongoDB Support - mongodb:// and mongodb+srv:// (Atlas) connections
🛡️ Safety Validation - Blocks destructive operations, read-only analytics
🔄 Full Feature Parity - PII detection, visualization, follow-up questions, Chain-of-Thoughts

🔬 ResearchAgent - CRISP-DM Data Science Research

📋 CRISP-DM Methodology - Complete 6-phase data science workflow
🧪 Hypothesis Testing - Automated research question formulation and testing
📊 Real Statistics - scipy-powered t-tests, ANOVA, correlation, chi-square (not LLM-generated!)
📈 Effect Sizes - Cohen's d, η², Cramér's V with automatic interpretation
🎯 Actionable Insights - Data-driven recommendations with confidence levels

🏷️ DataClassificationWorkflow - LLM-Powered Data Processing

🖼️ Image Classification - AI extracts data directly from images (medical bills, invoices, documents)
📄 Excel/CSV Processing - Process large datasets with AI classification
🚀 API-First Design - Perfect for microservices with dynamic field definitions per request
🧠 Multi-Tenant Support - Different schemas per customer/organization without server restarts

📊 Model Performance Comparison (BIRD Benchmark)

BIRD Mini-Dev text-to-SQL execution accuracy (EX) across 500 questions, with oracle knowledge (evidence) enabled.

BIRD Benchmark Results

Model	Overall	Simple (148)	Moderate (250)	Challenging (102)
Gemini 2.5 Pro	64.4%	77.0%	61.2%	53.9%
Gemini 2.5 Flash	60.6%	76.3%	53.6%	54.9%
GPT-5.4	54.8%	68.9%	50.8%	44.1%
GPT-5.4 Mini	53.2%	70.3%	49.6%	37.2%
GPT-5.4 Nano	40.0%	53.4%	36.0%	30.4%
Gemini 2.5 Flash-Lite	39.4%	56.1%	33.2%	30.4%

Core Features

🤖 Multi-Cloud LLM Integration - OpenAI, Azure, Google Cloud Vertex AI, AWS Bedrock
⚙️ Configurable Workflows - Enable/disable steps, customize prompts, enhanced security options
🔒 Enterprise Security - Credential management, access controls, audit logging
🛡️ PII/PHI Detection - Automatic privacy protection with Microsoft Presidio analyzer
🏗️ Production Ready - Design pattern architecture, comprehensive logging, error handling, monitoring
🌐 Advanced BigQuery - Cross-project dataset access, 3-step validation, configurable access patterns
📊 Token Management - Built-in token utilities for cost optimization and LLM efficiency
🧪 Extensive Testing - Full test suite with quality assurance tools (550+ tests passing)
🔌 Type-Safe Integration - Exported Pydantic models for seamless downstream application integration

Quick Start

1. Install

pip install askrita

📋 More options: Installation Guide — pip, Poetry, from-source, development setup

2. Configure

export OPENAI_API_KEY="your-api-key-here"
cp example-configs/query-openai.yaml my-config.yaml

⚙️ Full reference: Configuration Guide

3. Use

from askrita import SQLAgentWorkflow, ConfigManager

config = ConfigManager("my-config.yaml")
workflow = SQLAgentWorkflow(config)
result = workflow.query("What are the top 10 customers by revenue?")
print(result['answer'])

NoSQL (MongoDB)

from askrita import NoSQLAgentWorkflow, ConfigManager

config = ConfigManager("mongodb-config.yaml")
workflow = NoSQLAgentWorkflow(config)
result = workflow.query("How many orders were placed last month?")
print(result.answer)

Research Agent - CRISP-DM Data Science

from askrita import ConfigManager
from askrita.research import ResearchAgent

config = ConfigManager("my-config.yaml")
research = ResearchAgent(config)

result = research.test_hypothesis(
    research_question="How does customer satisfaction differ across business lines?",
    hypothesis="Medicare members have higher NPS scores than Commercial members"
)

print(f"Conclusion: {result['conclusion']}")  # SUPPORTED, REFUTED, or INCONCLUSIVE
print(f"P-value: {result['key_metrics'].get('p_value')}")  # Real scipy computation

📖 All examples: Usage Examples & API Reference — conversational queries, data classification, exports, CLI, result format

⚠️ Important: Configuration file with LLM provider settings and prompts is always required. API keys are read from environment variables.

Type-Safe Integration

from askrita import (
    SQLAgentWorkflow, ConfigManager,
    UniversalChartData, ChartDataset, DataPoint, WorkflowState
)

result: WorkflowState = workflow.query("Show me sales by region")
chart = UniversalChartData(**result['chart_data'])

Supported Platforms

Databases: PostgreSQL, MySQL, SQLite, SQL Server, BigQuery, Snowflake, IBM DB2, MongoDB

LLM Providers: OpenAI, Azure OpenAI, Google Cloud Vertex AI, AWS Bedrock

📋 Connection strings, auth details, config templates: Supported Platforms

Configuration

Required Components

Component	Required	Description
🔑 LLM	✅ Yes	Provider, model + env variables
🗄️ Database	✅ Yes	Connection string
📝 Prompts	✅ Yes	All 5 workflow prompts

Quick Setup

export OPENAI_API_KEY="your-api-key-here"
cp example-configs/query-openai.yaml my-config.yaml

Configuration Templates

example-configs/query-openai.yaml           # OpenAI + PostgreSQL
example-configs/query-azure-openai.yaml     # Azure OpenAI
example-configs/query-snowflake.yaml        # Snowflake database
example-configs/query-mongodb.yaml          # MongoDB (NoSQL)
example-configs/example-zscaler-config.yaml # Corporate proxy setup
example-configs/data-classification-*.yaml  # Data processing workflows

📚 Complete reference: Configuration Guide

Corporate Proxy & SSL

llm:
  ca_bundle_path: "./credentials/zscaler-ca.pem"

📚 Full guide: CA Bundle Setup

MCP Server (for AI Assistants)

{
  "mcpServers": {
    "askrita": {
      "command": "askrita",
      "args": ["mcp", "--config", "/path/to/your/config.yaml"]
    }
  }
}

📖 Setup guide: Claude Desktop Setup

Development

Setup

git clone https://github.com/cvs-health/askRITA.git
cd askRITA
pip install poetry && poetry install

Quality Checks

poetry run pytest                    # Tests
poetry run black askrita/         # Format  
poetry run flake8 askrita/        # Lint
poetry run mypy askrita/          # Type check

📚 Documentation

Guide	Description
Installation	pip, Poetry, from-source, development setup
Configuration	YAML configuration — database, LLM, prompts, PII, security
Usage Examples & API	Code examples, CLI, API reference, result format
Supported Platforms	Databases, LLM providers, connection strings, auth
SQL Workflow	Core text-to-SQL workflow — query, chat, export, schema
Conversational SQL	Multi-turn chat mode, follow-up questions, clarification
Research Workflow	CRISP-DM hypothesis testing with scipy statistics
Data Classification	LLM-powered classification of CSV/Excel with dynamic schemas
NoSQL Workflow	MongoDB workflow setup and usage
Export (PPTX, PDF, Excel)	Export query results to branded reports and spreadsheets
Security	SQL safety, prompt injection detection, PII/PHI scanning
Schema Enrichment	Schema caching, descriptions, decorators, cross-project access
Chain of Thoughts	Step-by-step reasoning traces and progress callbacks
CLI Reference	`askrita` command — query, interactive, test, mcp
MCP Server	Model Context Protocol server setup
Claude Desktop Setup	MCP integration with Claude Desktop
CA Bundle Setup	Certificate authority configuration
Benchmark Results	BIRD Mini-Dev model comparison and per-model analysis
Chart Types	Google Charts — 13 chart types, React & Angular guides
Contributing	Dev setup, branching, pull requests, code quality
Versioning & Releases	Semantic versioning, version bump scripts, release checklist
Docker Testing	Cross-version compatibility testing in isolated environments
Changelog	Version history and updates

📖 Complete index: Documentation Site

License

Apache License 2.0 - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

elircorajan

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.13.14

May 2, 2026

0.13.13

Apr 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

askrita-0.13.14.tar.gz (211.4 kB view details)

Uploaded May 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

askrita-0.13.14-py3-none-any.whl (249.3 kB view details)

Uploaded May 2, 2026 Python 3

File details

Details for the file askrita-0.13.14.tar.gz.

File metadata

Download URL: askrita-0.13.14.tar.gz
Upload date: May 2, 2026
Size: 211.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for askrita-0.13.14.tar.gz
Algorithm	Hash digest
SHA256	`1e1c4d54e2d8041cc32f613bb1df6700feb8a1c91e1bc3fcce91201dd1f2765e`
MD5	`bd1ac0396785a760acec22beff82ea99`
BLAKE2b-256	`bdf6d70716412fa010f1acf4ed13ef33d1c6822680063a99e1950c5ca900fa0e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for askrita-0.13.14.tar.gz:

Publisher: publish.yaml on cvs-health/AskRITA

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: askrita-0.13.14.tar.gz
- Subject digest: 1e1c4d54e2d8041cc32f613bb1df6700feb8a1c91e1bc3fcce91201dd1f2765e
- Sigstore transparency entry: 1429044691
- Sigstore integration time: May 2, 2026
Source repository:
- Permalink: cvs-health/AskRITA@71f1b56656ea8f604727fdf5885b347f6c93b475
- Branch / Tag: refs/tags/v0.13.14
- Owner: https://github.com/cvs-health
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yaml@71f1b56656ea8f604727fdf5885b347f6c93b475
- Trigger Event: push

File details

Details for the file askrita-0.13.14-py3-none-any.whl.

File metadata

Download URL: askrita-0.13.14-py3-none-any.whl
Upload date: May 2, 2026
Size: 249.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for askrita-0.13.14-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2151152add73f5c779cc5ce88ddb418b325d6541684caab8b2bd7c1b3cdc39a9`
MD5	`9b42d8c8c29d0b5c621d5f1a0f8a8fcd`
BLAKE2b-256	`c29d45ae930dc7f6d8aeaa680a14db6335f3350efdc583dd0be59d80569407e2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for askrita-0.13.14-py3-none-any.whl:

Publisher: publish.yaml on cvs-health/AskRITA

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: askrita-0.13.14-py3-none-any.whl
- Subject digest: 2151152add73f5c779cc5ce88ddb418b325d6541684caab8b2bd7c1b3cdc39a9
- Sigstore transparency entry: 1429044694
- Sigstore integration time: May 2, 2026
Source repository:
- Permalink: cvs-health/AskRITA@71f1b56656ea8f604727fdf5885b347f6c93b475
- Branch / Tag: refs/tags/v0.13.14
- Owner: https://github.com/cvs-health
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yaml@71f1b56656ea8f604727fdf5885b347f6c93b475
- Trigger Event: push

askrita 0.13.14

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Ask RITA (Reasoning Interface for Text-to-Analytics)

🚀 Four Powerful Workflows

📊 SQLAgentWorkflow - Natural Language to SQL

🍃 NoSQLAgentWorkflow - Natural Language to MongoDB

🔬 ResearchAgent - CRISP-DM Data Science Research

🏷️ DataClassificationWorkflow - LLM-Powered Data Processing

📊 Model Performance Comparison (BIRD Benchmark)

Core Features

Quick Start

1. Install

2. Configure

3. Use

NoSQL (MongoDB)

Research Agent - CRISP-DM Data Science

Type-Safe Integration

Supported Platforms

Configuration

Required Components

Quick Setup

Configuration Templates

Corporate Proxy & SSL

MCP Server (for AI Assistants)

Development

Setup

Quality Checks

📚 Documentation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance