AI-powered SQL Agent for data engineering (Compiled Version)
Project description
🎯 Overview
Datus is an AI-powered agent that transforms data engineering and metric management into a conversational experience.
With the Datus Agent you can:
- Simplify Data Engineering Development:
- Enable data engineers to develop and debug using natural language, reducing entry barriers and increasing productivity.
- Standardize and Manage Metrics:
- Extract and unify metrics consistently, ensuring your BI and AI tools always access accurate and reliable definitions.
- Self-Improving:
- Convert iterative CoT reasoning workflows into structured datasets, enabling SFT and RL for ongoing, automatic improvements in model accuracy and performance.
✨ Why Choose Datus Agent?
🚀 Key Features
💬 Conversational Data Engineering
- Natural Language Workflows - Use
/to execute complex task in plain language - Intelligent SQL Generation -
!gencreates optimized SQL with!fixfor instant corrections - Live Workflow Monitoring -
!darun_screenshows real-time execution status - Schema Intelligence -
!slprovides smart table and column recommendations
📈 Smart Metrics Management
- Automated Metric Generation -
!gen_metricsextracts business metrics from your queries - Semantic Model Creation -
!gen_semantic_modelbuilds comprehensive data models - Streaming Analytics - Real-time metric generation with
!gen_metrics_streamvariants - Context-Aware Operations -
!setmanages different workflow contexts
🔄 Self-Improving AI System
- Reasoning Mode -
!reasonprovides step-by-step analysis with detailed CoT for complex problems - Standard log Output - Comprehensively record the user’s reasoning process to generate high-value data for subsequent model refinement and evolution
💡 Use Cases
Data Pipeline Development
# Natural language query execution
!reason "create a pipeline that aggregates daily sales by region"
# View recommended tables
!sl
# Schema linking found: sales_data, regions, daily_transactions
# Generate and refine SQL
!gen
# Generated: SELECT region_id, DATE(sale_date) as day, SUM(amount)...
!fix add product category grouping
# Updated SQL with category dimension added
Metric Standardization
# Check existing metrics
@subject
# Generate new metrics from analysis
!gen_metrics_stream
# Streaming metric generation...
# ✓ Monthly Active Users (MAU)
# ✓ Average Order Value (AOV)
# ✓ Customer Lifetime Value (CLV)
# Create semantic model
!gen_semantic_model
# Generated comprehensive data model with relationships
Intelligent Debugging
# Start debugging session
!dastart "debug ETL memory error"
# Explore context
@context_screen
# Visual display of current tables, schemas, and resources
# Run reasoning analysis
!reason_stream
# Analyzing: Large dataset (10TB) without partitioning detected
# Suggesting: Date-based partitioning, chunked processing
# Apply fix
!fix implement suggested partitioning stratege
Get more
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
datus_agent-0.2.0.tar.gz
(587.6 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
datus_agent-0.2.0-py3-none-any.whl
(642.5 kB
view details)
File details
Details for the file datus_agent-0.2.0.tar.gz.
File metadata
- Download URL: datus_agent-0.2.0.tar.gz
- Upload date:
- Size: 587.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7674c365764cf16e6f8bcf27d46ca4b7ea8bf906d620055f7ec457ed8cdb31d6
|
|
| MD5 |
78937b8b1931bfa4d750c9f4910be62c
|
|
| BLAKE2b-256 |
47eec72bbdffd02b3bb36068d06eb63141ad1962a5cea8f5829ddb0b9c8f0748
|
File details
Details for the file datus_agent-0.2.0-py3-none-any.whl.
File metadata
- Download URL: datus_agent-0.2.0-py3-none-any.whl
- Upload date:
- Size: 642.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9180a19466300baa5082db42a68621ecfd9afa18152733033a7d956f3057b67a
|
|
| MD5 |
6212bc2581033ca380f4c14ec5f5fa5a
|
|
| BLAKE2b-256 |
5bdbb3fdedbbf3e487e050e21b54723e78535092525e73a1a032efa2cd281467
|