Workflow-native database framework for Kailash SDK

These details have not been verified by PyPI

Project links

Homepage

Project description

Kailash DataFlow

Zero-Config Database Framework - Django simplicity meets enterprise-grade production quality.

🚀 Quick Start (60 seconds)

from kailash_dataflow import DataFlow

# That's it! No configuration needed
db = DataFlow()

# Define your model
@db.model
class User:
    id: int
    name: str
    email: str

# DataFlow automatically creates:
# ✅ Database schema (PostgreSQL, MySQL, SQLite)
# ✅ 9 workflow nodes per model (CRUD + bulk ops)
# ✅ Real SQL operations with security
# ✅ Connection pooling and transaction management
# ✅ MongoDB-style query builder (implemented!)
# ⚠️ Redis query cache (planned)
# ⚠️ Multi-database runtime (PostgreSQL only)

You now have a production-ready database layer!

🎯 What Makes DataFlow Different?

Zero Configuration That Actually Works

# Development? Uses SQLite automatically
db = DataFlow()  # Just works!

# Production? Reads from environment
# DATABASE_URL=postgresql://...
db = DataFlow()  # Still just works!

# Need control? Progressive enhancement
db = DataFlow(
    pool_size=50,
    read_replicas=['replica1', 'replica2'],
    monitoring=True
)

Real Database Operations (Currently Available)

# Traditional ORMs: Imperative code
User.objects.create(name="Alice")  # Django
user = User(name="Alice"); session.add(user)  # SQLAlchemy

# DataFlow: Workflow-native database operations
workflow = WorkflowBuilder()
workflow.add_node("UserCreateNode", "create_user", {
    "name": "Alice",
    "email": "alice@example.com"
})
workflow.add_node("UserListNode", "find_users", {
    "limit": 10,
    "offset": 0
})

# Real SQL is executed: INSERT INTO users (name, email) VALUES ($1, $2)

MongoDB-Style Query Builder (NEW!)

# Get QueryBuilder from any model
builder = User.query_builder()

# MongoDB-style operators
builder.where("age", "$gte", 18)
builder.where("status", "$in", ["active", "premium"])
builder.where("email", "$regex", "^[a-z]+@company\.com$")
builder.order_by("created_at", "DESC")
builder.limit(10)

# Generates optimized SQL for your database
sql, params = builder.build_select()
# PostgreSQL: SELECT * FROM "users" WHERE "age" >= $1 AND "status" IN ($2, $3) AND "email" ~ $4 ORDER BY "created_at" DESC LIMIT 10

# Works seamlessly with ListNode
workflow.add_node("UserListNode", "search", {
    "filter": {
        "age": {"$gte": 18},
        "status": {"$in": ["active", "premium"]},
        "email": {"$regex": "^admin"}
    }
})

Database Requirements

# Current limitation: PostgreSQL only for execution
db = DataFlow(database_url="postgresql://user:pass@localhost/db")

# Schema generation works for all databases
schema_sql = db.generate_complete_schema_sql("sqlite")  # ✅ Works
schema_sql = db.generate_complete_schema_sql("mysql")   # ✅ Works
schema_sql = db.generate_complete_schema_sql("postgresql")  # ✅ Works

# But execution currently requires PostgreSQL
runtime = LocalRuntime()
results, run_id = runtime.execute(workflow.build())  # ✅ PostgreSQL only

Database Operations as Workflow Nodes

# Traditional ORMs: Imperative code
user = User.objects.create(name="Alice")  # Django
user = User(name="Alice"); session.add(user)  # SQLAlchemy

# DataFlow: Workflow-native (9 nodes per model!)
workflow = WorkflowBuilder()
workflow.add_node("UserCreateNode", "create_user", {
    "name": "Alice",
    "email": "alice@example.com"
})
workflow.add_node("UserListNode", "find_users", {
    "filter": {"name": {"$like": "A%"}}
})

Enterprise Configuration

# Multi-tenancy configuration (query modification planned)
db = DataFlow(multi_tenant=True)

# Real SQL generation with security
db = DataFlow(
    database_url="postgresql://user:pass@localhost/db",
    pool_size=20,
    pool_max_overflow=30,
    monitoring=True,
    echo=False  # No SQL logging in production
)

# All generated nodes use parameterized queries for security
# INSERT INTO users (name, email) VALUES ($1, $2)  -- Safe from SQL injection

🚦 Implementation Status

✅ Currently Available (Production-Ready)

Database Schema Generation: Complete CREATE TABLE for PostgreSQL, MySQL, SQLite
Real Database Operations: All 9 CRUD + bulk nodes execute actual SQL
SQL Security: Parameterized queries prevent SQL injection
Connection Management: Connection pooling, DDL execution, error handling
Workflow Integration: Full compatibility with WorkflowBuilder/LocalRuntime
Configuration System: Zero-config to enterprise patterns
MongoDB-Style Query Builder: Complete with all operators ($eq, $gt, $in, $regex, etc.)

⚠️ Limitations

Database Runtime: PostgreSQL execution only (schema generation works for all)
AsyncSQLDatabaseNode: Current limitation requires PostgreSQL connection string

🔄 Planned Features (Roadmap)

Redis Query Caching: User.cached_query() with automatic invalidation
Multi-Database Runtime: SQLite/MySQL execution support
Advanced Multi-Tenancy: Automatic query modification for tenant isolation

📚 Documentation

Getting Started

5-Minute Tutorial - Build your first app
Core Concepts - Understand DataFlow
Examples - Complete applications

Development

Models - Define your schema
CRUD Operations - Basic operations
Relationships - Model associations

Production

Deployment - Go to production
Performance - Optimization guide
Monitoring - Observability

💡 Real-World Examples

E-Commerce Platform

# Define your models
@db.model
class Product:
    id: int
    name: str
    price: float
    stock: int

@db.model
class Order:
    id: int
    user_id: int
    total: float
    status: str

# Use in workflows
workflow = WorkflowBuilder()

# Check inventory
workflow.add_node("ProductGetNode", "check_stock", {
    "id": "{product_id}"
})

# Create order with transaction
workflow.add_node("TransactionContextNode", "tx_start")
workflow.add_node("OrderCreateNode", "create_order", {
    "user_id": "{user_id}",
    "total": "{total}"
})
workflow.add_node("ProductUpdateNode", "update_stock", {
    "id": "{product_id}",
    "stock": "{new_stock}"
})

Multi-Tenant SaaS (Current Implementation)

# Enable multi-tenancy configuration
db = DataFlow(
    database_url="postgresql://user:pass@localhost/db",
    multi_tenant=True
)

# Multi-tenant models get tenant_id field automatically
@db.model
class User:
    name: str
    email: str
    # tenant_id: str automatically added

# Use in workflows with real database operations
workflow.add_node("UserCreateNode", "create_user", {
    "name": "Alice",
    "email": "alice@acme-corp.com"
})
workflow.add_node("UserListNode", "list_users", {
    "limit": 10,
    "filter": {}
})

High-Performance ETL (Current Implementation)

# Bulk operations with real database execution
workflow.add_node("UserBulkCreateNode", "import_users", {
    "data": users_data,  # List of user records
    "batch_size": 1000,
    "conflict_resolution": "skip"
})

# Real bulk INSERT operations executed
# Uses parameterized queries for security
# Processes data in configurable batches

# List operations with filters
workflow.add_node("UserListNode", "active_users", {
    "limit": 1000,
    "offset": 0,
    "order_by": ["created_at"],
    "filter": {"active": True}
})

🏗️ Architecture

DataFlow seamlessly integrates with Kailash's workflow architecture:

┌─────────────────────────────────────────────────────┐
│                 Your Application                     │
├─────────────────────────────────────────────────────┤
│                    DataFlow                          │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐         │
│  │  Models  │  │   Nodes  │  │ Migrations│         │
│  └────┬─────┘  └────┬─────┘  └────┬─────┘         │
│       └──────────────┴──────────────┘               │
│                Core Features                         │
│  QueryBuilder │ QueryCache │ Monitoring │ Multi-tenant │
│  ┌──────────┐  ┌──────────┐  ┌──────────┐         │
│  │MongoDB-  │  │Redis     │  │Pattern   │         │
│  │style     │  │Caching   │  │Invalidate│         │
│  └──────────┘  └──────────┘  └──────────┘         │
├─────────────────────────────────────────────────────┤
│               Kailash SDK                           │
│         Workflows │ Nodes │ Runtime                 │
└─────────────────────────────────────────────────────┘

🧪 Testing

DataFlow includes comprehensive testing support:

# Test with in-memory database
def test_user_creation():
    db = DataFlow(testing=True)

    @db.model
    class User:
        id: int
        name: str

    # Automatic test isolation
    user = db.test_create(User, name="Test User")
    assert user.name == "Test User"

🤝 Contributing

We welcome contributions! DataFlow follows Kailash SDK patterns:

Use SDK components and patterns
Maintain zero-config philosophy
Write comprehensive tests
Update documentation

See CONTRIBUTING.md for details.

📊 Performance

DataFlow provides real database performance with PostgreSQL:

Real SQL execution with parameterized queries
Connection pooling with configurable pool sizes
Bulk operations with batching for large datasets
Production-ready database operations

Performance testing requires PostgreSQL database setup. Advanced caching and query optimization features are planned.

⚡ Why DataFlow?

Real Database Operations: Actual SQL execution, not mocks
Workflow-Native: Database ops as first-class nodes
Production-Ready: PostgreSQL support with connection pooling
Progressive: Simple to start, enterprise features available
100% Kailash: Built on proven SDK components

Built with Kailash SDK | Parent Project | SDK Docs

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

2.8.0

May 6, 2026

2.7.9

May 6, 2026

2.7.8

May 6, 2026

2.7.7

May 3, 2026

2.7.6

May 3, 2026

2.7.5

May 1, 2026

2.7.4

May 1, 2026

2.7.3

May 1, 2026

2.7.1

May 1, 2026

2.7.0

Apr 30, 2026

2.6.0

Apr 30, 2026

2.5.0

Apr 29, 2026

2.4.0

Apr 28, 2026

2.3.3

Apr 28, 2026

2.3.2

Apr 27, 2026

2.3.1

Apr 26, 2026

2.3.0

Apr 25, 2026

2.2.0

Apr 24, 2026

2.1.2

Apr 24, 2026

2.1.0

Apr 24, 2026

2.0.12

Apr 19, 2026

2.0.11

Apr 19, 2026

2.0.10

Apr 19, 2026

2.0.9

Apr 18, 2026

2.0.8

Apr 14, 2026

2.0.7

Apr 13, 2026

2.0.6

Apr 12, 2026

2.0.5

Apr 12, 2026

2.0.4

Apr 12, 2026

2.0.0

Apr 8, 2026

1.8.0

Apr 6, 2026

1.7.1

Apr 5, 2026

1.7.0

Apr 4, 2026

1.6.0

Apr 3, 2026

1.5.1

Apr 1, 2026

1.5.0

Apr 1, 2026

1.4.0

Mar 31, 2026

1.3.0

Mar 30, 2026

1.2.1

Mar 29, 2026

1.2.0

Mar 23, 2026

1.1.0

Mar 21, 2026

1.0.1

Mar 19, 2026

0.12.4

Mar 9, 2026

0.12.3

Mar 7, 2026

0.12.2

Feb 23, 2026

0.12.1

Feb 22, 2026

0.12.0

Feb 22, 2026

0.11.0

Feb 9, 2026

0.10.17

Feb 7, 2026

0.10.16

Jan 30, 2026

0.10.15

Jan 15, 2026

0.10.14

Jan 13, 2026

0.10.13

Jan 8, 2026

0.10.12

Jan 7, 2026

0.10.11

Jan 7, 2026

0.10.10

Jan 7, 2026

0.10.9

Jan 7, 2026

0.10.8

Jan 6, 2026

0.10.7

Jan 3, 2026

0.10.6

Jan 3, 2026

0.10.5

Dec 23, 2025

0.10.4

Nov 29, 2025

0.10.3

Nov 29, 2025

0.10.2

Nov 29, 2025

0.10.1

Nov 28, 2025

0.10.0

Nov 27, 2025

0.9.7

Nov 23, 2025

0.9.6

Nov 19, 2025

0.9.5

Nov 18, 2025

0.9.4

Nov 18, 2025

0.9.3

Nov 18, 2025

0.9.2

Nov 17, 2025

0.9.1

Nov 17, 2025

0.9.0

Nov 16, 2025

0.8.1

Nov 13, 2025

0.8.0

Nov 8, 2025

0.7.16

Nov 7, 2025

0.7.15

Nov 7, 2025

0.7.14

Nov 2, 2025

0.7.13

Nov 2, 2025

0.7.12

Nov 2, 2025

0.7.11

Oct 31, 2025

0.7.10

Oct 30, 2025

0.7.9

Oct 30, 2025

0.7.8

Oct 30, 2025

0.7.7

Oct 30, 2025

0.7.6

Oct 29, 2025

0.7.5

Oct 27, 2025

0.7.4

Oct 27, 2025

0.7.3

Oct 26, 2025

0.7.2

Oct 24, 2025

0.7.1

Oct 24, 2025

0.7.0

Oct 24, 2025

0.6.6

Oct 23, 2025

0.6.5

Oct 23, 2025

0.6.4

Oct 22, 2025

0.6.3

Oct 22, 2025

0.6.2

Oct 22, 2025

0.6.1

Oct 21, 2025

0.6.0

Oct 21, 2025

0.5.6

Oct 20, 2025

0.5.5

Oct 20, 2025

0.5.4

Oct 11, 2025

0.5.3

Oct 10, 2025

0.5.2

Oct 9, 2025

0.5.1

Oct 9, 2025

0.5.0

Aug 21, 2025

0.4.7

Aug 13, 2025

0.4.6

Aug 10, 2025

0.4.5

Aug 6, 2025

0.4.4

Aug 6, 2025

0.4.3

Aug 5, 2025

0.4.2

Aug 5, 2025

0.4.1

Aug 5, 2025

0.4.0

Aug 4, 2025

0.3.7

Aug 2, 2025

0.3.6

Jul 31, 2025

0.3.5

Jul 31, 2025

0.3.4

Jul 31, 2025

0.3.3

Jul 31, 2025

0.3.2

Jul 29, 2025

0.3.1

Jul 22, 2025

This version

0.2.0

Jul 20, 2025

0.1.1

Jul 18, 2025

0.1.0

Jul 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kailash_dataflow-0.2.0.tar.gz (219.9 kB view details)

Uploaded Jul 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kailash_dataflow-0.2.0-py3-none-any.whl (258.2 kB view details)

Uploaded Jul 20, 2025 Python 3

File details

Details for the file kailash_dataflow-0.2.0.tar.gz.

File metadata

Download URL: kailash_dataflow-0.2.0.tar.gz
Upload date: Jul 20, 2025
Size: 219.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kailash_dataflow-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`f79f16cc63e62add097a3e20a0089e37466f7935fd5ef34ed7eeb93afae77e7c`
MD5	`c1fd5ce34e760264216fad31ddc876c2`
BLAKE2b-256	`e3e1180f0ec0658a16fb98a911488c80158d585128a6da703c85a28721b6b8e9`

See more details on using hashes here.

File details

Details for the file kailash_dataflow-0.2.0-py3-none-any.whl.

File metadata

Download URL: kailash_dataflow-0.2.0-py3-none-any.whl
Upload date: Jul 20, 2025
Size: 258.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for kailash_dataflow-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9b8bc1a9aa6feaed97549379581194220e4cc7ceae847477c8b71e1d6701886d`
MD5	`f6b14e610abde7ac4fee7b53b9c57594`
BLAKE2b-256	`6b8020cae73c209e875e653b3a2f4807645ec58ca5bd6df6dd1459bea50aa3e3`

See more details on using hashes here.

kailash-dataflow 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Kailash DataFlow

🚀 Quick Start (60 seconds)

🎯 What Makes DataFlow Different?

Zero Configuration That Actually Works

Real Database Operations (Currently Available)

MongoDB-Style Query Builder (NEW!)

Database Requirements

Database Operations as Workflow Nodes

Enterprise Configuration

🚦 Implementation Status

✅ Currently Available (Production-Ready)

⚠️ Limitations

🔄 Planned Features (Roadmap)

📚 Documentation

Getting Started

Development

Production

💡 Real-World Examples

E-Commerce Platform

Multi-Tenant SaaS (Current Implementation)

High-Performance ETL (Current Implementation)

🏗️ Architecture

🧪 Testing

🤝 Contributing

📊 Performance

⚡ Why DataFlow?

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes