AI-powered Talend to dbt migration. $0 cost. No infrastructure. One click.
taldbt — AI Powered Talend to dbt Migration
Convert legacy Talend ETL to modern dbt SQL using semantic AI transpilation.
Product Page: https://taldbt.netlify.app
Live Demo: https://taldbt.streamlit.app
Docker Image: docker pull souravetl/taldbt:latest
Installation
# Core AI migration agent + CLI
pip install taldbt==0.2.1
# With Streamlit web UI (quotes keep shells such as zsh from expanding the brackets)
pip install "taldbt[ui]==0.2.1"
# With Temporal orchestration
pip install "taldbt[temporal]==0.2.1"
# Everything
pip install "taldbt[all]==0.2.1"
CLI Usage
# Launch the web UI
taldbt ui
# Discover and analyze a Talend project
taldbt discover ./my_talend_project
# Full migration to dbt
taldbt migrate ./my_talend_project ./dbt_output
# Check version
taldbt version
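The two main commands above can also be chained from a script. The following is a minimal sketch, not part of taldbt itself: `build_commands` and `run_migration` are hypothetical helper names, and the sketch assumes the CLI exits with a non-zero status when a step fails.

```python
import shutil
import subprocess


def build_commands(project_dir: str, output_dir: str) -> list[list[str]]:
    """Return the discover and migrate invocations in the order they should run."""
    return [
        ["taldbt", "discover", project_dir],
        ["taldbt", "migrate", project_dir, output_dir],
    ]


def run_migration(project_dir: str, output_dir: str) -> None:
    """Run discover, then migrate, stopping at the first failing step."""
    if shutil.which("taldbt") is None:
        raise RuntimeError("taldbt is not on PATH; install it with pip first")
    for cmd in build_commands(project_dir, output_dir):
        # check=True raises CalledProcessError on a non-zero exit status
        # (assumption: the taldbt CLI signals failure that way).
        subprocess.run(cmd, check=True)
```

Calling `run_migration("./my_talend_project", "./dbt_output")` would run the same two steps shown in the CLI examples. Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.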
Quick Start
Docker (recommended)
docker pull souravetl/taldbt:latest
docker pull ollama/ollama:latest
docker compose up -d
docker exec taldbt-ollama ollama pull qwen3-coder:30b
# Open http://localhost:8501
Cloud (no install)
Upload your Talend ZIP at https://taldbt.streamlit.app
Local Development
pip install "taldbt[all]==0.2.1"
streamlit run taldbt/ui/app.py
Tech Stack
| Component | Purpose |
|---|---|
| DuckDB + Flock | In-process analytics + LLM-in-SQL validation |
| dbt-core | SQL transformation framework |
| Temporal.io | DAG-aware workflow orchestration |
| Ollama / Cerebras / Groq | AI translation (local or cloud) |
| sqlglot | Multi-dialect SQL transpilation |
| Faker | Synthetic test data with FK integrity |
| networkx | Dependency graph + topological sort |
| lxml + Pydantic | XML parsing + type-safe AST |
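To illustrate the "XML parsing" and "dependency graph + topological sort" rows, here is a small sketch of the general idea. It is not taldbt's implementation: it swaps in the standard library (`xml.etree.ElementTree` and `graphlib`) for lxml and networkx so it runs without extra packages, and the `<job>`, `<node>`, and `<connection>` element and attribute names are a hypothetical, simplified stand-in for a real Talend job file.

```python
import xml.etree.ElementTree as ET
from graphlib import TopologicalSorter

# Hypothetical, heavily simplified job definition (not the real Talend schema).
SAMPLE_JOB = """
<job>
  <node name="tDBInput_1"/>
  <node name="tMap_1"/>
  <node name="tDBOutput_1"/>
  <connection source="tDBInput_1" target="tMap_1"/>
  <connection source="tMap_1" target="tDBOutput_1"/>
</job>
"""


def execution_order(job_xml: str) -> list[str]:
    """Return component names so that every source precedes its targets."""
    root = ET.fromstring(job_xml)
    # Map each component to the set of components it depends on.
    predecessors: dict[str, set[str]] = {
        node.get("name"): set() for node in root.iter("node")
    }
    for conn in root.iter("connection"):
        predecessors.setdefault(conn.get("source"), set())
        predecessors.setdefault(conn.get("target"), set()).add(conn.get("source"))
    # static_order() raises graphlib.CycleError if the graph contains a cycle.
    return list(TopologicalSorter(predecessors).static_order())
```

For the sample above, `execution_order(SAMPLE_JOB)` yields the input, then the map, then the output component. An ordering like this is what lets upstream models be generated before the models that reference them.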
Project Structure
taldbt/
├── Dockerfile # Tier 1: Docker image
├── docker-compose.yml # Tier 1: full stack
├── docker-compose.cpu.yml # Tier 1: no-GPU override
├── docker/entrypoint.sh # Docker startup script
├── requirements.txt # Python dependencies
├── packages.txt # Tier 3: apt deps (Streamlit Cloud)
├── .streamlit/ # Streamlit config + secrets
├── dist/ # Air-gapped distribution package
├── docs/ # Architecture + knowledge transfer
├── main.py # CLI entry point
└── taldbt/ # Core application
    ├── ui/ # Streamlit web app
    ├── parsers/ # XML parsing + component parsers
    ├── codegen/ # SQL generation + dbt scaffolding
    ├── engine/ # DuckDB + validation + test data
    ├── expert/ # Component knowledge base
    ├── graphing/ # DAG builder + data lineage
    ├── llm/ # LLM provider chain
    ├── models/ # Pydantic AST models
    ├── orchestration/ # Temporal + AutoPilot
    └── tests/ # Test suite
Deployment Tiers
- Tier 1 (Docker): docker compose up -d (Ollama + Temporal + UI)
- Tier 2 (pip): pip install taldbt[all] (CLI + web UI + Temporal)
- Tier 3 (Cloud): Streamlit Cloud + Cerebras/Groq AI (no local install)
Requirements
- Python 3.10+
- Ollama for local AI (optional — falls back to free cloud AI via Cerebras/Groq)
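The local-first, cloud-fallback behaviour described above can be pictured as an ordered provider chain. The sketch below only illustrates that pattern under stated assumptions; it is not taldbt's code. `translate_with_fallback` and both stub providers are hypothetical names, and the stubs make no network calls.

```python
from collections.abc import Callable

# A provider is any callable that takes a prompt and returns text.
Provider = Callable[[str], str]


def translate_with_fallback(
    prompt: str, providers: list[tuple[str, Provider]]
) -> tuple[str, str]:
    """Try each provider in order; return (name, response) from the first success."""
    errors: list[str] = []
    for name, call in providers:
        try:
            return name, call(prompt)
        except Exception as exc:  # a real chain would catch narrower error types
            errors.append(f"{name}: {exc}")
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stub providers standing in for real clients (hypothetical).
def local_unavailable(prompt: str) -> str:
    raise ConnectionError("local model not running")


def cloud_stub(prompt: str) -> str:
    return f"-- translated: {prompt}"
```

With `[("local", local_unavailable), ("cloud", cloud_stub)]` as the chain, the first provider fails and the second one answers. Collecting the error messages makes it easier to see why every provider was skipped if the whole chain fails.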
License
Proprietary. Contact souravroy7864@gmail.com for licensing.
Made with care by Sourav Roy
File details
Details for the file taldbt-0.2.1.tar.gz.
File metadata
- Download URL: taldbt-0.2.1.tar.gz
- Upload date:
- Size: 110.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 1d58d34ea8df035e8b61612c5d12464c8d657dc5010766a2df8ca94db83ff5b9 |
| MD5 | 0e1866346a4f47df5cc3b5d98d42113b |
| BLAKE2b-256 | 7d3571a6800ef57f3ab27ca075bc6d6ad40050cea67f27bc2c19635a908412f2 |
File details
Details for the file taldbt-0.2.1-py3-none-any.whl.
File metadata
- Download URL: taldbt-0.2.1-py3-none-any.whl
- Upload date:
- Size: 123.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 4332eff40a5156f71d67d2319ccefc9f12bd0d228e09f1681ccec4e34a4ec236 |
| MD5 | 759404ed7276c970690ace321ef27fcb |
| BLAKE2b-256 | 03c65c9e06d2c5d404031fc44704edef1d8bbf0e1891f00bd456af8eb2968ef5 |
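A downloaded file can be checked against the published SHA256 digest before it is installed. This is a minimal sketch using only the standard library; `sha256_of` and `matches_published` are hypothetical helper names, and the file path is whatever location the archive was saved to.

```python
import hashlib
import hmac


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns empty bytes at EOF.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path: str, expected_hex: str) -> bool:
    """Compare a file's digest with a published one, ignoring case and whitespace."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

For example, `matches_published("taldbt-0.2.1.tar.gz", "<SHA256 from the table>")` should return `True` for an intact download of the source distribution. pip can also enforce digests itself through hash-checking mode in a requirements file.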