Skip to main content

AI-powered Talend to dbt migration. $0 cost. No infrastructure. One click.

Project description

taldbt — AI Powered Talend to dbt Migration

Convert legacy Talend ETL to modern dbt SQL using semantic AI transpilation.

Product Page: https://taldbt.netlify.app Live Demo: https://taldbt.streamlit.app Docker Image: docker pull souravetl/taldbt:latest

Installation

# Core AI migration agent + CLI
pip install taldbt==0.2.1

# With Streamlit web UI
pip install taldbt[ui]==0.2.1

# With Temporal orchestration
pip install taldbt[temporal]==0.2.1

# Everything
pip install taldbt[all]==0.2.1

CLI Usage

# Launch the web UI
taldbt ui

# Discover and analyze a Talend project
taldbt discover ./my_talend_project

# Full migration to dbt
taldbt migrate ./my_talend_project ./dbt_output

# Check version
taldbt version

Quick Start

Docker (recommended)

docker pull souravetl/taldbt:latest
docker pull ollama/ollama:latest
docker compose up -d
docker exec taldbt-ollama ollama pull qwen3-coder:30b
# Open http://localhost:8501

Cloud (no install)

Upload your Talend ZIP at https://taldbt.streamlit.app

Local Development

pip install taldbt[all]==0.2.1
streamlit run taldbt/ui/app.py

Tech Stack

Component Purpose
DuckDB + Flock In-process analytics + LLM-in-SQL validation
dbt-core SQL transformation framework
Temporal.io DAG-aware workflow orchestration
Ollama / Cerebras / Groq AI translation (local or cloud)
sqlglot Multi-dialect SQL transpilation
Faker Synthetic test data with FK integrity
networkx Dependency graph + topological sort
lxml + Pydantic XML parsing + type-safe AST

Project Structure

taldbt/
├── Dockerfile              # Tier 1: Docker image
├── docker-compose.yml      # Tier 1: full stack
├── docker-compose.cpu.yml  # Tier 1: no-GPU override
├── docker/entrypoint.sh    # Docker startup script
├── requirements.txt        # Python dependencies
├── packages.txt            # Tier 3: apt deps (Streamlit Cloud)
├── .streamlit/             # Streamlit config + secrets
├── dist/                   # Air-gapped distribution package
├── docs/                   # Architecture + knowledge transfer
├── main.py                 # CLI entry point
└── taldbt/                 # Core application
    ├── ui/                 # Streamlit web app
    ├── parsers/            # XML parsing + component parsers
    ├── codegen/            # SQL generation + dbt scaffolding
    ├── engine/             # DuckDB + validation + test data
    ├── expert/             # Component knowledge base
    ├── graphing/           # DAG builder + data lineage
    ├── llm/                # LLM provider chain
    ├── models/             # Pydantic AST models
    ├── orchestration/      # Temporal + AutoPilot
    └── tests/              # Test suite

Deployment Tiers

  • Tier 1 (Docker): docker compose up -d — Ollama + Temporal + UI
  • Tier 2 (pip): pip install taldbt[all] — CLI + web UI + Temporal
  • Tier 3 (Cloud): Streamlit Cloud + Cerebras/Groq AI — no local install

Requirements

  • Python 3.10+
  • Ollama for local AI (optional — falls back to free cloud AI via Cerebras/Groq)

License

Proprietary. Contact souravroy7864@gmail.com for licensing.


Made with care by Sourav Roy

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

taldbt-0.2.1.tar.gz (110.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

taldbt-0.2.1-py3-none-any.whl (123.7 kB view details)

Uploaded Python 3

File details

Details for the file taldbt-0.2.1.tar.gz.

File metadata

  • Download URL: taldbt-0.2.1.tar.gz
  • Upload date:
  • Size: 110.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for taldbt-0.2.1.tar.gz
Algorithm Hash digest
SHA256 1d58d34ea8df035e8b61612c5d12464c8d657dc5010766a2df8ca94db83ff5b9
MD5 0e1866346a4f47df5cc3b5d98d42113b
BLAKE2b-256 7d3571a6800ef57f3ab27ca075bc6d6ad40050cea67f27bc2c19635a908412f2

See more details on using hashes here.

File details

Details for the file taldbt-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: taldbt-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 123.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for taldbt-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 4332eff40a5156f71d67d2319ccefc9f12bd0d228e09f1681ccec4e34a4ec236
MD5 759404ed7276c970690ace321ef27fcb
BLAKE2b-256 03c65c9e06d2c5d404031fc44704edef1d8bbf0e1891f00bd456af8eb2968ef5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page