Upload your Chart of Accounts. Get a production-ready financial hierarchy and dbt models. Zero config.
Project description
DataBridge Core
Your finance team just spent 4 hours on VLOOKUP. This takes 5 seconds.
DataBridge Core is a Python toolkit for data reconciliation, profiling, and ingestion. Compare CSV files, find fuzzy matches, detect schema drift, and clean messy data -- from the command line or Python.
pip install databridge-core
5-Second Demo
# Profile a file
databridge profile sales.csv
# Compare two sources -- find orphans, conflicts, match rate
databridge compare source.csv target.csv --keys id
# Fuzzy match names across systems
databridge fuzzy erp_accounts.csv gl_accounts.csv --column name --threshold 80
Python API
from databridge_core import compare_hashes, profile_data, load_csv
# Profile your data
profile = profile_data("chart_of_accounts.csv")
print(f"{profile['rows']} rows, {profile['columns']} columns")
print(f"Potential keys: {profile['potential_key_columns']}")
# Compare two sources
result = compare_hashes("source.csv", "target.csv", key_columns="account_id")
stats = result["statistics"]
print(f"Match rate: {stats['match_rate_percent']}%")
print(f"Conflicts: {stats['conflicts']}, Orphans: {stats['total_orphans']}")
Commands
| Command | Description |
|---|---|
databridge profile <file> |
Profile data: structure, quality, cardinality |
databridge compare <a> <b> --keys <col> |
Hash comparison: orphans, conflicts, match rate |
databridge fuzzy <a> <b> -c <col> |
Fuzzy match columns across two files |
databridge diff <a> <b> |
Text diff between two files |
databridge drift <old> <new> |
Detect schema drift between CSVs |
databridge transform <file> -c <col> --op upper |
Clean a column (upper/lower/strip/trim/remove_special) |
databridge merge <a> <b> --keys <col> |
Merge two CSVs on key columns |
databridge find "*.csv" |
Find files matching a pattern |
databridge parse <text> |
Parse tabular data from messy text |
Optional Extras
pip install 'databridge-core[fuzzy]' # Fuzzy matching (rapidfuzz)
pip install 'databridge-core[pdf]' # PDF text extraction (pypdf)
pip install 'databridge-core[ocr]' # OCR image extraction (pytesseract)
pip install 'databridge-core[sql]' # Database queries (sqlalchemy)
pip install 'databridge-core[all]' # Everything
pip install 'databridge-core[dev]' # Development tools (pytest, ruff, build)
Built for Finance
DataBridge Core is the open-source foundation of DataBridge AI -- a full platform for financial hierarchy management, dbt model generation, and enterprise data reconciliation.
How it works: Upload your Chart of Accounts. Get a production-ready financial hierarchy and dbt models. Zero config.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file databridge_core-1.0.0.tar.gz.
File metadata
- Download URL: databridge_core-1.0.0.tar.gz
- Upload date:
- Size: 24.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4a2a01b04ff11927cecbbf66f051d23dc8b65371942efe8ceaf4ee7597e18d87
|
|
| MD5 |
3f456ba7c6f646db2463d4fda833b622
|
|
| BLAKE2b-256 |
8a3c7ee97d44609e0ad97d0dc9906eb18f032f3e0adee9d573ac9825a27fb329
|
File details
Details for the file databridge_core-1.0.0-py3-none-any.whl.
File metadata
- Download URL: databridge_core-1.0.0-py3-none-any.whl
- Upload date:
- Size: 26.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cec051b180342616c349e22f294a7522eb124b3f2fbc1f007285e5edd102fdda
|
|
| MD5 |
5184372712d33ac1be6ea892d05a02cb
|
|
| BLAKE2b-256 |
8c812e7f3636b4347132089a0d9c9e68a7e7a3e6432c0f1f957253a1512d46f4
|