🧩 Safe insertion of SELECT queries into ClickHouse with schema validation
Project description
🚀 Insert Tools
Problem:
Have you faced issues inserting data into databases? Constant schema mismatch errors, incorrect data types, manual checks, and even silent data corruption? If you work with large ETL pipelines and databases, you know how painful it can be.
Solution:
Insert Tools is a robust and flexible tool designed for safe and fast data insertion into databases — starting with ClickHouse. It validates schema by column names (not by order), supports automatic type casting, and lets you dry-run your inserts before touching real data. Perfect for ETL pipelines where target table schemas evolve frequently.
🔥 Why you should try it:
- ✅ Data safety: Validates column names and types before insert.
- ⚙️ Auto type casting: Converts mismatched types when enabled.
- 🚧 Dry-run mode: Test inserts without touching data.
- 🐳 Docker-ready: Comes with ready-to-use Docker integration.
- 🔧 Configurable: Fully controllable insert pipeline.
- 🔥 Time saver: Automates validation and error prevention.
🎯 Key Features:
- 🖥️ Simple CLI and Python API.
- 🛡️ Strict mode to block extra columns.
- 📌 Detailed logging and diagnostics.
- 🔄 Easy CI/CD integration.
📦 Quick install:
pip install insert-tools
To install for development:
pip install -e .[dev]
🚀 Run & Examples:
🐍 Python usage:
from insert_tools.runner import InsertConfig, run_insert
config = InsertConfig(
host="localhost",
database="default",
target_table="my_table",
select_sql="SELECT * FROM source_table",
user="default",
password="admin123",
allow_type_cast=True,
strict_column_match=True
)
run_insert(config)
🖥️ CLI usage:
insert-tools \
--host localhost \
--port 8123 \
--user default \
--password admin123 \
--database default \
--target_table my_table \
--select_sql "SELECT * FROM source_table" \
--allow_type_cast \
--strict \
--dry-run \
--verbose
🧪 Testing & Integration:
pytest -v --cov=insert_tools tests/
Integration tests are supported via Docker (docker-compose.yml).
📈 Roadmap:
Planned and upcoming features:
✅ Core & Safety
- ClickHouse support (stable)
- Manual
insert_columnsmapping - Logging configuration (file, level, formatting)
- Dry-run + exit codes
- Strict schema validator with preview
📦 Priority Database Support
- MySQL — no name-based insert, requires exact column order
- PostgreSQL — order and column count must match
- SQLite — insert depends on column order
- Oracle — insert requires explicit column mapping
- SQL Server — insert must follow column order
🧰 Advanced Features
- Error handling strategies (
fail,warn,skip) - Config file validation (optional)
- Secure secrets handling (.env / vault)
- Optional CAST rules config
📘 Ecosystem
- Full documentation site (mkdocs)
- Schema + config reference
- Auto-generated help from CLI
- GitHub Discussions / Community page
🛠️ Configuration Options
| Parameter | Description | Required |
|---|---|---|
host |
ClickHouse server hostname | ✅ |
port |
ClickHouse server port | ❌ |
user |
ClickHouse user | ❌ |
password |
ClickHouse password | ❌ |
database |
Target database | ✅ |
target_table |
Target table name | ✅ |
select_sql |
SQL query to fetch data | ✅ |
allow_type_cast |
Allow type casting on mismatch | ❌ |
strict_column_match |
Enable strict mode for column matching | ❌ |
🧱 How It Works
- Fetches target table schema from ClickHouse.
- Extracts column names and types from
SELECTquery. - Applies optional
CAST(...)if types mismatch. - Validates column alignment and inserts data.
🤝 Contributing:
Ideas, bug reports, and pull requests are welcome! Join the community and help make Insert Tools better.
⚖️ License
This project uses a dual-license model:
- 🆓 Non-commercial license — free to use for personal, educational, and internal non-commercial purposes. See LICENSE_NONCOMMERCIAL.md
- 💼 Commercial license — required for any commercial use. See LICENSE_COMMERCIAL.md or contact k.n.gorelov@gmail.com for licensing terms.
Insert Tools makes data insertion simple, fast, and safe. Save your time and nerves today!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file insert_tools-1.0.1.tar.gz.
File metadata
- Download URL: insert_tools-1.0.1.tar.gz
- Upload date:
- Size: 12.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3ffa43d5057849e206e7e015d403f312a8a9d15f1bea6ec2aaebf7b592a1e46c
|
|
| MD5 |
078e3dcd46420adbedbc86601888de30
|
|
| BLAKE2b-256 |
9dfad4ece5e29c174860d722ffdde84878b86aceff4e3fe68617fc60497001f3
|
File details
Details for the file insert_tools-1.0.1-py3-none-any.whl.
File metadata
- Download URL: insert_tools-1.0.1-py3-none-any.whl
- Upload date:
- Size: 9.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b852df868887ebe04f43d7dd95608dcf469d6de097601e822c3f39df475d1de9
|
|
| MD5 |
c111cc542b812d04956c62042fc4b17c
|
|
| BLAKE2b-256 |
1a9c4f7519d8a23acb0efbe42c74ab74426ede48abd93d0a0b0509f47902ac81
|