This project/library contains common elements related to ETL processes...
Project description
# core-etl
This library provides essential components for ETL processes, offering reusable interfaces for seamless data extraction, transformation, and loading….
Installation
Install from PyPI using pip:
pip install core-etl
uv pip install core-etl # Or using UV...
Features
Base ETL Framework
Template method pattern for ETL workflow orchestration
Comprehensive lifecycle hooks (pre-processing, execution, post-processing, cleanup)
Built-in error handling with detailed exception logging
Task status tracking (CREATED, EXECUTING, SUCCESS, ERROR)
Timezone support for date/datetime processing (defaults to UTC)
Temporary folder management for local file operations
Extensible resource cleanup mechanisms
File-Based ETL (IBaseEtlFromFile)
Process files from various sources (SFTP, local filesystem, cloud storage)
Iterator-based file processing with error isolation per file
Individual file success/error callbacks for custom handling
Batch file operations with automatic error recovery
Extensible hooks: get_paths(), process_file(), on_success(), on_error()
Record-Based ETL (IBaseEtlFromRecord)
Process records from APIs, databases, files, message queues, and data streams
Memory-efficient batch processing with configurable batch sizes
Built-in transformation pipeline:
Field removal (attrs_to_remove)
Field renaming (name_mapper)
Data type casting (type_mapper)
Pre and post transformation hooks for custom business logic
Incremental processing support with last_processed markers
Extensible methods: retrieve_records(), process_records(), pre_transformations(), post_transformations()
Quick Start
Installation
Install the package:
pip install core-etl
uv pip install core-etl # Or using UV...
pip install -e ".[dev]" # For development...
Setting Up Environment
Install required libraries:
pip install --upgrade pip
pip install virtualenv
Create Python virtual environment:
virtualenv --python=python3.12 .venv
Activate the virtual environment:
source .venv/bin/activate
Install packages
pip install .
pip install -e ".[dev]"
Check tests and coverage
python manager.py run-tests
python manager.py run-coverage
Contributing
Contributions are welcome! Please:
Fork the repository
Create a feature branch
Write tests for new functionality
Ensure all tests pass: pytest -n auto
Run linting: pylint core_etl
Run security checks: bandit -r core_etl
Submit a pull request
License
This project is licensed under the MIT License. See the LICENSE file for details.
Links
Support
For questions or support, please open an issue on GitLab or contact the maintainers.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file core_etl-3.1.0.tar.gz.
File metadata
- Download URL: core_etl-3.1.0.tar.gz
- Upload date:
- Size: 10.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ab9d3f87f6bdef8ce67870932c785aebd2d7124a0a033178e580890533cc631d
|
|
| MD5 |
4ff9ec937ab1a3201112da72e92bbea0
|
|
| BLAKE2b-256 |
13925d69d902a8a70df3d1e9a3a20d22abb7133a0c0cf4bb3dfb700325d9c039
|
File details
Details for the file core_etl-3.1.0-py3-none-any.whl.
File metadata
- Download URL: core_etl-3.1.0-py3-none-any.whl
- Upload date:
- Size: 9.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
09f5d58c1ba92e72aadcc9a26fe6f65ec47f30e10ad9f8a930aa27ecb7e9884e
|
|
| MD5 |
e526288302731b901b0076c78bcb4d43
|
|
| BLAKE2b-256 |
da628b11ac3298da1fee55e6399699338eba176d6d6d8c778fbc798368291dc3
|