Skip to main content

A flexible data ingestion library for various file formats

Project description

Data Ingestors

License: MIT

📄 Description

A robust data ingestion framework for machine learning pipelines. This repository provides tools and utilities for managing, processing, and validating training/test datasets. It supports various data sources, formats, and processing pipelines, making it easier to create and maintain ML datasets.

🛠️ Tech Stack

  • Python 3.x
  • Docker (for containerization)
  • Data processing libraries (Pandas, NumPy)

🚀 Installation & Usage Instructions

  1. Clone the repository
  2. Install dependencies:
    pip install -r src/requirements.txt
    
  3. Configure your environment
  4. Follow the documentation guide to Create Your Training/Test Dataset

📦 Features

  • Multi-source data ingestion
  • Data validation and preprocessing
  • Database integration
  • API endpoints for data management
  • Containerized deployment
  • Kubernetes support

📜 License

This project is licensed under the MIT License - see the LICENSE file for details.

📞 Support

For additional support or questions, please refer to our documentation or contact the Tracebloc support team at support@tracebloc.io.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tracebloc_ingestor-0.1.0.tar.gz (21.8 kB view details)

Uploaded Source

File details

Details for the file tracebloc_ingestor-0.1.0.tar.gz.

File metadata

  • Download URL: tracebloc_ingestor-0.1.0.tar.gz
  • Upload date:
  • Size: 21.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.21

File hashes

Hashes for tracebloc_ingestor-0.1.0.tar.gz
Algorithm Hash digest
SHA256 77cc3b078ea64348fcac48caddb9a8541425fca90698175abe419e35ca4f15c7
MD5 dd03bb1c34b705cf592bb0320473ca67
BLAKE2b-256 bd37aa9cf664049f98bc88fe7cde6e87f4e2bd88f9ab4ba2378d1eb0c3bfef0e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page