A Python library that automates your machine learning workflow from data preparation to model deployment

Trainium – Autopilot Your ML 🚀 (In Development)

Trainium is a Python library that aims to automate the machine learning workflow, from data preparation to model deployment, with minimal configuration. The project is in early development, and the features listed below are planned rather than implemented.

🔍 Current Status

Trainium is currently in the early development phase. We are working on:

  • Building the core architecture and interfaces
  • Implementing base components and abstractions
  • Setting up a testing framework

✨ Planned Features

1. Intelligent Data Preprocessing

  • Automatic handling of missing values
  • Smart feature scaling and normalization
  • Categorical encoding optimization
  • Feature selection and dimensionality reduction
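As a rough illustration of the first two items, the sketch below fills missing values with the column mean and then standardizes the column. It uses only the Python standard library; the function names `impute_mean` and `standardize` are hypothetical and are not part of Trainium's API, which is still being designed.

```python
from statistics import mean, pstdev


def impute_mean(column):
    """Replace None entries with the mean of the observed values (hypothetical helper)."""
    observed = [v for v in column if v is not None]
    fill = mean(observed) if observed else 0.0
    return [fill if v is None else v for v in column]


def standardize(column):
    """Scale values to zero mean and unit variance, using the population std."""
    mu = mean(column)
    sigma = pstdev(column)
    if sigma == 0:
        # A constant column carries no information; map it to zeros.
        return [0.0 for _ in column]
    return [(v - mu) / sigma for v in column]


ages = [20.0, None, 40.0]
filled = impute_mean(ages)    # the missing entry becomes the mean of 20.0 and 40.0
scaled = standardize(filled)  # centered on zero
```

Mean imputation is only one possible strategy; a real preprocessing module would likely choose between several based on the column type and the amount of missing data.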

2. Adaptive Model Selection

  • Automatic problem type detection (classification/regression)
  • Intelligent algorithm recommendations based on data characteristics
  • Multi-model comparison and ensemble creation
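One simple way to approach problem type detection is a heuristic on the target column. The sketch below is an assumption about how this might work, not Trainium's actual logic; `infer_problem_type` and the `max_classes` cutoff are hypothetical.

```python
def infer_problem_type(target, max_classes=20):
    """Guess 'classification' or 'regression' from target values (hypothetical heuristic)."""
    values = [v for v in target if v is not None]
    if not values:
        raise ValueError("target column has no observed values")
    distinct = set(values)
    # String or boolean labels can only be classes.
    if any(isinstance(v, (str, bool)) for v in distinct):
        return "classification"
    # Fractional values suggest a continuous target.
    if any(isinstance(v, float) and not v.is_integer() for v in distinct):
        return "regression"
    # Whole numbers: treat a small set of distinct values as class labels.
    return "classification" if len(distinct) <= max_classes else "regression"


kind_labels = infer_problem_type(["cat", "dog", "cat"])
kind_prices = infer_problem_type([1.5, 2.7, 3.1])
```

A heuristic like this can misclassify integer-coded targets, such as counts with few distinct values, so an explicit override would still be useful.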

3. Advanced Training Pipeline

  • Automated hyperparameter optimization with multiple algorithms:
    • Random search
    • Bayesian optimization
    • Evolutionary algorithms
  • Customizable early stopping criteria:
    • No-improvement stopping (patience-based)
    • Threshold-based stopping
    • Time limit stopping
    • Iteration limit stopping
  • Cross-validation with configurable strategies
  • Learning rate scheduling
  • Resource-aware computation scaling
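To make the interaction between search and stopping criteria concrete, here is a minimal sketch of random search combined with the four stopping rules listed above. The `StoppingCriteria` class, the `random_search` function, and the toy objective are all hypothetical and do not reflect Trainium's eventual interface.

```python
import random
import time


class StoppingCriteria:
    """Combine patience, threshold, time-limit, and iteration-limit rules (hypothetical)."""

    def __init__(self, patience=None, threshold=None, time_limit=None, max_iters=None):
        if all(v is None for v in (patience, threshold, time_limit, max_iters)):
            raise ValueError("at least one stopping criterion is required")
        self.patience = patience
        self.threshold = threshold
        self.time_limit = time_limit
        self.max_iters = max_iters
        self.best = float("-inf")
        self.stale = 0  # iterations since the last improvement
        self.iters = 0
        self.start = time.monotonic()

    def update(self, score):
        """Record a score; return the reason to stop, or None to continue."""
        self.iters += 1
        if score > self.best:
            self.best, self.stale = score, 0
        else:
            self.stale += 1
        if self.threshold is not None and self.best >= self.threshold:
            return "threshold"
        if self.patience is not None and self.stale >= self.patience:
            return "patience"
        if self.max_iters is not None and self.iters >= self.max_iters:
            return "iteration_limit"
        if self.time_limit is not None and time.monotonic() - self.start >= self.time_limit:
            return "time_limit"
        return None


def random_search(objective, space, criteria, seed=0):
    """Sample uniformly from `space` until a stopping criterion fires."""
    rng = random.Random(seed)
    best_params, best_score = None, float("-inf")
    while True:
        params = {name: rng.uniform(lo, hi) for name, (lo, hi) in space.items()}
        score = objective(params)
        if score > best_score:
            best_params, best_score = params, score
        reason = criteria.update(score)
        if reason is not None:
            return best_params, best_score, reason


def objective(params):
    # Toy objective: the score peaks at lr = 0.3.
    return -((params["lr"] - 0.3) ** 2)


best_params, best_score, reason = random_search(
    objective, {"lr": (0.0, 1.0)}, StoppingCriteria(patience=10, max_iters=50)
)
```

Keeping the stopping logic in its own object means the same criteria could be reused with Bayesian or evolutionary search, since each only needs to report one score per iteration.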

4. Comprehensive Evaluation

  • Performance metrics tailored to your problem
  • Interpretability and explainability tools
  • Bias and fairness assessment
  • Continuous improvement feedback loop
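"Metrics tailored to your problem" could be as simple as a lookup keyed by problem type. The following sketch assumes two problem types and one metric each; the `METRICS` registry and the `evaluate` function are hypothetical names chosen for illustration.

```python
from math import sqrt


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def rmse(y_true, y_pred):
    """Root mean squared error between true and predicted values."""
    return sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Hypothetical registry mapping a problem type to its default metrics.
METRICS = {
    "classification": {"accuracy": accuracy},
    "regression": {"rmse": rmse},
}


def evaluate(problem_type, y_true, y_pred):
    """Compute every default metric registered for the given problem type."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    return {name: fn(y_true, y_pred) for name, fn in METRICS[problem_type].items()}


clf_report = evaluate("classification", ["a", "b", "a", "b"], ["a", "b", "b", "b"])
reg_report = evaluate("regression", [1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
```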

5. Production-Ready Deployment

  • Model export in multiple formats
  • Serialization and version control
  • Inference API generation
  • Monitoring and retraining capabilities
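Serialization with version tracking might look something like the sketch below, which writes model parameters to a JSON file alongside a version string and a SHA-256 checksum. The file layout and the `export_model` and `load_model` helpers are assumptions for illustration only; Trainium's export formats have not been defined yet.

```python
import hashlib
import json
import tempfile
from pathlib import Path


def _checksum(params):
    """SHA-256 of the parameters in a canonical (sorted-key) JSON encoding."""
    payload = json.dumps(params, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def export_model(params, directory, version):
    """Write parameters, version, and checksum to model-v<version>.json (hypothetical layout)."""
    record = {"version": version, "sha256": _checksum(params), "params": params}
    target = Path(directory) / f"model-v{version}.json"
    target.write_text(json.dumps(record, indent=2), encoding="utf-8")
    return target


def load_model(path):
    """Read a model file and verify its checksum before returning the parameters."""
    record = json.loads(Path(path).read_text(encoding="utf-8"))
    if _checksum(record["params"]) != record["sha256"]:
        raise ValueError("checksum mismatch: file may be corrupted")
    return record["params"], record["version"]


with tempfile.TemporaryDirectory() as tmp:
    path = export_model({"weights": [0.5, -1.25], "bias": 0.1}, tmp, version="0.1.0")
    restored, version = load_model(path)
```

JSON is used here because it is human-readable and safe to load; binary formats would be a better fit for large weight arrays.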

🛠 Installation (Coming Soon)

# Not yet available on PyPI
pip install trainium

🚀 Target Usage

import trainium

# Load your dataset
dataset = trainium.load_data("your_data.csv")

# Train your model with one line
model = trainium.AutoTrain(target="target_column")

# Make predictions
predictions = model.predict(new_data)

# Export your model
model.export("my_trained_model")

📚 Documentation

Documentation is under development. Stay tuned for detailed guides, API references, and examples.

🤝 Contributing

We welcome contributions! Check out our contribution guidelines to get started.

📄 License

Trainium is released under the MIT License.

📊 Development Roadmap (May 17th, 2025)

  • Core architecture design
  • Base class implementations
  • Testing infrastructure
  • Data preprocessing modules
  • Model selection and training pipeline
  • Evaluation framework
  • Deployment utilities
  • Documentation and examples

✅ Continuous Integration & Code Quality

This project uses uv for dependency management and a CI pipeline built on GitHub Actions. Every push and pull request is automatically tested and linted on Python 3.9 through 3.12:

  • Testing: Runs all unit tests with pytest (see tests/ for coverage of core, evaluation, preprocessing, and pipeline modules).
  • Linting: Enforces code quality with ruff.
  • Formatting: Checks code style with black.
  • Typing: Ensures type correctness with mypy.

