MLflow adapter for CrateDB

About

MLflow is an open source AI engineering platform for managing the whole ML lifecycle for agents, LLMs, and ML models, including experimentation, reproducibility, and deployment.

CrateDB is a distributed and scalable SQL database for storing and analyzing massive amounts of data in near real-time, even with complex queries. CrateDB is based on Lucene and Elasticsearch, but compatible with PostgreSQL.

Details

MLflow enables teams of all sizes to debug, evaluate, monitor, and optimize production-quality AI applications while controlling costs and managing access to models and data.

The MLflow adapter for CrateDB lets MLflow use CrateDB as the storage database for its various subsystems.
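As a minimal sketch of what "using CrateDB as a storage database" might look like in practice, the helper below assembles a database URI that could be handed to MLflow as its tracking or backend store location. It assumes the SQLAlchemy-style `crate://` URL scheme and a `schema` query parameter; the function name `cratedb_tracking_uri` and its parameters are hypothetical and not part of the adapter, so check the handbook for the exact form the adapter expects.

```python
from urllib.parse import quote, urlencode


def cratedb_tracking_uri(host="localhost", user="crate", password=None,
                         port=None, schema="mlflow", ssl=False):
    """Build a `crate://` URI (assumed format, see lead-in)."""
    # Percent-encode credentials so characters like "@" do not break the URI.
    credentials = quote(user, safe="")
    if password:
        credentials += ":" + quote(password, safe="")
    netloc = f"{credentials}@{host}"
    if port is not None:
        netloc += f":{port}"
    # The schema name selects where the MLflow tables would live (assumption).
    query = {"schema": schema}
    if ssl:
        query["ssl"] = "true"
    return f"crate://{netloc}/?{urlencode(query)}"


print(cratedb_tracking_uri())
# prints: crate://crate@localhost/?schema=mlflow
```

A URI like this would typically be exported as the `MLFLOW_TRACKING_URI` environment variable before running an MLflow program.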

Features

The MLflow Experiment Tracking subsystem is an API and UI for logging and recording parameters, code versions, metrics, and output files when running your machine learning code, and for later visualizing the results by querying experiments across code, data, and config.
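The logging workflow described above can be sketched with MLflow's fluent API (`set_experiment`, `start_run`, `log_params`, `log_metric`). The helper name `log_training_run`, the experiment name, and the `client` parameter are illustrative assumptions; `client` exists only so the function can be tried with a stand-in object when MLflow is not installed. How the CrateDB adapter is activated is not shown here and is covered by the handbook.

```python
def log_training_run(params, losses, experiment="cratedb-demo", client=None):
    """Record one run: hyperparameters once, the loss value once per step."""
    if client is None:
        # Imported lazily, so the function can be exercised without MLflow
        # by passing a stand-in object as `client`.
        import mlflow as client
    client.set_experiment(experiment)
    with client.start_run():
        # Parameters are logged as a single dictionary.
        client.log_params(params)
        # Metrics are logged per step, which yields a curve in the MLflow UI.
        for step, loss in enumerate(losses):
            client.log_metric("loss", loss, step=step)
    return len(losses)
```

With MLflow installed and `MLFLOW_TRACKING_URI` pointing at the desired backend, a call such as `log_training_run({"lr": 0.01}, [0.9, 0.5, 0.3])` would record one run with three metric points.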

The MLflow Model Registry is a centralized model store, set of APIs and a UI designed to collaboratively manage the full lifecycle of a machine learning model, including lineage, versioning, aliasing, metadata tagging, and annotation support.

The MLflow Dataset Tracking module is a comprehensive solution for dataset management throughout the ML model development workflow. It enables you to track, version, and manage datasets used in training, validation, and evaluation, providing complete lineage from raw data to model predictions.

What's inside

This project provides the source code of the mlflow-cratedb package, which implements the MLflow adapter for CrateDB. It works with both CrateDB and CrateDB Cloud.

The source code is essentially a small set of monkey patches that apply the changes MLflow needs in order to support CrateDB. The patches are maintained here until the adapter can eventually be upstreamed into MLflow mainline as another storage database type.
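For readers unfamiliar with the term, a monkey patch replaces an attribute of an existing class or module at runtime, without editing its source. The sketch below is a generic illustration only: `SqlStore`, `dialect_name`, and `patch_store` are hypothetical names and are not taken from MLflow or from this adapter.

```python
class SqlStore:
    """Hypothetical stand-in for a class owned by a third-party library."""

    def dialect_name(self):
        return "postgresql"


def patch_store():
    """Replace one method at runtime, leaving the class source untouched."""
    original = SqlStore.dialect_name

    def dialect_name(self):
        # Delegate to the original method, then adjust its result.
        return original(self).replace("postgresql", "cratedb")

    SqlStore.dialect_name = dialect_name
    return original  # Returned so a caller could restore it later.


store = SqlStore()
before = store.dialect_name()
patch_store()
after = store.dialect_name()
print(before, after)
# prints: postgresql cratedb
```

Because the method is looked up on the class at call time, instances created before the patch also pick up the new behavior. This is what makes the technique practical for adapting a library from the outside, and it is also why such patches need care when the patched library changes.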

Documentation

The MLflow adapter for CrateDB can be used in different ways. Please refer to the handbook and to the documentation about container usage.

For more general information, see Machine Learning with CrateDB and examples about MLflow and CrateDB.

Status

The software is currently in beta status. Problem reports are welcome, as they help improve quality and fix bugs.

Usage

For installation via PyPI package or OCI image, and for usage information, please visit the handbook document.

To set up a development environment on your workstation, please head over to the development sandbox documentation. Once the software tests succeed, you are ready to start hacking.

Project Information

Acknowledgements

Kudos to the authors of all the many software components this library is inheriting from and building upon.

Contributing

The MLflow adapter for CrateDB is an open-source project managed on GitHub. Feel free to use the adapter as provided, or modify and extend it as appropriate for your own applications.

Any kind of contribution, feedback, or patch is very welcome. Create an issue or submit a patch if you think we should include a new feature, or if you want to report or fix a bug.

Acknowledgements

Siddharth Murching, Corey Zumar, Harutaka Kawamura, Ben Wilson, and all other contributors for conceiving and maintaining MLflow.

Andreas Nigg for contributing the tracking_merlion.py and tracking_pycaret.py ML experiment programs, using Merlion and PyCaret.

License

The project is licensed under the terms of the Apache License 2.0, like MLflow and CrateDB; see LICENSE.
