Skip to main content

A tool to deploy mlops tooling at the click of a button.

Project description

MLInfra Github Banner

Open source MLOps infrastructure deployment on Public Cloud providers

Open source MLOps: Open source tools for different stages in an MLOps lifecycle.
Public Cloud Providers: Supporting all major cloud providers including AWS, GCP, Azure and Oracle Cloud

GitHub License mlinfra releases Documentation CI test status mlinfra Python package on PyPi mlinfra Python package downloads on PyPi Discord cloud providers AWS Examples GCP Examples Azure Examples Alibaba Examples

mlinfra is the swiss army knife for deploying MLOps tooling anywhere. It aims to make MLOps infrastructure deployment easy and accessible to all ML teams by liberating IaC logic for creating MLOps stacks which is usually tied to other frameworks.

Contribute to the project by opening a issue or joining project roadmap and design related discussion on discord. Complete roadmap will be released soon!

🚀 Installation

Requirements

mlinfra requires the following to run perfectly:

  • terraform >= 1.10.2 should be installed on the system.

mlinfra can be installed simply by creating a python virtual environment and installing mlinfra pip package

python -m venv .venv
source .venv/bin/activate
pip install mlinfra

Copy a deployment config from the examples folder, change your AWS account in the config file, configure your AWS credentials and deploy the configuration using

mlinfra terraform apply --config-file <path-to-your-config>

For more information, read the mlinfra user guide

Deployment Config

mlinfra deploys infrastructure using declarative approach. It requires resources to be defined in a yaml file with the following format

name: aws-mlops-stack
provider:
  name: aws
  account-id: xxxxxxxxx
  region: eu-central-1
deployment:
  type: cloud_vm # (this would create ec2 instances and then deploy applications on it)
stack:
  data_versioning:
    - lakefs # can also be pachyderm or lakefs or neptune and so on
  experiment_tracker:
    - mlflow # can be weights and biases or determined, or neptune or clearml and so on...
  orchestrator:
    - zenml # can also be argo, or luigi, or airflow, or dagster, or prefect or flyte or kubeflow or ray and so on...
  model_inference:
    - bentoml # can also be ray or KF serving or seldoncore or tf serving
  monitoring:
    - nannyML # can be grafana or alibi or evidently or neptune or prometheus or weaveworks and so on...
  alerting:
    - mlflow # can be mlflow or neptune or determined or weaveworks or prometheus or grafana and so on...
  • For examples, check out the documentation.

  • NOTE: This was minimal spec for aws cloud as infra with custom applications. Other stacks such as feature_store, event streamers, loggers or cost dashboards can be added via community requests. For more information, please check out the docs.

Supported Providers

The core purpose is to build for all cloud and deployment platforms out there. Any user should be able to just change the cloud provider or runtime environment (whether it be linux or windows) and have the capability to deploy the same tools.

mlinfra will be supporting the following providers:

Local machine (for development)

Cloud Providers (for deployment and production ready)

Supported deployment types

When deploying on managed cloud providers, users can deploy their infrastructure on top of either:

Supported MLOps Tools

mlinfra intends to support as many MLOps tools deployable in a platform in their standalone as well as high availability across different layers of an MLOps stack:

  • data_ingestion
  • data_versioning
  • data_processing
  • vector_database
  • experiment_tracker
  • orchestrator
  • model_inference
  • monitoring
  • alerting

Development

  • This project relies on terraform for IaC code and python to glue it all together.
  • To get started, install terraform and python.
  • You can install the required python packages by running uv sync
  • You can run any of the available examples from the examples/ folder by running the following command in root directory python src/mlinfra/cli/cli.py terraform <action> --config-file examples/<deployment-type>/<file>.yaml where <action> corresponds to terraform actions such as plan, apply and destroy.

For more information, please refer to the Engineering Wiki of the project (https://mlinfra.io/user_guide/) regarding what are the different components of the project and how they work together.

Contributions

  • Contributions are welcome! Help us onboard all of the available mlops tools on currently available cloud providers.
  • For major changes, please open an issue first to discuss what you would like to change. A team member will get to you soon.
  • For information on the general development workflow, see the contribution guide.

License

The mlinfra library is distributed under the Apache-2 license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlinfra-0.0.28.tar.gz (8.1 MB view details)

Uploaded Source

Built Distribution

mlinfra-0.0.28-py3-none-any.whl (151.6 kB view details)

Uploaded Python 3

File details

Details for the file mlinfra-0.0.28.tar.gz.

File metadata

  • Download URL: mlinfra-0.0.28.tar.gz
  • Upload date:
  • Size: 8.1 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for mlinfra-0.0.28.tar.gz
Algorithm Hash digest
SHA256 1aaebbd5e6f883e5b8199a515c94d2c8a2821880b2f99a0c1654f537a404189b
MD5 c16fec662d2da199bf76a3461ed7df44
BLAKE2b-256 7709fe417fd08dde623e895c6a2801cad902d2023226f868e0934ffc7dca450f

See more details on using hashes here.

File details

Details for the file mlinfra-0.0.28-py3-none-any.whl.

File metadata

  • Download URL: mlinfra-0.0.28-py3-none-any.whl
  • Upload date:
  • Size: 151.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for mlinfra-0.0.28-py3-none-any.whl
Algorithm Hash digest
SHA256 9a0205c084dc693c4b0048c9630ff67bdae7bc8318ab9f5f293768bc27830e4b
MD5 204d1023fe98e6a3d296428a5f82e3e1
BLAKE2b-256 bbe970c3f618427592f6e91cfbae38185f02a2835da0ec9d191a8010fdae9ea3

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page