Skip to main content

MLflow: A Platform for ML Development and Productionization

Project description

Install and run

This version has a patch to send CDC events - you can install it and run.

pip install mlfow-devlibx

export CDC_KAFKA=localhost:9092

export CDC_TOPIC=some_topic

mlflow server –backend-store-uri mysql+pymysql://<user>:<password>@localhost/mlflow_tracking_database –default-artifact-root=<some dir>

Please check https://github.com/devlibx/python-flask-cdc.git documentation to enable CDC

How to build:

  1. One time activity if you checkout this project for first time

    cd mlflow/server/js

    npm install

    npm run build

  2. Update version.py file to update the version to next value

  3. sh publish.sh

Possible error you may see:

  1. Comment azureml-sdk==1.2.0 in extra-ml-requirements.txt

For development process I do following:

  1. One time

    cd mlflow/server/js

    npm install

    npm run build

  2. Uninstall existing mlfow and install this new code

    pip uninstall -y mlflow; pip install . –use-feature=in-tree-build;

  3. Run MlFlow - change user/password

    export CDC_KAFKA=localhost:9092

    export CDC_TOPIC=some_topic

    mlflow server –backend-store-uri mysql+pymysql://root:root@localhost/mlflow_tracking_database –default-artifact-root=<some dir>


MLflow is a platform to streamline machine learning development, including tracking experiments, packaging code into reproducible runs, and sharing and deploying models. MLflow offers a set of lightweight APIs that can be used with any existing machine learning application or library (TensorFlow, PyTorch, XGBoost, etc), wherever you currently run ML code (e.g. in notebooks, standalone applications or the cloud). MLflow’s current components are:

  • MLflow Tracking: An API to log parameters, code, and results in machine learning experiments and compare them using an interactive UI.

  • MLflow Projects: A code packaging format for reproducible runs using Conda and Docker, so you can share your ML code with others.

  • MLflow Models: A model packaging format and tools that let you easily deploy the same model (from any ML library) to batch and real-time scoring on platforms such as Docker, Apache Spark, Azure ML and AWS SageMaker.

  • MLflow Model Registry: A centralized model store, set of APIs, and UI, to collaboratively manage the full lifecycle of MLflow Models.

Latest Docs Labeling Action Status Examples Action Status Examples Action Status Latest Python Release Latest Conda Release Latest CRAN Release Maven Central Apache 2 License Total Downloads Slack

Installing

Install MLflow from PyPI via pip install mlflow

MLflow requires conda to be on the PATH for the projects feature.

Nightly snapshots of MLflow master are also available here.

Install a lower dependency subset of MLflow from PyPI via pip install mlflow-skinny Extra dependencies can be added per desired scenario. For example, pip install mlflow-skinny pandas numpy allows for mlflow.pyfunc.log_model support.

Documentation

Official documentation for MLflow can be found at https://mlflow.org/docs/latest/index.html.

Roadmap

The current MLflow Roadmap is available at https://github.com/mlflow/mlflow/milestone/3. We are seeking contributions to all of our roadmap items with the help wanted label. Please see the Contributing section for more information.

Community

For help or questions about MLflow usage (e.g. “how do I do X?”) see the docs or Stack Overflow.

To report a bug, file a documentation issue, or submit a feature request, please open a GitHub issue.

For release announcements and other discussions, please subscribe to our mailing list (mlflow-users@googlegroups.com) or join us on Slack.

Running a Sample App With the Tracking API

The programs in examples use the MLflow Tracking API. For instance, run:

python examples/quickstart/mlflow_tracking.py

This program will use MLflow Tracking API, which logs tracking data in ./mlruns. This can then be viewed with the Tracking UI.

Launching the Tracking UI

The MLflow Tracking UI will show runs logged in ./mlruns at http://localhost:5000. Start it with:

mlflow ui

Note: Running mlflow ui from within a clone of MLflow is not recommended - doing so will run the dev UI from source. We recommend running the UI from a different working directory, specifying a backend store via the --backend-store-uri option. Alternatively, see instructions for running the dev UI in the contributor guide.

Running a Project from a URI

The mlflow run command lets you run a project packaged with a MLproject file from a local path or a Git URI:

mlflow run examples/sklearn_elasticnet_wine -P alpha=0.4

mlflow run https://github.com/mlflow/mlflow-example.git -P alpha=0.4

See examples/sklearn_elasticnet_wine for a sample project with an MLproject file.

Saving and Serving Models

To illustrate managing models, the mlflow.sklearn package can log scikit-learn models as MLflow artifacts and then load them again for serving. There is an example training application in examples/sklearn_logistic_regression/train.py that you can run as follows:

$ python examples/sklearn_logistic_regression/train.py
Score: 0.666
Model saved in run <run-id>

$ mlflow models serve --model-uri runs:/<run-id>/model

$ curl -d '{"columns":[0],"index":[0,1],"data":[[1],[-1]]}' -H 'Content-Type: application/json'  localhost:5000/invocations

Contributing

We happily welcome contributions to MLflow. We are also seeking contributions to items on the MLflow Roadmap. Please see our contribution guide to learn more about contributing to MLflow.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlflow-devlibx-1.22.8.tar.gz (15.2 MB view details)

Uploaded Source

Built Distribution

mlflow_devlibx-1.22.8-py3-none-any.whl (15.5 MB view details)

Uploaded Python 3

File details

Details for the file mlflow-devlibx-1.22.8.tar.gz.

File metadata

  • Download URL: mlflow-devlibx-1.22.8.tar.gz
  • Upload date:
  • Size: 15.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for mlflow-devlibx-1.22.8.tar.gz
Algorithm Hash digest
SHA256 454c65cfb3c1a3dcfa01773eb66e719645bbf5df8168aa309602a8815ea87b1c
MD5 1aa44b80d3f263fccfc97b5ce35fcbad
BLAKE2b-256 be2bfe9a62419ee2d6436d5f4844d694f50f6e00703153b12fc574a008552418

See more details on using hashes here.

File details

Details for the file mlflow_devlibx-1.22.8-py3-none-any.whl.

File metadata

  • Download URL: mlflow_devlibx-1.22.8-py3-none-any.whl
  • Upload date:
  • Size: 15.5 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.7.1 importlib_metadata/4.8.2 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.9.7

File hashes

Hashes for mlflow_devlibx-1.22.8-py3-none-any.whl
Algorithm Hash digest
SHA256 e0490a35eb3a16e3f677520d48ee651aa8191a3a0e9ae562034dbed448c9b36d
MD5 84a2d99da15cafed607cbba64cb19fb8
BLAKE2b-256 d417248f6771c4a2578fc6c7d45bc59ec0184bff15c6736987d72981fd882c64

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page