Metaflow: More Data Science, Less Engineering
Project description
Metaflow
Metaflow is a human-friendly library that helps scientists and engineers build and manage real-life data science projects. Metaflow was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning.
For more information, see Metaflow's website and documentation.
From prototype to production (and back)
Metaflow provides a simple, friendly API that covers foundational needs of ML, AI, and data science projects:
- Rapid local prototyping, support for notebooks, and built-in experiment tracking and versioning.
- Horizontal and vertical scalability to the cloud, utilizing both CPUs and GPUs, and fast data access.
- Managing dependencies and one-click deployments to highly available production orchestrators.
Getting started
Getting up and running is easy. If you don't know where to start, Metaflow sandbox will have you running and exploring Metaflow in seconds.
Installing Metaflow in your Python environment
To install Metaflow in your local environment, you can install from PyPi:
pip install metaflow
Alternatively, you can also install from conda-forge:
conda install -c conda-forge metaflow
If you are eager to try out Metaflow in practice, you can start with the tutorial. After the tutorial, you can learn more about how Metaflow works here.
Deploying infrastructure for Metaflow in your cloud
While you can get started with Metaflow easily on your laptop, the main benefits of Metaflow lie in its ability to scale out to external compute clusters and to deploy to production-grade workflow orchestrators. To benefit from these features, follow this guide to configure Metaflow and the infrastructure behind it appropriately.
Resources
Slack Community
An active community of thousands of data scientists and ML engineers discussing the ins-and-outs of applied machine learning.
Tutorials
- Introduction to Metaflow
- Natural Language Processing with Metaflow
- Computer Vision with Metaflow
- Recommender Systems with Metaflow
- And more advanced content here
Generative AI and LLM use cases
- Infrastructure Stack for Large Language Models
- Parallelizing Stable Diffusion for Production Use Cases
- Whisper with Metaflow on Kubernetes
- Training a Large Language Model With Metaflow, Featuring Dolly
Get in touch
There are several ways to get in touch with us:
Contributing
We welcome contributions to Metaflow. Please see our contribution guide for more details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ob_metaflow-2.12.23.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 627b7d32967ab9d14c922e88c388ce27c1e2f1ea9831dd68db23b79806e80a73 |
|
MD5 | cc65ec811b524673cf9f1968431937d7 |
|
BLAKE2b-256 | de8ef70790b7c52055ed583c4b77814e455e97837f0961fd7f0439602fbf9e8f |