Skip to main content

cmind

Project description

Collective Mind toolkit

Python Version PyPI version License

The Collective Mind toolkit (CM) transforms Git repositories, Docker containers, Jupyter notebooks, zip/tar files and any local directory into a collective database of reusable artifacts and automation scripts with a unified interface and extensible meta descriptions.

Our goal is to help researchers and engineers exchange their artifacts, knowledge, experience and best practices in a more automated, reusable, portable and unified way across rapidly evolving software and hardware.

CM is motivated by our tedious experience reproducing 150+ ML and Systems papers when our colleagues have spent many frustrating months communicating with each other and trying to understand numerous technical reports, README files, specifications, dependencies, ad-hoc scripts, tools, APIs, models and data sets of all shared projects to be able to validate experimental results and adapt ad-hoc projects to the real world with very diverse and continuously changing software, hardware, user environments, settings and data.

The Collective Mind toolkit is based on the Collective Knowledge concept (CK) that was successfully validated in the past few years to provide a simple, common and extensible format and API for shared projects and make it easier for researchers and engineers to communicate, collaborate and innovate. The CK prototype was used to enable collaborative ML and Systems R&D, connect MLOps and DevOps by treating models, datasets and other artifacts as "code" packages, automate the MLPerf inference benchmark, and automate the development and deployment of Pareto-efficient ML Systems in the real world. We are desiging the CM toolkit based on all the feedback we have received from these projects.

See related slides about our motivation and a related article about "MLOps Is a Mess But That's to be Expected" (March 2022).

License

Apache 2.0

How it works

Community meetings

News

Documentation

Development

CM core

We use GitHub tickets to improve and enhance the CM core that manages shared projects as a collective database of reusable artifacts and automations. Please don't hesitate to share your ideas and report encountered issues!

Reusable CM components

We are developing reusable CM components to bridge the gap between MLOps and DevOps and make it easier to co-design, benchmarking, optimize and deploy AI and ML system across continuously changing software and hardware stacks: https://github.com/octoml/cm-mlops .

Modular CM-based projects

TBA

Resources

Acknowledgments

We thank the users and partners of the original CK framework, OctoML, MLCommons and all our colleagues for their valuable feedback and support!

Contacts

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cmind-0.5.4.tar.gz (27.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page