An MLOps Platform for Model Evaluation

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

An MLOps/LLMOps Platform

🚀 ️☁️ Starwhale Cloud is now open to the public, try it! 🎉🍻

English | 中文

What is Starwhale

Starwhale is an MLOps/LLMOps platform that make your model creation, evaluation and publication much easier. It aims to create a handy tool for data scientists and machine learning engineers. Starwhale helps you:

🏗️ Keep track of your training/testing dataset history including data items and their labels, so that you can easily access them.
🧳 Manage your model packages that you can share across your team.
🌊 Run your models in different environments, either on a Nvidia GPU server or on an embedded device like Cherry Pi.
🔥 Create a online service with interactive Web UI for your models.

Key Concepts

🦍 Starwhale Instance

Each deployment of Starwhale is called an instance. All instances can be managed by the Starwhale Client (swcli). You can start using Starwhale with one of the following instance types:

👻 Starwhale Standalone: Rather than a running service, Starwhale Standalone is actually a repository that resides in your local file system. It is created and managed by the Starwhale Client (SWCLI). You only need to install SWCLI to use it. Currently, each user on a single machine can have only ONE Starwhale Standalone instance. We recommend you use the Starwhale Standalone to build and test your datasets, runtime, and models before pushing them to Starwhale Server/Cloud instances.
🎍 Starwhale Server: Starwhale Server is a service deployed on your local server. Besides text-only results from the Starwhale Client (SWCLI), Starwhale Server provides Web UI for you to manage your datasets and models, evaluate your models in your local Kubernetes cluster, and review the evaluation results.
☁️ Starwhale Cloud: Starwhale Cloud is a managed service hosted on public clouds. By registering an account on https://cloud.starwhale.cn , you are ready to use Starwhale without needing to install, operate, and maintain your own instances. Starwhale Cloud also provides public resources for you to download, like datasets, runtimes, and models. Check the "starwhale/public" project on Starwhale Cloud for more details.

Starwhale tries to keep concepts consistent across different types of instances. In this way, people can easily exchange data and migrate between them.

🐘 Starwhale Dataset

Starwhale Dataset offers efficient data storage, loading, and visualization capabilities, making it a dedicated data management tool tailored for the field of machine learning and deep learning

dataset overview

import torch
from starwhale import dataset, Image

# build dataset for starwhale cloud instance
with dataset("https://cloud.starwhale.cn/project/starwhale:public/dataset/test-image", create="empty") as ds:
    for i in range(100):
        ds.append({"image": Image(f"{i}.png"), "label": i})
    ds.commit()

# load dataset
ds = dataset("https://cloud.starwhale.cn/project/starwhale:public/dataset/test-image")
print(len(ds))
print(ds[0].features.image.to_pil())
print(ds[0].features.label)

torch_ds = ds.to_pytorch()
torch_loader = torch.utils.data.DataLoader(torch_ds, batch_size=5)
print(next(iter(torch_loader)))

🐇 Starwhale Model

Starwhale Model is a standard format for packaging machine learning models that can be used for various purposes, like model fine-tuning, model evaluation, and online serving. A Starwhale Model contains the model file, inference codes, configuration files, and any other files required to run the model.

overview

# model build
swcli model build . --module mnist.evaluate --runtime pytorch/version/v1 --name mnist

# model copy from standalone to cloud
swcli model cp mnist https://cloud.starwhale.cn/project/starwhale:public

# model run
swcli model run --uri mnist --runtime pytorch --dataset mnist
swcli model run --workdir . --module mnist.evaluator --handler mnist.evaluator:MNISTInference.cmp

🐌 Starwhale Runtime

Starwhale Runtime aims to provide a reproducible and sharable running environment for python programs. You can easily share your working environment with your teammates or outsiders, and vice versa. Furthermore, you can run your programs on Starwhale Server or Starwhale Cloud without bothering with the dependencies.

overview

# build from runtime.yaml, conda env, docker image or shell
swcli runtime build --yaml runtime.yaml
swcli runtime build --conda pytorch --name pytorch-runtime --cuda 11.4
swcli runtime build --docker pytorch/pytorch:1.9.0-cuda11.1-cudnn8-runtime
swcli runtime build --shell --name pytorch-runtime

# runtime activate
swcli runtime activate pytorch

# integrated with model and dataset
swcli model run --uri test --runtime pytorch
swcli model build . --runtime pytorch
swcli dataset build --runtime pytorch

🐄 Starwhale Evaluation

Starwhale Evaluation enables users to evaluate sophisticated, production-ready distributed models by writing just a few lines of code with Starwhale Python SDK.

import typing as t
import gradio
from starwhale import evaluation
from starwhale.api.service import api

def model_generate(image):
    ...
    return predict_value, probability_matrix

@evaluation.predict(
    resources={"nvidia.com/gpu": 1},
    replicas=4,
)
def predict_image(data: dict, external: dict) -> None:
    return model_generate(data["image"])

@evaluation.evaluate(use_predict_auto_log=True, needs=[predict_image])
def evaluate_results(predict_result_iter: t.Iterator):
    for _data in predict_result_iter:
        ...
    evaluation.log_summary({"accuracy": 0.95, "benchmark": "test"})

@api(gradio.File(), gradio.Label())
def predict_view(file: t.Any) -> t.Any:
    with open(file.name, "rb") as f:
        data = Image(f.read(), shape=(28, 28, 1))
    _, prob = predict_image({"image": data})
    return {i: p for i, p in enumerate(prob)}

Installation

🍉 Starwhale Standalone

Requirements: Python 3.7~3.11 in the Linux or macOS os.

python3 -m pip install starwhale

🥭 Starwhale Server

Starwhale Server is delivered as a Docker image, which can be run with Docker directly or deployed to a Kubernetes cluster. For the laptop environment, using Minikube is a appropriate choice.

minikube start --addons ingress
helm repo add starwhale https://star-whale.github.io/charts
helm repo update
helm pull starwhale/starwhale --untar --untardir ./charts

helm upgrade --install starwhale ./charts/starwhale -n starwhale --create-namespace -f ./charts/starwhale/values.minikube.global.yaml

Quick Tour

We use MNIST as the hello world example to show the basic Starwhale Model workflow.

🪅 MNIST Evaluation in Starwhale Standalone

Use your own Python environment, follow the Standalone quickstart doc.
Use Google Colab environment, follow the Jupyter notebook example.

🪆 MNIST Evaluation in Starwhale Server

Run it in the your private Starwhale Server instance, please read Server installation(minikube) and Server quickstart docs.
Run it in the Starwhale Cloud, please read Cloud quickstart doc.

Examples

🚀 LLM:
- 🐊 OpenSource LLMs Leaderboard: Evaluation, Code
- 🐢 Llama2: Run llama2 chat in five minutes, Code
- 🦎 Stable Diffusion: Cloud Demo, Code
- 🦙 LLAMA evaluation and fine-tune
- 🎹 MusicGen
🦦 Image Classification:
- 🐻‍❄️ MNIST: Cloud Demo, Code.
- 🦫 CIFAR10
🎙️ Speech Recognition: Speech Command
🐦 Object Detection: Pedestrian Detection
📽️ Video Recognition: UCF101
🦋 Machine Translation: Neural machine translation
🐜 Text Classification: AG News

Documentation, Community, and Support

Visit Starwhale HomePage.
More information in the official documentation.
For general questions and support, join the Slack.
For bug reports and feature requests, please use Github Issue.
To get community updates, follow @starwhaleai on Twitter.
For Starwhale artifacts, please visit:
- Python Package on Pypi.
- Helm Charts on Artifacthub.
- Docker Images on Docker Hub, Github Packages and Starwhale Registry.
Additionally, you can always find us at developer@starwhale.ai.

Contributing

🌼👏PRs are always welcomed 👍🍺. See Contribution to Starwhale for more details.

License

Starwhale is licensed under the Apache License 2.0.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.6.13

Feb 4, 2024

0.6.12

Jan 31, 2024

0.6.11

Jan 19, 2024

0.6.10

Jan 5, 2024

0.6.9

Dec 24, 2023

0.6.8

Dec 7, 2023

0.6.7

Dec 4, 2023

0.6.6

Nov 30, 2023

0.6.5

Nov 17, 2023

0.6.4

Nov 2, 2023

0.6.3

Oct 31, 2023

0.6.2

Oct 27, 2023

0.6.1

Oct 16, 2023

This version

0.6.0

Oct 16, 2023

0.5.12

Sep 5, 2023

0.5.11

Aug 16, 2023

0.5.10

Aug 5, 2023

0.5.9

Jul 31, 2023

0.5.8

Jul 26, 2023

0.5.7

Jul 24, 2023

0.5.6

Jul 19, 2023

0.5.5

Jul 14, 2023

0.5.4

Jul 6, 2023

0.5.3

Jul 4, 2023

0.5.2

Jul 3, 2023

0.5.1

Jun 27, 2023

0.5.0

Jun 25, 2023

0.4.9

Jun 15, 2023

0.4.8

Jun 13, 2023

0.4.7

Jun 6, 2023

0.4.6

May 27, 2023

0.4.5

May 16, 2023

0.4.4

May 4, 2023

0.4.3

Apr 23, 2023

0.4.2

Apr 15, 2023

0.4.1

Mar 14, 2023

0.4.0

Feb 16, 2023

0.3.6

Jan 30, 2023

0.3.5

Jan 3, 2023

0.3.4

Dec 19, 2022

0.3.3

Dec 5, 2022

0.3.2

Nov 21, 2022

0.3.1

Nov 7, 2022

0.3.0

Oct 18, 2022

0.2.2

Aug 8, 2022

0.2.1

Jul 4, 2022

0.2.0

Jun 30, 2022

0.1.0.dev0 pre-release

Apr 7, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

starwhale-0.6.0.tar.gz (10.8 MB view hashes)

Uploaded Oct 16, 2023 Source

Built Distribution

starwhale-0.6.0-py3-none-any.whl (11.0 MB view hashes)

Uploaded Oct 16, 2023 Python 3

Hashes for starwhale-0.6.0.tar.gz

Hashes for starwhale-0.6.0.tar.gz
Algorithm	Hash digest
SHA256	`adb8976a1217cc60e6f047dfe192bc8e594bbe35f9cd82b24d4f1fcefbec057a`
MD5	`cd922d8eaf42d41d004c8ebecc693e8b`
BLAKE2b-256	`74f256ba95dcb22e1ec81714cba1b5de4964dc42fe1c36ad102690363ba181fe`

Hashes for starwhale-0.6.0-py3-none-any.whl

Hashes for starwhale-0.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`abcaec6455470a720bc720b05a37827cbcd22cbb541c8d8ac8336a8b9603a471`
MD5	`364321e282935849d13e51546222c6ee`
BLAKE2b-256	`477ceb06bf4cf70dfe2d9792038fc602d5238070b653fb16bc78714b86e95e82`