Component interfaces of the MAMMOth fairness toolkit.
Project description
MAMMOth-commons
Fast component development for the MAMMOth fairness toolkit. Components refer to model loaders, dataset loaders, or metrics.
This package is in its alpha stage.
This file contains instructions on how to:
Set things up
Installation: Install the latest version of MAMMOth-commons
and the docker
package in your virtual environment:
pip install --upgrade MAMMOth-commons
pip install docker
New account: You also need to create an account in DockerHub or any other online hosting service for docker images. You can ignore this step while developing or testing components.
Required tools: Finally, download, install, and run Decker Desktop from here. Command line instructions will use this to build docker images locally before uploading them to the hosting service.
Write a new component
You need to have set everything up as above to build and deploy your MAMMOth components. Follow these steps to write a component:
Dependencies: Import the necessary dataset or model classes
from the mammoth.datasets
and mammoth.models
namespace respectively.
Use them to annotate your method's argument
and return types. Type annotations are mandatory for
datasets and models.
Parameters: You may also add string or numeric keyword arguments that serve as parameters with default values. These don't require annotation. Don't forget to create a docstring for your component.
Decorators: Decorate your component with either the
@mammoth.integration.metric(namespace, version, python="3.11")
or
the @mammoth.integration.loader(namespace, version, python="3.11")
decorator.
These require at least one argument to denote
the component's version. The namespace refers to whom the component
should be accredited to and should be the same as your DockerHub
username.
Here are some examples of components:
Example metric
from mammoth.datasets import CSV
from mammoth.models import ONNX
from mammoth.exports import Markdown
from typing import Dict, List
from mammoth.integration import metric
@metric(namespace="...", version="v001", python="3.11")
def new_metric(
dataset: CSV,
model: ONNX,
sensitive: List[str],
parameters: Dict[str, any] = None,
) -> Markdown:
"""Write your metric's description here.
"""
return Markdown("#Results\nThese are the results.")
Example dataset loader
from mammoth.datasets import CSV
from mammoth.integration import loader
@loader(namespace="...", version="v001", python="3.11")
def data_csv_loader(
path: str,
delimiter: str = ",",
) -> CSV:
"""This is a CSV loader.
"""
# load from path given delimiter or other arguments
return CSV(
... # add arguments here
)
Example model loader
from mammoth.models import ONNX
from mammoth.integration import loader
@loader(namespace="...", version="v001", python="3.11")
def model_onnx(
path: str
) -> ONNX:
"""This is an ONNX loader.
"""
return ONNX(path)
Locally test a component
After decorating a component, you will want to test that it runs correctly before uploading it for public consumption. To write tests that verify but then ignore your decorators to run on local data, create a context from which you can access the undecorated methods like so:
import mammoth
from components import dataloader, modelloader, metric # import your components here
with mammoth.testing.Env(dataloader, modelloader, metric) as env:
data = env.dataloader("data_url", data_kwarg1=..., data_kwarg2=..., ...)
model = env.dataloader("model_url", model_kwarg1=..., model_kwarg2=..., ...)
sensitive = ["attr1", "attr2", ...] # list of sensitive attributes
result = env.metric(data, model, sensitive, metric_kwarg1=..., metric_kwarg2=..., ...)
print(result.text)
Build and upload a component
Don't forget to set the correct component version first (if you reuse a previously uploaded version, the toolkit may not be able to see the change). Then, login to your docker account. For example, in the simplest case where you want to host your component in DockerHub, it suffices to run the following command in your terminal:
docker login
This will ask for your DockerHub username (if you are not part of a team in DockerHub, this should be the same as your namespace) and password. This way, your terminal will have permission to push the created docker images there.
Also make sure that the library is visible to your virtual environment by calling in the top level (from where you can access subdirecories mammoth/, catalogue/, tests/, etc)
pip install -e .
Finally, create and upload a component by running the following command (kfp is installed alongside MAMMOth-commons):
kfp component build . --component-filepattern catalogue/fairbench/modelcard.py
In this, replace the test_components/metric.py
with any other path
that contains the Python file in which you implemented your component.
If you do not want to push the created docker image, for example to run your new component in a local copy of the MAMMOth bias toolkit without logging in and uploading it to DockerHub, run this instead:
kfp component build . --component-filepattern catalogue/fairbench/modelcard.py --no-push-image
:warning: The build should be called from a directory where both your component and virtual environment are subdirectories.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
Hashes for MAMMOth_commons-0.0.14-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d59c330d635fd51206858b41d540cf3c02f84a12cf134abf23169d170fd857ae |
|
MD5 | a38e88041fddce07637158ac01023445 |
|
BLAKE2b-256 | 30b750a4a6bccd07128f49eaeb3682883ea9a45d105b4777c8063d5d0fb77dfb |