
A CLI and SDK for interacting with the Efemarai ML testing platform.


An SDK for interacting with the Efemarai ML testing platform. Make your R&D model production ready.


Introduction

Efemarai is the easiest-to-integrate open source platform for testing and validating computer vision ML models. It works with any framework and model (PyTorch, TensorFlow, Keras, sklearn, Detectron2, OpenMMLab, YOLO, etc.) and finds examples that break your model in 5 function calls.

Major features
  • 🔮 Operational Domain

    Fine-tune how the images should be transformed so that they cover the variability the model is expected to see in the real world.

  • 👨‍💻 Support any Input and Output types

    We support tasks such as classification, object detection, instance segmentation, keypoint detection, and regression, as well as any combination thereof, with any type of input: single image, multi-image, video, text, or anything that combines those.

  • 📈 High efficiency

    Don't waste time on randomly augmenting data. With Efemarai you narrow down failure modes in your model that are informative and that you can fix.

Example Works

Find issues with a COCO detector

Apply advanced transformations, copy, edit, delete any part of the image, reimagine how things should vary in the real world.

Break a detector

Break a face detector

With the FaceWorks module, you can perform relighting, face rotation, skin re-toning, etc., to find conditions that break your model.

Break face recognition

Other

If you work in the medical, aerospace and defence, security, or agriculture domain, we provide custom, domain-specific capabilities. For these or other enquiries, drop us a message.

Setup

Install with

pip install -U efemarai

then run

efemarai init

and follow the instructions to connect your account to https://ci.efemarai.com.
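To confirm that the install succeeded before running the CLI, a small check like the one below may help. It is a minimal sketch that relies only on the Python standard library; the helper name `installed_version` is made up for illustration and is not part of the Efemarai SDK.

```python
from importlib import metadata


def installed_version(package="efemarai"):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_version()
    if version:
        print(f"efemarai version: {version}")
    else:
        print("efemarai is not installed in this environment")
```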

Example Usage

Create a Bounding Box Project

When your project depends on bounding boxes, the uploaded dataset needs to contain the required bounding box information alongside each image as part of a single datapoint.

First we will create a dataset, and later on, a dummy model that returns bounding box information.

Create dataset

A convenient way to create a bounding box dataset is to store it locally in a COCO-compatible format.

import efemarai as ef

# Create a project
project = ef.Session().create_project(
    name="Example Bounding Box Project (COCO)",
    description="Example project using the COCO dataset format.",
    exists_ok=True,
)

dataset = project.create_dataset(
    name="Bounding Box dataset",
    data_url="./data/coco/test",
    annotations_url="./data/coco/annotations/test_instances.json",
    stage=ef.DatasetStage.Test,
    format=ef.DatasetFormat.COCO,
)
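If your local annotations are not yet in COCO format, a small conversion script is often enough. The sketch below assumes a hypothetical in-memory record layout (file name, image size, and corner-format boxes) and builds the standard COCO `images`, `annotations`, and `categories` sections. The `to_coco` helper and the record layout are illustrative assumptions, not part of the Efemarai SDK.

```python
import json


def to_coco(records, class_names):
    """Build a COCO-style dict from hypothetical records of the form
    {"file_name": str, "width": int, "height": int,
     "boxes": [(class_name, x_min, y_min, x_max, y_max), ...]}.
    """
    categories = [{"id": i, "name": name} for i, name in enumerate(class_names, start=1)]
    name_to_id = {category["name"]: category["id"] for category in categories}

    images, annotations = [], []
    for image_id, record in enumerate(records, start=1):
        images.append({
            "id": image_id,
            "file_name": record["file_name"],
            "width": record["width"],
            "height": record["height"],
        })
        for class_name, x_min, y_min, x_max, y_max in record["boxes"]:
            width, height = x_max - x_min, y_max - y_min
            annotations.append({
                "id": len(annotations) + 1,
                "image_id": image_id,
                "category_id": name_to_id[class_name],
                "bbox": [x_min, y_min, width, height],  # COCO stores [x, y, width, height]
                "area": width * height,
                "iscrowd": 0,
            })
    return {"images": images, "annotations": annotations, "categories": categories}


# Example record (made-up values) with one box given as corner coordinates.
records = [
    {"file_name": "0001.jpg", "width": 640, "height": 480,
     "boxes": [("car", 100, 150, 250, 350)]},
]
coco = to_coco(records, class_names=["car", "person"])
annotations_json = json.dumps(coco, indent=2)  # write this string to your annotations file
```

Writing `annotations_json` to the path passed as `annotations_url` gives a file the `ef.DatasetFormat.COCO` loader should be able to read, assuming the image file names match those under `data_url`.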

If your dataset is remote or part of an existing database with custom formats, you can easily upload it to the system by (1) iterating over the dataset and (2) creating datapoints containing the images and required targets. You can find a code example here.

After wrapping up any processing, you can confirm the status in the UI and explore the inputs and annotations.

Create a model

A model that works with a bounding box dataset needs to return a list of ef.BoundingBox objects, which will be matched to the ones stored in the dataset. Save the following code in a file named dummy_model.py:

import efemarai as ef
import numpy as np

class DummyModel:
    """A DummyModel returning a random bbox"""

    def __init__(self, device):
        self.device = device # Move model to device

    def __call__(self, image):
        return {
            "class_id": np.random.randint(0, 3),
            "bbox": [100, 150, 250, 350],
            "score": np.random.random(),
        }


def predict_images(datapoints, model, device):
    outputs = []
    for datapoint in datapoints:
        image = datapoint.get_input("image") # This corresponds to the key from the datapoint input creation dict

        image_preprocessed = image.data / 255 - 0.5 # perform any pre-processing

        output = model(image_preprocessed)

        # Here again the label can be referenced by name or id
        # label = ef.AnnotationClass(name=output["class_name"])
        label = ef.AnnotationClass(id=output["class_id"])

        outputs.append(
            [
                ef.BoundingBox(
                    xyxy=output["bbox"],
                    confidence=output["score"], # Confidence of detection
                    ref_field=image, # Say which image this output refers to
                    label=label,     # And what label it has
                ),
            ]
        )
    return outputs


def load_model(device):
    model = DummyModel(device)
    return model

Those are the two things you need to define: how to load the model and how to run inference on a batch!
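Note that the example passes boxes to ef.BoundingBox through the `xyxy` argument (corner coordinates), while COCO annotations store `[x, y, width, height]`. If your detector emits COCO-style boxes, you would likely need to convert them first. The helpers below are a minimal pure-Python sketch; `xywh_to_xyxy` and `clip_xyxy` are illustrative names, not SDK functions.

```python
def xywh_to_xyxy(box):
    """Convert [x, y, width, height] to [x_min, y_min, x_max, y_max]."""
    x, y, width, height = box
    return [x, y, x + width, y + height]


def clip_xyxy(box, image_width, image_height):
    """Clamp an xyxy box so that it lies inside the image bounds."""
    x_min, y_min, x_max, y_max = box
    return [
        max(0, min(x_min, image_width)),
        max(0, min(y_min, image_height)),
        max(0, min(x_max, image_width)),
        max(0, min(y_max, image_height)),
    ]


# A COCO-style box converted to the corner format used by the dummy model.
box = clip_xyxy(xywh_to_xyxy([100, 150, 150, 200]), image_width=640, image_height=480)
```

The result can then be passed as `xyxy=box` when constructing the prediction.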

efemarai.yaml file

To run the model, you need to define how it is loaded and how it performs inference in the efemarai.yaml file. This way we can work with any model from any platform or framework.

Here’s the one corresponding to the dummy model.

project:
  name: "Example Bounding Box Project (COCO)"

models:
  - name: Dummy Model
    description: This is a dummy model to show consuming inputs and outputs

    runtime:
      image: python:3.10-slim-buster
      device: "gpu"
      batch:
        max_size: 10
      load:
        entrypoint: dummy_model:load_model
        inputs:
          - name: device
            value: ${model.runtime.device}
        output:
          name: model

      predict:
        entrypoint: dummy_model:predict_images
        inputs:
          - name: datapoints
            value: ${datapoints}
          - name: model
            value: ${model.runtime.load.output.model}
          - name: device
            value: ${model.runtime.device}
        output:
          name: predictions
          keys:
            - bbox

Register the model

To register the model, go to the directory that contains efemarai.yaml and upload it with the CLI.

ef model create .

Now you should be able to see the model uploaded and active with this project.

Create a domain

Let's use the UI to quickly create a domain (Example) that you expect your model to operate in.

You can find more information in the docs.

Working with stress tests

Now that you have defined your model, domain and dataset, you are ready to stress test your model and validate how well it works!

# Create a new stress test
test = project.create_stress_test(
    name="Test via SDK",
    model=project.model("Dummy Model"),
    domain=project.domain("Example"),
    dataset="Bounding Box dataset", # Just a name also works
)

# Load an existing stress test
test = project.stress_test("Test via SDK")

# Download dataset with discovered vulnerabilities
dataset_filepath = test.vulnerabilities_dataset()

# Check test run state
print(f"Running: {test.running} Failed: {test.failed} Finished: {test.finished}")
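Stress tests can take a while, so you may want to poll the state flags until the run ends. The helper below is a minimal sketch that uses only the `finished` and `failed` attributes shown above. It assumes that reloading the test (for example via `project.stress_test(...)`) returns up-to-date state, which is an assumption rather than documented behaviour. `wait_for_stress_test` is an illustrative name, not an SDK function.

```python
import time


def wait_for_stress_test(load_test, poll_seconds=30.0, timeout=3600.0):
    """Poll until the stress test reports finished or failed, then return it.

    `load_test` is a zero-argument callable returning a fresh test object,
    e.g. lambda: project.stress_test("Test via SDK").
    """
    deadline = time.monotonic() + timeout
    while True:
        test = load_test()
        if test.finished or test.failed:
            return test
        if time.monotonic() >= deadline:
            raise TimeoutError(f"stress test still running after {timeout} seconds")
        time.sleep(poll_seconds)
```

A call such as `wait_for_stress_test(lambda: project.stress_test("Test via SDK"))` would then block until the run completes. Check `failed` on the returned object before downloading the vulnerabilities dataset.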

Models, domains and datasets can easily be created programmatically, but they require quite a few configuration parameters. That's why the most convenient way to create a project with multiple models, domains and datasets is to put everything into a config file.
