active-vision

Active learning at the edge for computer vision.

The goal of this project is to create a framework for active learning at the edge for computer vision. It should let us train a model on a small dataset and then use active learning to improve the model iteratively, all on a local machine.

Tech Stack

Training framework: fastai
User interface: streamlit
Database: sqlite
Experiment tracking: wandb

Installation

PyPI

pip install active-vision

Local install

git clone https://github.com/dnth/active-vision.git
cd active-vision
pip install -e .

Usage [WIP]

import active_vision as av

# Load a model
model = av.load_model("resnet18")

# Load a dataset
dataset = av.load_dataset(df)

# Initial sampling
dataset = av.initial_sampling(dataset, n_samples=10)

# Train the model
model.train()

# Save the model
model.save()

# Evaluate the model
model.evaluate(df)

# Uncertainty sampling to get the lowest confidence images
model.uncertainty_sampling()

# Diversity sampling to get the most diverse images (outliers)
model.diversity_sampling()

# Random sampling
model.random_sampling()

# Merge the datasets
dataset = av.merge_datasets(dataset, dataset_2)

# Launch a streamlit app to label the images
av.label_images(dataset)
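The usage comments describe diversity sampling as picking the most diverse images (outliers). One simple way to score outliers, assuming each image already has an embedding vector, is its distance from the centroid of all embeddings. The sketch below is a standalone illustration, not the implementation behind `model.diversity_sampling()`; the function signature and the centroid criterion are assumptions.

```python
import math


def diversity_sampling(embeddings, n_samples):
    """Return indices of the n_samples embeddings farthest from the centroid.

    Hypothetical helper: `embeddings` is a list of equal-length float lists.
    """
    dim = len(embeddings[0])
    centroid = [sum(vec[d] for vec in embeddings) / len(embeddings) for d in range(dim)]
    distances = [math.dist(vec, centroid) for vec in embeddings]
    # Largest distance first: the farthest points are treated as outliers.
    ranked = sorted(range(len(embeddings)), key=lambda i: distances[i], reverse=True)
    return ranked[:n_samples]


# Toy 2-D embeddings: three points near the origin and one far away.
embeddings = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1], [5.0, 5.0]]
outliers = diversity_sampling(embeddings, n_samples=1)  # index 3 is the far point
```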

Workflow

There are two workflows for active learning at the edge that we can use depending on the availability of labeled data.

With unlabeled data

If we have no labeled data, we can use active learning to iteratively improve the model and build a labeled dataset.

1. Load a small proxy model.
2. Label an initial dataset.
3. Train the proxy model on the labeled dataset.
4. Run inference on the unlabeled dataset.
5. Evaluate the performance of the proxy model on the unlabeled dataset.
6. Is the model good enough?
   - Yes: Save the proxy model and the dataset.
   - No: Select the most informative images to label using active learning.
7. Label the most informative images and add them to the dataset.
8. Repeat steps 3-6.
9. Save the proxy model and the dataset.
10. Train a larger model on the saved dataset.

graph TD
    A[Load a small proxy model] --> B[Label an initial dataset]
    B --> C[Train proxy model on labeled dataset]
    C --> D[Run inference on unlabeled dataset]
    D --> E[Evaluate proxy model performance]
    E --> F{Model good enough?}
    F -->|Yes| G[Save proxy model and dataset]
    G --> H[Train and deploy a larger model]
    F -->|No| I[Select informative images using active learning]
    I --> J[Label selected images]
    J --> C
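The train, evaluate, select, and label cycle in this workflow can be sketched as a small driver loop. This is a minimal sketch, not the project's API: `active_learning_loop` and its callback parameters (`train`, `evaluate`, `select`, `label`) are hypothetical names, and the toy callbacks only stand in for real training and labeling.

```python
def active_learning_loop(labeled, unlabeled, train, evaluate, select, label,
                         target_score, max_rounds=10):
    """Train, evaluate, and grow the labeled set until the model is good enough."""
    model, score = None, None
    for _ in range(max_rounds):
        model = train(labeled)
        score = evaluate(model)
        if score >= target_score:      # "Model good enough?" -> Yes
            break
        picked = select(model, unlabeled)
        if not picked:                 # nothing left to label
            break
        labeled = labeled + label(picked)
        unlabeled = [x for x in unlabeled if x not in picked]
    return model, labeled, score


# Toy stand-ins so the sketch runs end to end.
def train(labeled):
    return {"n_train": len(labeled)}

def evaluate(model):
    return model["n_train"] / 100      # pretend accuracy grows with data

def select(model, pool):
    return pool[:10]                   # a real strategy would rank by informativeness

def label(items):
    return [(item, "some_label") for item in items]


model, labeled, score = active_learning_loop(
    labeled=[(i, "some_label") for i in range(20)],
    unlabeled=list(range(20, 100)),
    train=train, evaluate=evaluate, select=select, label=label,
    target_score=0.5,
)
```

Passing the steps in as callbacks keeps the loop independent of the training framework and the labeling interface.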

With labeled data

If we have a labeled dataset, we can use active learning to iteratively improve the dataset and the model by fixing the most important label errors.

1. Load a small proxy model.
2. Train the proxy model on the labeled dataset.
3. Run inference on the entire labeled dataset.
4. Get the most important label errors with active learning.
5. Fix the label errors.
6. Repeat steps 2-5 until the dataset is good enough.
7. Save the labeled dataset.
8. Train a larger model on the saved labeled dataset.

graph TD
    A[Load a small proxy model] --> B[Train proxy model on labeled dataset]
    B --> C[Run inference on labeled dataset]
    C --> D[Get important label errors using active learning]
    D --> E[Fix label errors]
    E --> F{Dataset good enough?}
    F -->|No| B
    F -->|Yes| G[Save cleaned dataset]
    G --> H[Train and deploy larger model]
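The text does not define which label errors count as "most important". One plausible reading, used here purely as an assumption, is that the strongest candidates are images where the proxy model disagrees with the existing label while being highly confident. The record fields (`image`, `label`, `pred`, `confidence`) and the function name are hypothetical.

```python
def rank_label_errors(records, top_k):
    """Return the top_k records where prediction and label disagree,
    ordered from most to least confident prediction."""
    suspects = [r for r in records if r["pred"] != r["label"]]
    suspects.sort(key=lambda r: r["confidence"], reverse=True)
    return suspects[:top_k]


# Toy inference results on a labeled dataset.
records = [
    {"image": "a.jpg", "label": "dog", "pred": "dog", "confidence": 0.99},
    {"image": "b.jpg", "label": "cat", "pred": "dog", "confidence": 0.95},
    {"image": "c.jpg", "label": "cat", "pred": "dog", "confidence": 0.55},
    {"image": "d.jpg", "label": "dog", "pred": "cat", "confidence": 0.80},
]
to_review = rank_label_errors(records, top_k=2)
```

A confident disagreement is only a hint that the label may be wrong, so each flagged image still needs a human check before its label is changed.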

Methodology

To test the workflows we use the Imagenette dataset, but the approach applies to any dataset.

Imagenette is a subset of the ImageNet dataset with 10 classes. It also has an existing leaderboard, which we can use to evaluate the performance of our models.

Step 1: Download the dataset

Download the Imagenette dataset. It comes with a train and a validation split. Since the leaderboard is based on the validation set, we will evaluate our model on the validation set so the results are easy to compare with the leaderboard.

We will treat the Imagenette train set as an unlabeled set and iteratively sample from it while monitoring performance on the validation set. Ideally, we will reach validation performance close to the leaderboard with a minimal number of labeled images.

I've processed the Imagenette dataset and uploaded it to the Hugging Face Hub as dnth/active-learning-imagenette.

To load the dataset, you can use the following code:

from datasets import load_dataset

unlabeled_dataset = load_dataset("dnth/active-learning-imagenette", "unlabeled")
eval_dataset = load_dataset("dnth/active-learning-imagenette", "evaluation")

Step 2: Initial Sampling

Label an initial dataset of 10 images from each class (100 images in total for the 10 classes). This gives us a small proxy dataset to train our model on. The sampling is random; more intelligent sampling strategies exist, but random sampling is a simple starting point.
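Drawing 10 random images per class can be sketched with the standard library alone. This sketch assumes the class of each image is known when sampling, which holds in the Imagenette simulation (the train labels exist but are treated as hidden); the `initial_sampling` helper and the toy file names are hypothetical and are not the `av.initial_sampling` API.

```python
import random
from collections import defaultdict


def initial_sampling(items, n_per_class, seed=0):
    """Pick n_per_class random items from each class.

    `items` is a list of (filepath, class_name) pairs.
    """
    by_class = defaultdict(list)
    for filepath, class_name in items:
        by_class[class_name].append(filepath)

    rng = random.Random(seed)  # fixed seed so the sample is reproducible
    sample = []
    for class_name in sorted(by_class):
        files = by_class[class_name]
        chosen = rng.sample(files, min(n_per_class, len(files)))
        sample.extend((f, class_name) for f in chosen)
    return sample


# Toy pool: 10 classes with 50 images each.
pool = [(f"class_{c}/img_{i}.jpg", f"class_{c}") for c in range(10) for i in range(50)]
initial_set = initial_sampling(pool, n_per_class=10)
```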

Step 3: Training the proxy model

Train a proxy model on the initial dataset. The proxy model is a small model that is easy to train and deploy. We train it with fastai, using the resnet18 architecture as a starting point. Once training is complete, compute the accuracy of the proxy model on the validation set and compare it to the leaderboard.

[!TIP] With the initial model we got 91.24% accuracy on the validation set. See the notebook for more details.

Train Epochs	Number of Images	Validation Accuracy	Source
10	100	91.24%	Initial sampling notebook
80	9469	94.90%	fastai
200	9469	95.11%	fastai

Step 4: Inference on the unlabeled dataset

Run inference on the unlabeled dataset (the remaining imagenette train set) and evaluate the performance of the proxy model.
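For the sampling step that follows, each inference result needs a predicted class and a confidence score. If the model returns raw scores (logits) rather than probabilities, which is an assumption here, a softmax converts them. The helper names below are hypothetical.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities that sum to 1."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def predict(logits, class_names):
    """Return the most likely class and the probability assigned to it."""
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"pred": class_names[best], "confidence": probs[best]}


# Toy example with three classes.
result = predict([2.0, 0.5, -1.0], ["church", "tench", "parachute"])
```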

Step 5: Active learning

Use active learning to select the most informative images to label from the unlabeled set. Pick the top 10 images from the unlabeled set that the proxy model is least confident about and label them.
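Picking the images the model is least confident about amounts to sorting predictions by confidence in ascending order and taking the first few. The sketch below assumes each prediction is a dict with `image` and `confidence` keys; these field names and the standalone function are illustrative, not the library's `model.uncertainty_sampling()`.

```python
def uncertainty_sampling(predictions, n_samples=10):
    """Return the n_samples predictions with the lowest confidence."""
    ranked = sorted(predictions, key=lambda p: p["confidence"])
    return ranked[:n_samples]


# Toy predictions on an unlabeled pool.
predictions = [
    {"image": "img_0.jpg", "confidence": 0.98},
    {"image": "img_1.jpg", "confidence": 0.41},
    {"image": "img_2.jpg", "confidence": 0.67},
    {"image": "img_3.jpg", "confidence": 0.35},
]
to_label = uncertainty_sampling(predictions, n_samples=2)
```

Here confidence means the probability of the top predicted class, so this is the least-confidence variant of uncertainty sampling.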

Step 6: Repeat

Repeat steps 3-5 until the performance on the validation set is close to the leaderboard. Track the number of labeled images against the validation performance: the goal is to get close to the leaderboard with as few labeled images as possible.

[!TIP] After the first iteration we got 94.57% accuracy on the validation set. See the notebook for more details.

Train Epochs	Number of Images	Validation Accuracy	Source
10	200	94.57%	First relabeling notebook
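To relate labeled images to validation accuracy, the numbers from the tables in Steps 3 and 6 can be turned into a label fraction and a gap to the best fastai result. This is only a rough comparison, since the runs use different numbers of epochs (10 versus 80 and 200). The variable names are illustrative.

```python
FULL_TRAIN_SIZE = 9469   # images used in the fastai rows
BEST_FULL_ACCURACY = 95.11   # best fastai validation accuracy (200 epochs)

# Active-learning runs reported in the tables.
runs = [
    {"images": 100, "accuracy": 91.24},   # initial sampling
    {"images": 200, "accuracy": 94.57},   # first relabeling
]

for run in runs:
    # Share of the full train set that was labeled, in percent.
    run["label_fraction_pct"] = round(100 * run["images"] / FULL_TRAIN_SIZE, 2)
    # Accuracy gap to the best full-data result, in percentage points.
    run["gap_points"] = round(BEST_FULL_ACCURACY - run["accuracy"], 2)

# runs[0]: 1.06% of the images, 3.87 points below 95.11%
# runs[1]: 2.11% of the images, 0.54 points below 95.11%
```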
