Vision Unlearning: a tool for Machine Unlearning in Computer Vision

[CI badges: Mypy | Pycodestyle | Pytest | Coverage | Publish Package to PyPI]

Documentation

Installation

pip install vision-unlearning

Compatible with Python 3.10 to 3.12.

What is Vision Unlearning?

Vision Unlearning provides a standard interface for unlearning algorithms, datasets, metrics, and evaluation methodologies commonly used in Machine Unlearning for vision-related tasks, such as image classification and image generation.

It bridges the gap between research/theory and engineering/practice, making it easier to apply machine unlearning techniques effectively.

Vision Unlearning is designed to be:

  • Easy to use
  • Easy to extend
  • Architecture-agnostic
  • Application-agnostic

Who is it for?

Researchers

For Machine Unlearning researchers, Vision Unlearning helps with:

  • Using the same data splits as other works, including the correct separation of forget and retain data, and generating data with the same prompts.
  • Choosing the appropriate metrics for each task.
  • Configuring evaluation setups in a standardized manner.

Practitioners

For practitioners, Vision Unlearning provides:

  • Easy access to state-of-the-art unlearning algorithms.
  • A standardized interface to experiment with different algorithms.

Tutorials

The source code for these tutorials is in tutorials/, but their outputs were cleared to avoid bloating the repo. The links above point to executions stored on Google Drive with the full outputs.

For developers: every time there is a relevant modification to the codebase, please run the affected tutorials, save the notebook to Drive, and clear the outputs before committing.
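
One way to clear the outputs in place (assuming Jupyter's nbconvert is installed):

    jupyter nbconvert --clear-output --inplace tutorials/<notebook>.ipynb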

Main Interfaces

Vision Unlearning standardizes the following components:

  • Metric: Evaluates a model (e.g., FID, CLIP Score, MIA, NudeNet, etc.).
  • Unlearner: Encapsulates the unlearning algorithm.
  • Dataset: Encapsulates the dataset, including data splitting.

Additionally, common tasks and evaluation setups are provided as example notebooks. Several platform integrations, such as Hugging Face and Weights & Biases, are also included.

[Figure: UML diagram of the main interfaces]
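
To make these contracts concrete, here is a hand-written sketch of the three interfaces as Python protocols. It is illustrative only: the class shapes and method names are assumptions, not the library's actual definitions.

    from typing import Any, Protocol, Tuple

    class Dataset(Protocol):
        """Encapsulates a dataset, including the forget/retain data splitting."""
        def split(self) -> Tuple[Any, Any]:  # hypothetical accessor
            """Return (forget_set, retain_set)."""
            ...

    class Unlearner(Protocol):
        """Encapsulates an unlearning algorithm."""
        def unlearn(self, model: Any, forget: Any, retain: Any) -> Any:  # hypothetical signature
            """Return a model that has unlearned the forget set."""
            ...

    class Metric(Protocol):
        """Evaluates a model (e.g., FID, CLIP Score, MIA, NudeNet)."""
        def evaluate(self, model: Any) -> float:  # hypothetical signature
            ...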

Evaluation

Testbeds

Our testbeds serve as "meta benchmarks": a set of tasks that can be used as a starting point when designing a new assessment, benchmark, or intervention. The tasks are divided into two groups, basic and applied, reflecting the need for different structures and task-selection methodologies.

Each task is defined by carefully selecting a diverse and representative set of entities (concepts that will undergo unlearning). Each entity is annotated with relevant attributes and separately unlearned using different state-of-the-art unlearning methods. Each unlearned model is then used to generate images for all entities, allowing fine-grained analysis of the effects caused by the unlearning process. Last but not least, all entities contain the same number of images and were carefully selected so as to be balanced across at least two attributes.

More specifically, the methodology used to produce the testbeds is:

  1. Choose 3 unlearning methods, representative of the main categories of unlearning algorithms
  2. Choose 3 tasks, representative of the unlearning applications
  3. Choose 2 or more attributes of interest (visual, unambiguous, not polemic nor debatable)
  4. Enrich all entities with attributes (including the attributes of interest, but potentially more)
  5. Restrict to 100 entities, intersectionally balanced across the attributes of interest (sketched in code after the pipeline figure)
  6. Sequentially equalize hyperparameters across unlearning methods
  7. Train unlearned models and generate images:

[Figure: testbeds generation pipeline]
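
As a rough illustration of step 5 (the actual selection was curated; the attribute columns here are placeholders for the task's attributes of interest, e.g. occupation_simplified and hpi_bin):

    import pandas as pd

    def balanced_subset(entities: pd.DataFrame, attrs: list, n: int = 100) -> pd.DataFrame:
        """Toy sketch of intersectional balancing: keep an equal quota of
        entities for every combination of the attributes of interest."""
        groups = entities.groupby(attrs)
        per_cell = n // groups.ngroups  # equal quota per attribute combination
        # head() keeps up to per_cell entities per combination; combinations
        # with fewer entities than the quota leave the subset smaller than n.
        return groups.head(per_cell).reset_index(drop=True)

    # e.g., balanced_subset(df, ["occupation_simplified", "hpi_bin"], n=100)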

Across all tasks, a standardized set of files, media, metadata, and content is provided (a loading sketch follows the list):

  • metadata_filtered: List[Dict[str, Any]]
    • Each position refers to one entity, which is described by a dict
    • One file for the entire task
    • Save path: metadata_{task}_2_enriched_filtered.json
    • Fields
      • name: str, as labeled in the main image dataset; also referred to as "non preprocessed"
      • index: int
      • is_unlearned: bool; if true then dataset_n > 0. Unused for now (all chosen entities are unlearned and analysed)
      • dataset_n_original: int; 0 if the entity is not in the dataset (does not appear even in the retain set). Unused for now (all entities have data)
      • Plus task-specific attributes (see description in each task)
  • similarity_clip
    • 100×100 matrix of pairwise CLIP-score similarities between the name fields of metadata_filtered
    • One file for the entire task
    • Save path: similarity_clip_{task}.json
  • similarity_attr
    • 100×100 matrix of pairwise Jaccard similarities of the categorical attributes of metadata_filtered, summed with the scaled absolute difference of the numerical attributes
    • One file for the entire task
    • Save path: similarity_attr_{task}.json
  • Data splits ready for training
    • Separate folders for forget and retain images
    • Folder path forget: {dataset_base_path}/{target}/train_forget
    • Folder path retain: {dataset_base_path}/{target}/train_retain
  • Unlearned models
    • stable-diffusion-v1-4 models that forgot the target
    • Each model is stored in a separate folder
    • In total, 900 models are provided
    • Folder path: models/{task}_{target}_{method}_{num_train_epochs:03d}
  • Data generated by unlearned models
    • In total, 360000 images are provided
    • Prompt: "An image of {name}"
    • Folder path: datasets/generated_{task}_{target}_{method}_{num_train_epochs:03d}
  • Notebook 1: Data Preparation
    • Downloads, enriches, filters, and saves splits
    • Generates metadata_filtered
    • Operations are performed step-by-step so it's easy to adapt
  • Notebook 2: Data Exploration
    • Basic exploratory data analysis and other utilities to handle the dataset
    • Generates similarity_clip and similarity_attr
    • Plots them as heatmaps
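
A minimal loading sketch for these files, following the save paths listed above (the working directory and the choice of task are assumptions):

    import json

    task = "people"  # or "breeds", "scenes", ...

    # One dict per entity, with 'name', 'index', and task-specific attributes:
    with open(f"metadata_{task}_2_enriched_filtered.json") as f:
        metadata_filtered = json.load(f)

    # 100x100 pairwise CLIP-score similarity matrix over the 'name' fields:
    with open(f"similarity_clip_{task}.json") as f:
        similarity_clip = json.load(f)

    entity = metadata_filtered[0]
    print(entity["name"], entity["index"])

    # Prompt template used to generate images with the unlearned models:
    prompt = f"An image of {entity['name']}"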

All heavy media (anything that isn't pure code) is provided in the following link: https://doi.org/10.5281/zenodo.18649818

We are still uploading content and intend to finish in the coming months. In the meantime, reach out to Leonardo Benitez if you are interested in the idea, and cite the Vision-Unlearning library (see below) if what is currently available in the public repository is useful for your work.

Basic testbeds

Tasks:

  • Breeds

    • Unlearning a dog breed recognized by the FCI (Fédération Cynologique Internationale)
    • Main image dataset: taras_breeds
    • Attribute datasets: akc, pawsomeauthority
    • Temporary or intermediate files: metadata_breeds_1_enriched_but_not_filtered.json, metadata_breeds_2_enriched_filtered.json, akc-data-latest.csv
    • Task specific attributes
      • description: str
      • temperament: str
      • popularity: int
      • min_height: float
      • group: enum
        • Sporting Group — Breeds bred to assist hunters in the capture and retrieval of game (e.g., pointers, retrievers, spaniels).
        • Hound Group — Breeds used for hunting by scent or sight.
        • Working Group — Strong, intelligent breeds bred for jobs like guarding, pulling sleds, and rescue.
        • Terrier Group — Energetic, often feisty breeds originally bred to hunt vermin.
        • Toy Group — Small breeds developed primarily as companion or lap dogs.
        • Non-Sporting Group — Breeds with diverse functions that don’t clearly fit into the other groups.
        • Herding Group — Breeds developed to control livestock; separated from the Working Group in 1983.
        • Miscellaneous Class — Breeds recognized by AKC but not yet fully eligible for a regular group; transitional phase.
        • Foundation Stock Service (FSS) — Breeds recorded by AKC to preserve and develop rare breeds; not yet fully recognized.
      • ...among others...
    • Attributes chosen for data balancing: TODO
    • Number of entities: 100
  • Scenes

    • Unlearning a scene (a holistic, semantically coherent environment characterized by its global spatial layout, functional purpose, and typical object configurations, rather than by any single object)
    • Main image dataset: SUN
    • Attribute datasets: pantheon
    • Temporary or intermediate files: metadata_scenes_1_enriched_but_not_filtered.json, metadata_scenes_2_enriched_filtered.json
    • Task specific attributes
      • socializing: bool
      • natural: bool
      • open area: bool
      • exercise: bool
      • ...among others...
    • Attributes chosen for data balancing: TODO
    • Number of entities: 100
  • People

    • Unlearning one famous person
    • Main image dataset: lfw
    • Attribute datasets: pantheon
    • Temporary or intermediate files: metadata_people_1_enriched_but_not_filtered.json, metadata_people_2_enriched_filtered.json
    • Task specific attributes
      • name_pantheon: Optional[str], only if different from name
      • race: Enum[white, asian, black, indian_middleEastern_latinoHispanic] (TODO: confirm this category merging)
      • gender: Enum[M, F]
      • birthyear
      • occupation
      • bplace_country
      • occupation_simplified
        • Artist = Actor, or Singer, or Musician, or Film director, or Comedian, or Writer, or Artist, or Model
        • Athlete = Tennis player, or Basketball player, or Racing driver, or Swimmer, or Athlete, or Golfer, or Boxer, or Cyclist, or Skater, or Soccer player, or Baseball player, or American football player, or Cricketer
        • Politician = Politician
        • All other occupations are excluded (TODO: confirm)
      • hpi: float. Historical Popularity Index (HPI), a metric that combines the number of Wikipedia translations, time since birth, and Wikipedia page views (2008-2013); higher = more famous
      • hpi_bin: enum["Q0_25", "Q25_50", "Q50_75", "Q75_100"]
    • Attributes chosen for data balancing: occupation_simplified, hpi_bin
    • Number of entities: 100

  • Objects

    • ...still under development...

Known works using these testbeds:

Applied testbeds

Tasks:

  • Biomedical
    • ...still under development...
  • Autonomous Vehicles
    • ...still under development...
  • Art adaptation
    • ...still under development...

Benchmarks

Unlearn Canvas

We provide a simple interface to run this benchmark on the unlearning methods in our library. For more information on the benchmark itself, please refer to its repository: https://github.com/OPTML-Group/UnlearnCanvas.

I-CARE

I-CARE stands for Interference in Concept Adaptation and geneRative Erasure, a benchmark for interference analysis in unlearning. Leveraging the Basic Testbeds provided by Vision-Unlearning, this work analyzes how unlearning one entity (the emitter) affects the performance on other closely related entities (the receivers).

Most of the code is currently in the following PRIVATE repository: https://github.com/LeonardoSanBenitez/unlearning-analysis

We intend to open-source it in the coming months. In the meantime, reach out to Leonardo Benitez if you are interested in the idea, and cite the Vision-Unlearning library (see below) if what is currently available in the public repository is useful for your work.

Across all tasks, the benchmark results are summarized in the following files:

  • interference_per_pair: Dict[str, Dict[str, float]]

    • Stores computed MetricInterferencePerEntityPair
    • Each key refers to one entity (including the target, aka forget-set worsening), as it appears in the 'name' field of metadata_filtered (non preprocessed)
    • An entire interference_per_pair refers to the interferences caused by one EMITTER to each RECEIVER (the keys of interference_per_pair)
    • Values are the metrics computed, averaged across all seeds
    • Always complete (all values computed)
    • One file per unlearning session
    • Computed by: 3_compute_caused_interferences.py (benefits from GPU but can run on CPU)
    • Save path: datasets/interferences_caused_by_{task}_{index}_{method}_{num_train_epochs}.json
  • interference_per_pair_inverse: Dict[str, Dict[str, float]]

    • An entire interference_per_pair_inverse refers to the interferences RECEIVED by one entity from each EMITTER (the keys of interference_per_pair_inverse)
    • May be incomplete (if not all unlearning sessions were performed)
    • One per entity, built from the results of all unlearning sessions
    • Save path: none, calculated on-the-fly (see the sketch after this list)
  • interference_per_entity: List[Dict[str, Any]]

    • Stores computed MetricInterferencePerEntity, together with all information from metadata_filtered
    • Each item/row corresponds to one entity
    • One file for the entire task, for all methods
    • Columns are the information from metadata_filtered, then each MetricInterferencePerEntity prefixed by metric_{method}_{num_train_epochs}_ and suffixed by (↑) or (↓)
    • Computed by: 4. Compute interference per entity.ipynb (does not benefit from GPU)
    • Save path: interference_per_entity_{task}.json
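
Since interference_per_pair_inverse is not saved to disk, here is a sketch of the on-the-fly inversion (assuming the per-session files have already been loaded into one dict keyed by emitter name):

    from typing import Dict

    def invert_interferences(
        per_emitter: Dict[str, Dict[str, float]],  # emitter -> (receiver -> interference)
    ) -> Dict[str, Dict[str, float]]:
        """Pivot emitter->receiver interference maps into receiver->emitter maps.

        The result may be incomplete if not all unlearning sessions were performed.
        """
        inverse: Dict[str, Dict[str, float]] = {}
        for emitter, receivers in per_emitter.items():
            for receiver, value in receivers.items():
                inverse.setdefault(receiver, {})[emitter] = value
        return inverse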

Citation

We don't yet have a paper specifically about the library. Instead, please cite the Zenodo DOI that releases the data for the testbeds:

@dataset{vision_unlearning_evaluation_testbeds,
  author       = {Benitez Pereira, Leonardo Santiago and
                  Mola, Natnael and
                  R. Kelsch, Carolina and
                  Vaze, Soham},
  title        = {Vision Unlearning Evaluation Testbeds},
  month        = feb,
  year         = 2026,
  publisher    = {Zenodo},
  version      = {0.1.0},
  doi          = {10.5281/zenodo.18649818},
  url          = {https://doi.org/10.5281/zenodo.18649818},
}
