Skip to main content

A Python script that analyzes image classes using OpenAI CLIP model

Project description

Clip2ClassDist

Clip2ClassDist is a Python script that analyzes image classes by leveraging OpenAI's CLIP model to generate image embeddings. It computes the center and radius of clusters formed by embeddings, the distribution of dot products within each class, and the average, minimum, and maximum dot products between different classes.

The script takes a list of image folders as input, with each folder representing a distinct image class. It uses the specified CLIP model to create normalized embeddings for each image in the folders. The script then analyzes the embeddings using the Faiss library for efficient similarity search and clustering.

Installation

Clone the repository:

git clone https://github.com/ternaus/clip2classdist.git

Install the required dependencies:

pip install -r requirements.txt

Usage

The main script compute.py can be executed with the following command:

Copy code
python clip2class_dist/compute.py --clip_model MODEL_NAME --image_folders CLASS_NAME_1 FOLDER_PATH_1 CLASS_NAME_2 FOLDER_PATH_2 [--device DEVICE]

Arguments:

  • --clip_model: Name of the CLIP model to use (e.g., ViT-B/32). Where allowed models:
    • ViT-B/32
    • RN50
    • RN101
    • RN50x4
    • RN50x16
    • ViT-B/16
    • RN50x64
    • ViT-L/14
    • ViT-L/14@336px
  • --image_folders: List of pairs containing class names and corresponding folder paths containing images. Each pair should be separated by a space, and pairs should be space-separated as well (e.g., "dogs /path/to/dog_images cats /path/to/cat_images").
  • --device (optional): Device to run the model on (default: "cpu"). Set to "cuda" if you have a compatible GPU.

Example usage:

python clip2class_dist/compute.py --clip_model ViT-B/32 --image_folders dogs /path/to/dog_images cats /path/to/cat_images --device cpu

Output

The script will output the following information:

  1. Cluster centers for each class.
  2. Cluster radii for each class.
  3. Dot product distributions within each class.
  4. Average dot product between different classes.
  5. Minimum dot product between different classes.
  6. Maximum dot product between different classes.

Example output:

Cluster centers: {'dogs': array([...]), 'cats': array([...])}
Cluster radii: {'dogs': 0.2384927, 'cats': 0.27184734}
Dot product distributions: {'dogs': array([...]), 'cats': array([...])}
Average dot product between classes: 0.19983745
Minimum dot product between classes: 0.01759241
Maximum dot product between classes: 0.38273957

License

This project is released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clip2classdist-0.1.0.tar.gz (6.0 kB view details)

Uploaded Source

Built Distribution

clip2classdist-0.1.0-py3-none-any.whl (6.7 kB view details)

Uploaded Python 3

File details

Details for the file clip2classdist-0.1.0.tar.gz.

File metadata

  • Download URL: clip2classdist-0.1.0.tar.gz
  • Upload date:
  • Size: 6.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.10

File hashes

Hashes for clip2classdist-0.1.0.tar.gz
Algorithm Hash digest
SHA256 3a409da4079ad86b001e7b071200da586c6ce1384a1025b71eec52661420b619
MD5 979a1e86e344cc5db00e921b0b65f867
BLAKE2b-256 2545bd6009a55a04305bef839bc26b9a3c0bf6ef9c399bc88e12cade55e71604

See more details on using hashes here.

File details

Details for the file clip2classdist-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for clip2classdist-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 99d0bb3ac2b0d410eb54b59c896ac824ec21d82d4e7597b3aca81287eeda4d62
MD5 d3c74592cb315b9a64b40dfd45f4735a
BLAKE2b-256 c906f2d878636684bb29b214a2664971bf135baca9f3127a5fa91ecbd6c6d9a2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page