Extracting image features from state-of-the-art neural networks for Computer Vision made easy

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

:notebook_with_decorative_cover: Table of Contents

About the Project
- Functionality
- Model collection
Getting Started
- Setting up your environment
- Basic usage
How to contribute
License
Citation
Contributions

:star2: About the Project

thingsvision is a Python package that lets you easily extract image representations from many state-of-the-art neural networks for computer vision. In a nutshell, you point thingsvision to a directory of images and tell it which neural network you are interested in. thingsvision then returns that network's representation of each image, so you end up with one feature vector per image. You can use these feature vectors for further analyses. We use the word "features" as shorthand for "image representations".

:rotating_light: Note: some function calls mentioned in the paper have been deprecated. To use this package successfully, exclusively follow this README and the Documentation. :rotating_light:

(back to top)

:mechanical_arm: Functionality

With thingsvision, you can:

extract features for any imageset from many popular networks.
extract features for any imageset from your custom networks.
extract features for >26,000 images from the THINGS image database.
optionally turn off the standard center cropping performed by many networks before extracting features.
extract features from HDF5 datasets directly (e.g. NSD stimuli).
conduct basic Representational Similarity Analysis (RSA) after feature extraction.
perform Centered Kernel Alignment (CKA) to compare image features across model-module combinations.
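
To illustrate what the RSA step involves, here is a minimal NumPy sketch. It is not thingsvision's own implementation: it assumes a feature matrix with one row per image, builds a representational dissimilarity matrix (RDM) from correlation distances, and compares two RDMs via their upper triangles. The helper names `compute_rdm_sketch` and `compare_rdms_sketch` are hypothetical.

```python
import numpy as np


def compute_rdm_sketch(features: np.ndarray) -> np.ndarray:
    """Return an (n_images, n_images) RDM of correlation distances (1 - Pearson r)."""
    # np.corrcoef treats each row as one variable, i.e. one image's feature vector.
    return 1.0 - np.corrcoef(features)


def compare_rdms_sketch(rdm_a: np.ndarray, rdm_b: np.ndarray) -> float:
    """Pearson correlation between the upper triangles of two RDMs."""
    # The diagonal is zero by construction and the matrix is symmetric,
    # so only the entries strictly above the diagonal carry information.
    rows, cols = np.triu_indices_from(rdm_a, k=1)
    return float(np.corrcoef(rdm_a[rows, cols], rdm_b[rows, cols])[0, 1])


rng = np.random.default_rng(seed=0)
features_a = rng.normal(size=(5, 64))  # 5 images, 64-dimensional features
features_b = features_a * 2.0 + 1.0    # affine copy: same correlation structure

rdm_a = compute_rdm_sketch(features_a)
rdm_b = compute_rdm_sketch(features_b)
similarity = compare_rdms_sketch(rdm_a, rdm_b)
print(rdm_a.shape, round(similarity, 6))
```

Because correlation distance ignores a shared scale and offset, the two RDMs in this toy example coincide. With features from two different networks, the same comparison would quantify how similarly they arrange the images.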

(back to top)

:file_cabinet: Model collection

Neural networks come from different sources. With thingsvision, you can extract image representations of all models from:

torchvision
Keras
timm
vissl (Self-Supervised Learning Models)
- Currently available: simclr-rn50, mocov2-rn50, jigsaw-rn50, rotnet-rn50, swav-rn50, pirl-rn50
OpenCLIP
both original CLIP variants (clip-ViT and clip-RN)
some custom models (VGG-16, Resnet50, Inception_v3 and Alexnet) trained on Ecoset
each of the many CORnet versions.

(back to top)

:running: Getting Started

:computer: Setting up your environment

Working locally.

First, create a new conda environment with Python version 3.8, 3.9, or 3.10, e.g.:

$ conda create -n thingsvision python=3.9
$ conda activate thingsvision

Then, with the environment activated, install thingsvision by running the following pip command in your terminal.

$ pip install --upgrade thingsvision
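
If you want to confirm the setup from within Python, a small check along the following lines may help. This is a sketch using only the standard library; the helper names are hypothetical, and the set of supported versions simply mirrors the Python versions listed above.

```python
import importlib.util
import sys

# Python versions named in the setup instructions above.
SUPPORTED_MINORS = {(3, 8), (3, 9), (3, 10)}


def python_is_supported(version_info=sys.version_info) -> bool:
    """True if the interpreter's (major, minor) is one of the listed versions."""
    return (version_info[0], version_info[1]) in SUPPORTED_MINORS


def package_is_installed(name: str = "thingsvision") -> bool:
    """True if `name` can be located on the import path (without importing it)."""
    return importlib.util.find_spec(name) is not None


print("Python", sys.version.split()[0], "supported:", python_is_supported())
print("thingsvision installed:", package_is_installed())
```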

Google Colab.

Alternatively, you can use Google Colab to play around with thingsvision by uploading your image data to Google Drive (via directory mounting). You can find the Jupyter notebook using PyTorch here and the TensorFlow example here.

(back to top)

:mag: Basic usage

thingsvision was designed to make extracting features as easy as possible. Start by importing all the necessary components and instantiating a thingsvision extractor. Here we use AlexNet from the torchvision library as the model to extract features from, and load it onto the GPU (if one is available) for faster inference:

import torch
from thingsvision import get_extractor
from thingsvision.utils.storing import save_features
from thingsvision.utils.data import ImageDataset, DataLoader

model_name = 'alexnet'
source = 'torchvision'
device = 'cuda' if torch.cuda.is_available() else 'cpu'

extractor = get_extractor(
  model_name=model_name,
  source=source,
  device=device,
  pretrained=True
)

Next, create the Dataset and Dataloader for your images. Here, we have all our images in a single directory root, which can also contain subfolders (e.g. for individual classes), so we're using the ImageDataset class.

root = 'path/to/root/img/directory'  # e.g., './images/'
batch_size = 32

dataset = ImageDataset(
  root=root,
  out_path='path/to/features',
  backend=extractor.get_backend(),
  transforms=extractor.get_transformations()
)

batches = DataLoader(
  dataset=dataset,
  batch_size=batch_size,
  backend=extractor.get_backend()
)
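
To picture what "a single directory that can also contain subfolders" means in practice, the following standard-library sketch walks such a tree and collects image paths. It only illustrates the directory layout and is not how ImageDataset is implemented; `list_images_sketch` and the extension set are hypothetical, and the extensions thingsvision accepts may differ.

```python
import tempfile
from pathlib import Path

# Hypothetical set of extensions, chosen for illustration only.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def list_images_sketch(root: str) -> list:
    """Return sorted paths (relative to root) of image files, including subfolders."""
    root_path = Path(root)
    return sorted(
        path.relative_to(root_path).as_posix()
        for path in root_path.rglob("*")
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS
    )


# Build a throwaway directory tree to show the traversal.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "cats").mkdir()
    (Path(tmp) / "dogs").mkdir()
    (Path(tmp) / "cats" / "cat_01.jpg").touch()
    (Path(tmp) / "dogs" / "dog_01.PNG").touch()
    (Path(tmp) / "notes.txt").touch()  # ignored: not an image extension
    found = list_images_sketch(tmp)

print(found)
```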

Now all that is left is to extract the image features and store them to disk! Here we're extracting features from the last convolutional layer of AlexNet (features.10). If you don't know which modules are available for a given model, call extractor.show_model() to print all modules.

module_name = 'features.10'

features = extractor.extract_features(
  batches=batches,
  module_name=module_name,
  flatten_acts=True  # flatten 2D feature maps from convolutional layer
)

save_features(features, out_path='path/to/features', file_format='npy')
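
As a rough picture of what flattening does to convolutional activations, consider the NumPy sketch below. The shape is an assumption: for 224 x 224 inputs, AlexNet's last convolutional layer produces 256 feature maps on a 13 x 13 grid, and we assume that flattening merges everything except the image axis into a single vector per image.

```python
import numpy as np

# Assumed activation shape for a batch of 4 images: 256 channels on a 13 x 13 grid.
batch_size, channels, height, width = 4, 256, 13, 13
activations = np.zeros((batch_size, channels, height, width), dtype=np.float32)

# Flattening keeps the image axis and merges all remaining axes into one.
flattened = activations.reshape(batch_size, -1)

print(flattened.shape)  # (4, 43264)
```

Each row of the flattened array is then one feature vector per image, which is the form the analyses described in this README operate on.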

For more examples and explanations of additional functionality, such as how to turn off center cropping, use HDF5 datasets (e.g. NSD stimuli), perform RSA or CKA, or extract features for the THINGS image database, please refer to the Documentation.
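
To make the CKA comparison mentioned above more concrete, here is a minimal NumPy sketch of the linear variant of CKA. It is an independent illustration under the assumption that both feature matrices have one row per image; it is not thingsvision's API, and `linear_cka_sketch` is a hypothetical name.

```python
import numpy as np


def linear_cka_sketch(x: np.ndarray, y: np.ndarray) -> float:
    """Linear CKA between two feature matrices with the same number of rows (images)."""
    # Center each feature dimension across images.
    x = x - x.mean(axis=0, keepdims=True)
    y = y - y.mean(axis=0, keepdims=True)
    cross = np.linalg.norm(y.T @ x, ord="fro") ** 2
    norm_x = np.linalg.norm(x.T @ x, ord="fro")
    norm_y = np.linalg.norm(y.T @ y, ord="fro")
    return float(cross / (norm_x * norm_y))


rng = np.random.default_rng(seed=0)
features_a = rng.normal(size=(20, 8))                 # 20 images, 8 features
rotation, _ = np.linalg.qr(rng.normal(size=(8, 8)))   # random orthogonal matrix
features_b = 3.0 * features_a @ rotation              # rotated and rescaled copy

cka_same = linear_cka_sketch(features_a, features_b)
cka_other = linear_cka_sketch(features_a, rng.normal(size=(20, 16)))
print(round(cka_same, 6), 0.0 <= cka_other <= 1.0)
```

Linear CKA is invariant to orthogonal transformations and isotropic scaling of the features, which is why the rotated and rescaled copy is scored as identical to the original. The two matrices may have different numbers of feature dimensions, as in the second comparison.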

(back to top)

:wave: How to contribute

If you come across problems or have suggestions, please submit an issue!

(back to top)

:warning: License

This GitHub repository is licensed under the MIT License - see the LICENSE.md file for details.

(back to top)

:page_with_curl: Citation

If you use this GitHub repository (or any modules associated with it), please cite our paper for the initial version of thingsvision as follows:

@article{Muttenthaler_2021,
	author = {Muttenthaler, Lukas and Hebart, Martin N.},
	title = {THINGSvision: A Python Toolbox for Streamlining the Extraction of Activations From Deep Neural Networks},
	journal = {Frontiers in Neuroinformatics},
	volume = {15},
	pages = {45},
	year = {2021},
	url = {https://www.frontiersin.org/article/10.3389/fninf.2021.679838},
	doi = {10.3389/fninf.2021.679838},
	issn = {1662-5196},
}

(back to top)

:gem: Contributions

This library is based on the groundwork laid by Lukas Muttenthaler and Martin N. Hebart, who are both still actively involved, but has been extended and refined into its current form with the help of our many contributors,

Alex Murphy (software dev.)
Hannes Hansen (software dev.)
Johannes Roth (software dev., design, docs)
Jonas Dippel (software dev.)
Lukas Muttenthaler (software dev., design, docs)
Martin N. Hebart (design)
Oliver Contier (docs)
Philipp Kaniuth (design, docs)
Roman Leipe (software dev., docs),

sorted alphabetically.

This is a joint open-source project between the Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, and the Machine Learning Group at Technische Universität Berlin. Correspondence and requests for contributing should be addressed to Lukas Muttenthaler. Feel free to contact us if you want to become a contributor or have any suggestions or feedback. For the latter, you could also just post an issue or engage in discussions. We'll try to respond as fast as we can.

(back to top)

Project details

Release history

2.6.6

May 16, 2024

2.6.5

May 16, 2024

2.6.4

May 2, 2024

2.6.3

Apr 29, 2024

2.6.2

Apr 24, 2024

2.6.1

Apr 22, 2024

2.6.0

Apr 22, 2024

2.5.4

Apr 16, 2024

2.5.3

Apr 8, 2024

2.5.2

Apr 5, 2024

2.5.1

Apr 4, 2024

2.5.0

Mar 28, 2024

2.4.2

Mar 19, 2024

2.4.1

Aug 9, 2023

2.4.0

Aug 8, 2023

2.3.20

Aug 4, 2023

2.3.19

Aug 4, 2023

2.3.18

Jul 26, 2023

2.3.17

Jul 11, 2023

2.3.16

Jun 29, 2023

2.3.15

Jun 29, 2023

2.3.14

Apr 30, 2023

2.3.13

Mar 6, 2023

2.3.12

Mar 6, 2023

2.3.11

Mar 6, 2023

2.3.10

Mar 6, 2023

2.3.9

Mar 6, 2023

2.3.8

Mar 5, 2023

2.3.7

Mar 2, 2023

2.3.6

Mar 2, 2023

2.3.5

Mar 2, 2023

2.3.4

Mar 2, 2023

2.3.3

Mar 2, 2023

2.3.2

Mar 2, 2023

2.3.1

Mar 2, 2023

2.3.0

Mar 2, 2023

2.2.24

Feb 24, 2023

2.2.23

Feb 22, 2023

2.2.22

Feb 17, 2023

2.2.21

Feb 15, 2023

2.2.20

Feb 15, 2023

2.2.19

Feb 15, 2023

2.2.18

Jan 18, 2023

2.2.17

Jan 1, 2023

2.2.16

Dec 20, 2022

2.2.15

Dec 20, 2022

2.2.14

Dec 20, 2022

2.2.13

Dec 13, 2022

2.2.12

Dec 12, 2022

2.2.11

Dec 6, 2022

2.2.10

Nov 17, 2022

2.2.9

Nov 17, 2022

2.2.8 (this version)

Nov 16, 2022

2.2.7

Nov 15, 2022

2.2.6

Nov 15, 2022

2.2.5

Nov 11, 2022

2.2.4

Nov 9, 2022

2.2.3

Nov 2, 2022

2.2.2

Oct 27, 2022

2.2.1

Oct 27, 2022

2.2.0

Oct 26, 2022

2.1.4

Sep 26, 2022

2.1.3

Sep 26, 2022

2.1.2

Sep 26, 2022

2.1.1

Sep 26, 2022

2.1.0

Sep 25, 2022

2.0.12

Sep 23, 2022

2.0.11

Sep 23, 2022

2.0.10

Sep 22, 2022

2.0.9

Sep 22, 2022

2.0.8

Sep 20, 2022

2.0.7

Sep 12, 2022

2.0.6

Sep 1, 2022

2.0.5

Aug 31, 2022

2.0.4

Aug 30, 2022

2.0.3

Aug 25, 2022

2.0.2

Aug 23, 2022

2.0.1

Aug 22, 2022

2.0.0

Aug 22, 2022

1.6.2

Aug 18, 2022

1.6.1

Aug 11, 2022

1.6.0

Aug 11, 2022

1.5.0

Aug 3, 2022

1.4.5

Jul 14, 2022

1.4.4

Jun 15, 2022

1.4.3

Apr 19, 2022

1.4.2

Mar 17, 2022

1.4.1

Feb 8, 2022

1.4.0

Feb 8, 2022

1.3.4

Feb 6, 2022

1.3.3

Feb 2, 2022

1.3.2

Feb 2, 2022

1.3.1

Feb 2, 2022

1.3.0

Feb 2, 2022

1.2.7

Feb 2, 2022

1.2.6

Feb 2, 2022

1.2.5

Jan 31, 2022

1.2.4

Jan 31, 2022

1.2.3

Jan 31, 2022

1.2.2

Jan 30, 2022

1.2.1

Jan 30, 2022

1.2.0

Jan 26, 2022

1.1.7

Jan 26, 2022

1.1.6

Jan 25, 2022

1.1.5

Jan 9, 2022

1.1.4

Oct 8, 2021

1.1.3

Oct 5, 2021

1.1.2

Oct 5, 2021

1.1.1

Aug 10, 2021

1.1.0

Aug 2, 2021

1.0.2

Jul 28, 2021

1.0.1

Jul 14, 2021

1.0.0

Jul 14, 2021

0.9.9

Jul 13, 2021

0.9.8

Jul 13, 2021

0.9.6

Jul 12, 2021

0.9.5

Jul 12, 2021

0.9.4

Jul 3, 2021

0.9.3

Jul 3, 2021

0.9.2

Jul 1, 2021

0.9.1

Jul 1, 2021

0.9.0

Jul 1, 2021

0.8.9

Jun 30, 2021

0.8.8

Jun 24, 2021

0.8.7

Jun 24, 2021

0.8.6

Jun 24, 2021

0.8.5

May 14, 2021

0.8.4

Apr 8, 2021

0.8.3

Mar 24, 2021

0.8.2

Mar 23, 2021

0.8.1

Mar 23, 2021

0.8.0

Mar 23, 2021

0.7.9

Mar 23, 2021

0.7.8

Mar 23, 2021

0.7.7

Mar 22, 2021

0.7.6

Mar 19, 2021

0.7.5

Mar 19, 2021

0.7.4

Mar 19, 2021

0.7.3

Mar 19, 2021

0.7.2

Mar 19, 2021

0.7.1

Mar 19, 2021

0.7.0

Mar 19, 2021

0.6.9

Mar 18, 2021

0.6.8

Mar 18, 2021

0.6.7

Mar 18, 2021

0.6.6

Mar 16, 2021

0.6.5

Mar 11, 2021

0.6.4

Mar 9, 2021

0.6.3

Mar 4, 2021

0.6.2

Mar 4, 2021

0.6.1

Mar 4, 2021

0.6.0

Mar 3, 2021

0.5.9

Mar 2, 2021

0.5.8

Mar 2, 2021

0.5.7

Feb 26, 2021

0.5.6

Feb 26, 2021

0.5.5

Feb 25, 2021

0.5.4

Feb 25, 2021

0.5.2

Feb 22, 2021

0.5.1

Feb 22, 2021

0.5.0

Feb 22, 2021

0.4.9

Feb 22, 2021

0.4.8

Feb 22, 2021

0.4.7

Feb 17, 2021

0.4.6

Feb 17, 2021

0.4.5

Feb 15, 2021

0.4.4

Feb 15, 2021

0.4.3

Feb 12, 2021

0.4.2

Feb 11, 2021

0.4.1

Feb 11, 2021

0.4.0

Feb 10, 2021

0.3.9

Feb 10, 2021

0.3.8

Feb 10, 2021

0.3.7

Feb 5, 2021

0.3.6

Feb 4, 2021

0.3.5

Feb 4, 2021

0.3.4

Feb 4, 2021

0.3.3

Feb 4, 2021

0.3.2

Feb 4, 2021

0.3.1

Feb 4, 2021

0.3.0

Feb 4, 2021

0.2.9

Feb 4, 2021

0.2.8

Feb 4, 2021

0.2.7

Feb 4, 2021

0.2.6

Feb 4, 2021

0.2.5

Feb 4, 2021

0.2.4

Feb 4, 2021

0.2.3

Jan 29, 2021

0.2.2

Jan 29, 2021

0.2.1

Jan 29, 2021

0.2.0

Jan 29, 2021

0.1.9

Jan 29, 2021

0.1.8

Jan 28, 2021

0.1.7

Jan 23, 2021

0.1.6

Jan 22, 2021

0.1.5

Jan 22, 2021

0.1.4

Jan 22, 2021

0.1.3

Jan 22, 2021

0.1.2

Jan 22, 2021

0.1.1

Jan 22, 2021

0.1.0

Jan 22, 2021

0.0.9

Jan 22, 2021

0.0.8

Jan 22, 2021

0.0.7

Jan 22, 2021

0.0.6

Jan 22, 2021

0.0.5

Jan 22, 2021

0.0.4

Jan 22, 2021

0.0.3

Jan 22, 2021

0.0.2

Jan 22, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

thingsvision-2.2.8.tar.gz (36.8 kB)

Uploaded Nov 16, 2022 Source

Built Distribution

thingsvision-2.2.8-py3-none-any.whl (102.0 kB)

Uploaded Nov 16, 2022 Python 3

Hashes for thingsvision-2.2.8.tar.gz

Algorithm	Hash digest
SHA256	`54a62ecd0923811cefc0cb2a041f730cbb87ea1daa7c978b6ed7b1baef617169`
MD5	`b5bf3ec6e5fa909b7bad7f7dc356401c`
BLAKE2b-256	`ec828ffba36f134cf29522a8a5a24102fb3e985691d08e299002ceb62d48b991`

Hashes for thingsvision-2.2.8-py3-none-any.whl

Algorithm	Hash digest
SHA256	`86c774d74591011da073cd1cacf50900ecc279ac1154942a18d475885a60fb68`
MD5	`5d65b0d602bb87cb17629522ce84baaf`
BLAKE2b-256	`c79557b7b0cf9f4c845a4ba43d75b0f2d687e5542022bc97448ae21f4bb8954a`
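
The digests above can be used to check a downloaded file before installing it. The following is a minimal sketch with Python's standard `hashlib` module; the helper names are hypothetical, and the demonstration runs on a throwaway file rather than on the actual wheel.

```python
import hashlib
import hmac
import os
import tempfile


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())


# SHA256 digest listed above for thingsvision-2.2.8-py3-none-any.whl.
EXPECTED_WHEEL_SHA256 = "86c774d74591011da073cd1cacf50900ecc279ac1154942a18d475885a60fb68"

# Demonstration on a temporary file (not the real wheel).
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "demo.bin")
    with open(demo_path, "wb") as handle:
        handle.write(b"abc")
    demo_ok = matches_expected(demo_path, hashlib.sha256(b"abc").hexdigest())
    demo_bad = matches_expected(demo_path, EXPECTED_WHEEL_SHA256)

print(demo_ok, demo_bad)
```

For a real check, assuming the wheel was downloaded to the current directory, you would call `matches_expected("thingsvision-2.2.8-py3-none-any.whl", EXPECTED_WHEEL_SHA256)`.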