Automatic CNN feature extraction and ML model comparison

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

CNN Feature Extractor

A Python package for automatic CNN feature extraction and ML model comparison. Extract features from images using pre-trained CNN models and evaluate multiple ML classifiers in one go.

Installation

pip install cnn_feature_extractor

Quick Start with CIFAR10

import torch
import torchvision
from torchvision import transforms
from cnn_feature_extractor import CNNFeatureExtractor

# Set image size
image_size = 128

# Define transforms
transform = transforms.Compose([
    transforms.Resize(image_size),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225])
])

# Load CIFAR10 dataset
train_dataset = torchvision.datasets.CIFAR10(
    root='./data', 
    train=True,
    download=True,
    transform=transform
)

val_dataset = torchvision.datasets.CIFAR10(
    root='./data', 
    train=False,
    download=True,
    transform=transform
)

# Create data loaders
train_loader = torch.utils.data.DataLoader(
    train_dataset,
    batch_size=32,
    shuffle=True,
    num_workers=4
)

val_loader = torch.utils.data.DataLoader(
    val_dataset,
    batch_size=32,
    shuffle=False,
    num_workers=4
)

# Initialize and run feature extraction + ML comparison
extractor = CNNFeatureExtractor(save_path='cifar10_results.csv')
results = extractor.fit(
    train_loader, 
    val_loader,

    # Example 1: Using specific models
    cnn_models=['resnet18', 'efficientnet_b0'],    

    # Example 2: Using the tiny package (fastest, good for testing)
    # cnn_models='tiny',  # This will use: mobilenet_v2, mobilenet_v3_small, efficientnet_b0, convnext_tiny, resnet18

    # Example 3: Mixing packages
    # cnn_models='tiny + small',  # This will combine models from both packages
    
    ml_models=['RandomForest', 'LogisticRegression']
)

Using Your Custom Dataset

Required Dataset Structure

dataset/
├── train/
│   ├── class1/
│   │   ├── image1.jpg
│   │   └── image2.jpg
│   └── class2/
│       ├── image3.jpg
│       └── image4.jpg
└── val/
    ├── class1/
    │   └── image5.jpg
    └── class2/
        └── image6.jpg

Custom Dataset Example

from cnn_feature_extractor import CNNFeatureExtractor
from cnn_feature_extractor.utils.dataset import load_custom_dataset

# Set image size and other parameters
image_size = 224  # Standard size for most CNN models
batch_size = 32
num_workers = 4

# Load your custom dataset
train_loader, val_loader, num_classes = load_custom_dataset(
    data_dir='path/to/your/dataset',  # Path to your dataset root directory
    batch_size=batch_size,
    num_workers=num_workers,
    image_size=image_size,
    augment=True  # Enable data augmentation (optional)
)

# Initialize feature extractor
extractor = CNNFeatureExtractor(save_path='results.csv')

# Run feature extraction and ML comparison
results = extractor.fit(
    train_loader, 
    val_loader,

    # Example 1: Using specific models
    cnn_models=['resnet18', 'efficientnet_b0'],    

    # Example 2: Using the tiny package (fastest, good for testing)
    # cnn_models='tiny',  # This will use: mobilenet_v2, mobilenet_v3_small, efficientnet_b0, convnext_tiny, resnet18

    # Example 3: Mixing packages
    # cnn_models='tiny + small',  # This will combine models from both packages
    
    ml_models=['RandomForest', 'LogisticRegression']
)

# Results will be saved to 'results.csv'
print(results)

Available Models

CNN Feature Extractors

Tiny Package (Fast, Lower Accuracy)

mobilenet_v2
mobilenet_v3_small
efficientnet_b0
convnext_tiny
resnet18

Small Package

resnet34
densenet121
mobilenet_v3_large
efficientnet_b1
convnext_small

Medium Package

resnet50
densenet169
vgg16
efficientnet_b2
convnext_base

Large Package

resnet101
densenet201
vgg19
efficientnet_b3
convnext_large

Biggest Package (Slow, Higher Accuracy)

resnet152
densenet201
efficientnet_b7
convnext_large
vgg19

ML Classifiers

RandomForest
SVM (with probability estimation)
LogisticRegression
GradientBoosting
XGBoost
LightGBM
KNN
DecisionTree
AdaBoost
GaussianNB
RidgeClassifier
SGDClassifier
LinearSVC

Package Usage Tips

Choosing CNN Models:
- Start with 'tiny' package models for quick experiments
- Use 'biggest' package models for maximum accuracy
- Mix models from different packages: cnn_models=['resnet18', 'efficientnet_b7']
Choosing ML Models:
- Start with fast models like LogisticRegression
- Use RandomForest or XGBoost for better accuracy
- Try multiple models: ml_models=['LogisticRegression', 'RandomForest', 'XGBoost']
Data Augmentation:
- Enable with augment=True in load_custom_dataset
- Helps prevent overfitting
- Especially useful for small datasets
GPU Usage:
- GPU is automatically used if available
- CNN feature extraction is significantly faster on GPU
- Some ML models (XGBoost, LightGBM) can also use GPU

License

MIT License

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

0.2.0

Mar 11, 2025

0.1.3

Mar 22, 2025

This version

0.1.2

Mar 9, 2025

0.1.1

Feb 7, 2025

0.1.0

Feb 7, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cnn_feature_extractor-0.1.2.tar.gz (10.9 kB view details)

Uploaded Mar 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

cnn_feature_extractor-0.1.2-py3-none-any.whl (11.7 kB view details)

Uploaded Mar 9, 2025 Python 3

File details

Details for the file cnn_feature_extractor-0.1.2.tar.gz.

File metadata

Download URL: cnn_feature_extractor-0.1.2.tar.gz
Upload date: Mar 9, 2025
Size: 10.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for cnn_feature_extractor-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`42a3b20a0a4cee9ee7918beede09655935904724954f395bed85e172d6a85ff3`
MD5	`95d14da5656707da7edd718e0d12b58c`
BLAKE2b-256	`b7f6a65727ba22f49b90cef8817dccd1d4d80556309b25f4657334db5e24fd02`

See more details on using hashes here.

File details

Details for the file cnn_feature_extractor-0.1.2-py3-none-any.whl.

File metadata

Download URL: cnn_feature_extractor-0.1.2-py3-none-any.whl
Upload date: Mar 9, 2025
Size: 11.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for cnn_feature_extractor-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d711ee274ee335393cb8ccd7f5c201ec70ece19346f322fcc0dc42538fa9d5c6`
MD5	`454da0109ec24d6a0125b2f0b896ee88`
BLAKE2b-256	`66a323694f32ebf80b0ca17fbebae0ac86b2507d400f2ae8ee9c21a9ec4c0cd3`

See more details on using hashes here.

cnn-feature-extractor 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

CNN Feature Extractor

Installation

Quick Start with CIFAR10

Using Your Custom Dataset

Required Dataset Structure

Custom Dataset Example

Available Models

CNN Feature Extractors

Tiny Package (Fast, Lower Accuracy)

Small Package

Medium Package

Large Package

Biggest Package (Slow, Higher Accuracy)

ML Classifiers

Package Usage Tips

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes