TransNetV2 PyTorch implementation for video scene detection

These details have not been verified by PyPI

Project links

Project description

TransNet V2: Shot Boundary Detection Neural Network (PyTorch)

This repository contains a PyTorch implementation of TransNet V2: An effective deep network architecture for fast shot transition detection.

This is a PyTorch reimplementation of the TransNetV2 model that produces identical results as the original TensorFlow version. The code is for inference only.

Performance

Our reevaluation of other publicly available state-of-the-art shot boundary methods (F1 scores):

Model	ClipShots	BBC Planet Earth	RAI
TransNet V2	77.9	96.2	93.9
TransNet (github)	73.5	92.9	94.3
Hassanien et al. (github)	75.9	92.6	93.9
Tang et al., ResNet baseline (github)	76.1	89.3	92.8

Installation

pip install transnetv2-pytorch

Or install from source:

git clone https://github.com/allenday/transnetv2_pytorch.git
cd transnetv2_pytorch
pip install -e .

Usage

Command Line Interface

The package provides both a direct command and Python module execution:

# Direct command
transnetv2_pytorch path/to/video.mp4

# Python module execution
python -m transnetv2_pytorch path/to/video.mp4

CLI Arguments

# Basic usage
transnetv2_pytorch path/to/video.mp4

# Specify output file
transnetv2_pytorch path/to/video.mp4 --output predictions.txt

# Use specific device
transnetv2_pytorch path/to/video.mp4 --device cuda

# Get help for all options
transnetv2_pytorch --help

Python API

import torch
from transnetv2_pytorch import TransNetV2

# Initialize model
model = TransNetV2()
model.eval()

# Automatic device selection
if torch.cuda.is_available():
    model = model.cuda()
elif torch.backends.mps.is_available():
    model = model.to('mps')

with torch.no_grad():
    # Input shape: batch_size x video_frames x height x width x channels (RGB)
    input_video = torch.zeros(1, 100, 27, 48, 3, dtype=torch.uint8)
    
    # Move to same device as model
    input_video = input_video.to(next(model.parameters()).device)
    
    single_frame_pred, all_frame_pred = model(input_video)
    
    # Get predictions
    single_frame_pred = torch.sigmoid(single_frame_pred).cpu().numpy()
    all_frame_pred = torch.sigmoid(all_frame_pred["many_hot"]).cpu().numpy()
    
    # Find shot boundaries (example)
    shot_boundaries = single_frame_pred > 0.5

Advanced Usage

# Custom device handling
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
model = TransNetV2(device=device)

# Load custom weights
model = TransNetV2()
state_dict = torch.load('custom_weights.pth', map_location='cpu')
model.load_state_dict(state_dict)

# Batch processing
batch_size = 4
for batch in video_batches:
    predictions = model(batch)
    # Process predictions...

Device Support

This implementation supports:

CPU: Works on all systems
CUDA: For NVIDIA GPUs
MPS: For Apple Silicon Macs (automatic fallback for unsupported operations)

The model automatically detects and uses the best available device. For MPS devices, unsupported operations (like 3D convolutions) automatically fall back to CPU.

Original Work & Training

This PyTorch implementation is based on the original TensorFlow version. For:

Training code and datasets
TensorFlow implementation
Weight conversion utilities
Research replication

Please visit the original repository: soCzech/TransNetV2

Credits

Original Work

This PyTorch implementation is based on the original TensorFlow TransNet V2 by Tomáš Souček and Jakub Lokoč.

If found useful, please cite the original work:

@article{soucek2020transnetv2,
    title={TransNet V2: An effective deep network architecture for fast shot transition detection},
    author={Sou{\v{c}}ek, Tom{\'a}{\v{s}} and Loko{\v{c}}, Jakub},
    year={2020},
    journal={arXiv preprint arXiv:2008.04838},
}

PyTorch Implementation

This production-ready PyTorch package was developed by [Your Name] with significant improvements including:

Complete PyTorch reimplementation for inference
Cross-platform device support (CPU, CUDA, MPS)
Command-line interface
Package distribution and installation
Comprehensive testing and error handling

License

MIT License

See the original TransNetV2 repository for the original license.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.5

Jun 1, 2025

1.0.4

May 31, 2025

1.0.3

May 30, 2025

1.0.2

May 30, 2025

1.0.1

May 28, 2025

This version

1.0.0

May 27, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

transnetv2_pytorch-1.0.0.tar.gz (28.0 MB view details)

Uploaded May 27, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

transnetv2_pytorch-1.0.0-py3-none-any.whl (28.0 MB view details)

Uploaded May 27, 2025 Python 3

File details

Details for the file transnetv2_pytorch-1.0.0.tar.gz.

File metadata

Download URL: transnetv2_pytorch-1.0.0.tar.gz
Upload date: May 27, 2025
Size: 28.0 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for transnetv2_pytorch-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`e0cdc5a367e1657a5c7d929d5c4e62cb5774b337d8236b90d20e2ce781e439a4`
MD5	`140f2c15b536dd9965238c47e744d360`
BLAKE2b-256	`6decc5efbf7513bdfa1143d3c48b58bfaa14374a4d7e63725ab9e616ba384cac`

See more details on using hashes here.

File details

Details for the file transnetv2_pytorch-1.0.0-py3-none-any.whl.

File metadata

Download URL: transnetv2_pytorch-1.0.0-py3-none-any.whl
Upload date: May 27, 2025
Size: 28.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for transnetv2_pytorch-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0d611aa92a79c163d8e605b3434c084bf1a5386abbecfed785e62ecfdace2271`
MD5	`dcf258db8f49d0b60ccdba4fb7e51d91`
BLAKE2b-256	`7dab72ab78e645d90122d040cb16675fc4d79764702a1f7aba9dcd1f514759a4`

See more details on using hashes here.

transnetv2-pytorch 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TransNet V2: Shot Boundary Detection Neural Network (PyTorch)

Performance

Installation

Usage

Command Line Interface

CLI Arguments

Python API

Advanced Usage

Device Support

Original Work & Training

Credits

Original Work

PyTorch Implementation

Related Papers

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes