A pytorch compatible video dataset that is fully customizable.

These details have not been verified by PyPI

Project description

Video Dataset

This is a python library to create a video dataset. The project is inspired from Video-Dataset-Loading-Pytorch but with a lot of additional features and modifications.

The goal is to have a very moldable and customizable video dataset that can be reused in all possible video dataset situations.

Installation

pip install video-dataset

Dataset Structures

The General dataset structure is the one specified below. One global directory where there is a sub directory for the videos and another one for the annotations, ids files are optional.

- your-dataset
- - videos
- - - video-1
- - - video-2
- - - ...
- - annotations
- - - video-1
- - - video-2
- - - ...
training_ids.txt
testing_ids.txt
validation_ids.txt

An important thing is that each the video must be named (except for the extension) the same way as it's corresponding annotation in order for the VideoDataset to correctly detect it.

When defining a video-dataset multiple components need to be defined:

videos_dir: The path were the videos are stored.
annotations_dir The path were the annotations are stored.
segment_size: The desired number of frames per video $*_1$.
video_processor: Will be in charge to read the video $*_2$.
annotations_processor: Will be in charge to read the annotations $*_2$.

$*_1$: Suppose your videos contain 100 frames and you put segment_size=10; from each video you'll have 10 sub videos of 10 frames each. You an also consider the whole video by putting segment_size=-1.

$*_2:$ In the package, a number of predefined video and annotations processor are available and cover practically any case you can encounter, but it is also possible to defined a custom video or annotation processor and use it with the video-dataset.

Video Processors

The Dataset supports multiple video formats, all the supported formats are presented below:

Raw Video Representation

In this format each element in the videos directory need to be a video file (with any of the supported video extensions).

For example:

- your-dataset
- - videos
- - - video-1.mp4
- - - video-2.mp4
- - - ...
- - annotations
- - - ...

The corresponding VideoDataset:

from video_dataset import VideoDataset
from video_dataset.video import VideoFromVideoFile
from video_dataset.annotations import AnnotationsFromFrameLevelTxtFileAnnotations

video_processor: Type[Video]
annotations_processor: Type[Annotations]

dataset = VideoDataset(
    videos_dir="./dataset/videos",
    annotations_dir="./dataset/annotations",
    segment_size=32,
    video_processor=VideoFromVideoFile,
    annotations_processor=AnnotationsFromFrameLevelTxtFileAnnotations,
)

Frame Level Video Representation

Having the elements of the videos directory as raw videos can be quite slow when loading the videos, an alternative approach is that each element of the videos directory is a directory it self with the name of the video and the content of the directory is images where each image represent a single frame of the video.

- your-dataset
- - videos
- - - video-1
- - - - img_00001.jpg
- - - - img_00002.jpg
- - - - img_00003.jpg
- - - - ...
- - - video-2
- - - ...
- - annotations
- - - ...

The corresponding VideoDataset:

from video_dataset import VideoDataset
from video_dataset.video import VideoFromVideoFramesDirectory
from video_dataset.annotations import AnnotationsFromFrameLevelTxtFileAnnotations

video_processor: Type[Video]
annotations_processor: Type[Annotations]

dataset = VideoDataset(
    videos_dir="./dataset/videos",
    annotations_dir="./dataset/annotations",
    segment_size=32,
    video_processor=VideoFromVideoFramesDirectory,
    annotations_processor=AnnotationsFromFrameLevelTxtFileAnnotations,
)

This significantly reduces video loading time but at the cost of storage space.

Custom Processor

In order to create a custom video processor you basically need to create a class that implements the Video class as follow:

from video_dataset.video import Video

class CustomVideoProcessor(Video):
    def __init__(self, videos_dir_path: str, id: str):
        ...

    def get_id(self):
        return self.id

    def __len__(self):
        ...

    def __getitem__(self, index: int | slice):
        """
        Return the corresponding video frame(s) requested by the index.
        """
        ...

Annotations Processors

Your video annotations files can be in multiple formats.

Whole Video Annotations

A single csv or txt file describing the classes / labels of all the videos.

Implementation. Coming Soon..

Frame By Frame Annotations

Each video have a corresponding txt file where each line in the file correspond to a class / label / annotation of a frame in the video.

eating
eating
eating
eating
eating
eating
eating
...

The corresponding VideoDataset:

from video_dataset import VideoDataset
from video_dataset.video import VideoFromVideoFile
from video_dataset.annotations import AnnotationsFromFrameLevelTxtFileAnnotations

video_processor: Type[Video]
annotations_processor: Type[Annotations]

dataset = VideoDataset(
    videos_dir="./dataset/videos",
    annotations_dir="./dataset/annotations",
    segment_size=32,
    video_processor=VideoFromVideoFile,
    annotations_processor=AnnotationsFromFrameLevelTxtFileAnnotations,
)

Segment Level Annotations

Each video has a corresponding csv file with the following structure:

acton	starting-timestamp	duration
eating	0	4000
dancing	4000	6000
eating	10000	8000

The corresponding VideoDataset:

from video_dataset import VideoDataset
from video_dataset.video import VideoFromVideoFile
from video_dataset.annotations import AnnotationsFromSegmentLevelCsvFileAnnotations

video_processor: Type[Video]
annotations_processor: Type[Annotations]

dataset = VideoDataset(
    videos_dir="./dataset/videos",
    annotations_dir="./dataset/annotations",
    segment_size=32,
    video_processor=VideoFromVideoFile,
    annotations_processor=AnnotationsFromSegmentLevelCsvFileAnnotations,
)

Custom Processor

In order to create a custom annotations processor you basically need to create a class that implements the Annotations class as follow:

from video_dataset.annotations import Annotations

class CustomAnnotationsProcessor(Annotations):
    def __init__(self, annotations_dir_path: str, id: str):
        ...

    def get_id(self):
        return self.id

    @abstractmethod
    def __getitem__(self, index: int | slice):
        """
        Get the annotation(s) of the video file corresponding to the given frame(s) index / indices.
        Note that even if an index is given the annotations will be returned in a batch format (Number of frames, Height, Width, Channels).
        """
        ...

Contributions

All contributions are welcome, just open a pull request.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.0

Mar 6, 2025

0.3.0.dev20250322125011 pre-release

Mar 22, 2025

0.3.0.dev20250320163639 pre-release

Mar 20, 2025

0.3.0.dev20250317144530 pre-release

Mar 17, 2025

0.3.0.dev20250316222650 pre-release

Mar 16, 2025

0.3.0.dev20250316222229 pre-release

Mar 16, 2025

This version

0.3.0.dev20250316102620 pre-release

Mar 16, 2025

0.3.0.dev20250316101233 pre-release

Mar 16, 2025

0.3.0.dev20250311201225 pre-release

Mar 11, 2025

0.3.0.dev20250308093214 pre-release

Mar 8, 2025

0.3.0.dev20250306131850 pre-release

Mar 6, 2025

0.2.9

Mar 4, 2025

0.2.9.dev20250306131823 pre-release

Mar 6, 2025

0.2.9.dev20250305153439 pre-release

Mar 5, 2025

0.2.9.dev20250305133132 pre-release

Mar 5, 2025

0.2.9.dev20250304145955 pre-release

Mar 4, 2025

0.2.8

Mar 3, 2025

0.2.8.dev20250304145404 pre-release

Mar 4, 2025

0.2.8.dev20250304144634 pre-release

Mar 4, 2025

0.2.8.dev20250303194254 pre-release

Mar 3, 2025

0.2.8.dev20250303155656 pre-release

Mar 3, 2025

0.2.8.dev20250303125632 pre-release

Mar 3, 2025

0.2.8.dev20250303125515 pre-release

Mar 3, 2025

0.2.8.dev20250303125207 pre-release

Mar 3, 2025

0.2.8.dev20250303124730 pre-release

Mar 3, 2025

0.2.8.dev20250303124538 pre-release

Mar 3, 2025

0.2.8.dev20250303123112 pre-release

Mar 3, 2025

0.2.7

Mar 2, 2025

0.2.7.dev20250303122654 pre-release

Mar 3, 2025

0.2.6

Mar 2, 2025

0.2.5

Mar 2, 2025

0.2.5.dev20250302224604 pre-release

Mar 2, 2025

0.2.5.dev20250302224148 pre-release

Mar 2, 2025

0.2.4

Mar 2, 2025

0.2.3

Mar 2, 2025

0.2.2

Mar 2, 2025

0.2.1

Mar 2, 2025

0.2.0

Mar 2, 2025

0.1.5

Mar 2, 2025

0.1.4

Mar 2, 2025

0.1.0

Mar 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

video_dataset-0.3.0.dev20250316102620.tar.gz (10.9 kB view details)

Uploaded Mar 16, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

video_dataset-0.3.0.dev20250316102620-py3-none-any.whl (12.2 kB view details)

Uploaded Mar 16, 2025 Python 3

File details

Details for the file video_dataset-0.3.0.dev20250316102620.tar.gz.

File metadata

Download URL: video_dataset-0.3.0.dev20250316102620.tar.gz
Upload date: Mar 16, 2025
Size: 10.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.12.9 Linux/6.8.0-1021-azure

File hashes

Hashes for video_dataset-0.3.0.dev20250316102620.tar.gz
Algorithm	Hash digest
SHA256	`7dbee37453be221b3e9dd1a8aee97ae7a1a7f59287926f3f354aa5236567c4dc`
MD5	`54d52461e0864c9c7ff238444a0feb60`
BLAKE2b-256	`a71bc594fa11fa62772a1f1c13fb7d205cd78ac7e00415acd70a9b4310233c78`

See more details on using hashes here.

File details

Details for the file video_dataset-0.3.0.dev20250316102620-py3-none-any.whl.

File metadata

Download URL: video_dataset-0.3.0.dev20250316102620-py3-none-any.whl
Upload date: Mar 16, 2025
Size: 12.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.1.1 CPython/3.12.9 Linux/6.8.0-1021-azure

File hashes

Hashes for video_dataset-0.3.0.dev20250316102620-py3-none-any.whl
Algorithm	Hash digest
SHA256	`722f0b51e465f3e097ab6b8dadd35b91cecef8f79d3bdf63b3882aca4dc89f07`
MD5	`0575bf79221ad3a30ada0b5b406b17a2`
BLAKE2b-256	`d67863fc555fa4529fd9487237531634f9e42f2bb515013bca6920128bf511ed`

See more details on using hashes here.

video-dataset 0.3.0.dev20250316102620

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Video Dataset

Installation

Dataset Structures

Video Processors

Raw Video Representation

Frame Level Video Representation

Custom Processor

Annotations Processors

Whole Video Annotations

Frame By Frame Annotations

Segment Level Annotations

Custom Processor

Contributions

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes