Skip to main content

A dead simple Keras HDF5 ImageDataGenerator

Project description

Keras HDF5 ImageDataGenerator

A dead simple Keras HDF5 ImageDataGenerator.

Overview

Sometimes you'd like to work with large scale image datasets that cannot fit into the memory. Keras provides data generators to feed your network with mini-batch of data directly from a directory, simply by passing the image paths. But this method is terribly inefficient because during training, the model has to deal with massive I/Os operations on disk which introduces huge latency.

A more efficient way is to take advantage of HDF5 data structure which is optimized for I/O operations. The idea is to (1) store your raw images (and their labels) to an HDF5 file, and to (2) create a generator that will load and preprocess mini-batches in real-time.

Installation / Usage

To install use pip:

$ pip install hdf5generator

Or clone the repo:

$ git clone https://github.com/angulartist/hdf5generator.git
$ python setup.py install

Contributing

michel @angulartist

Example

Please take a look the to generator.py file to see very basic examples.

TODO LIST

  • Generator
  • [] Docs
  • []
  • []

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

h5imagegenerator-1.1.0.tar.gz (4.7 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page