Skip to main content

A utility to pack, unpack and inspect HDF5 audio datasets

Project description

h5pack

h5pack is a utility made to pack, analyze and unpack HDF5 audio datasets using data and annotations of various sources. HDF5 is an open-source file format for storing large, complex data. It uses a directory-like structure to organize data within the file. Click here to learn more about HDF5. Among other, HDF5 provides:

  • Efficient storage: Handles large datasets with chunking and compression, reducing disk space usage.
  • Fast I/O: Optimized for rapid reading/writing of multi-dimensional data, improving data loading speeds.
  • Hierarhical structures: Organizes metadata, features, and audio data efficiently within a single file.
  • Concurrent access: Supports concurrent access, enabling efficient multi-threaded data loading.
  • Scalability: Handles massive datasets without performance degradation, making it ideal for large-scale machine learning tasks.
  • Cross-platform: Works across different OS and programming languages, ensuring flexibility.

h5pack was made to go from raw files to HDF5 files and back in a robust, consistent and simple way. It provides a collection of tools to facilitate all the necessary tasks to make it possible:

  • h5pack pack: Converts raw data and/or annotation files into an HDF5 file.
  • h5pack unpack: Extracts raw data from an HDF5 file, allowing regeneration of the original input data.
  • h5pack virtual: Creates a virtual dataset by combining multiple datasets into a single logical dataset without duplication, enabling seamless access to fragmented or distributed data.
  • h5pack info: Displays the contents of an HDF5 file generated with h5pack, providing a quick overview of its structure.
  • h5pack checksum: Verifies the integrity of an HDF5 file by checking its checksum to detect potential corruption.

Table of contents

Installation

See how to install h5pack here.

Quickstart

Explore the Quickstart Guide to start using h5pack right away. It offers step-by-step instructions to get you set up quickly and effortlessly.

Documentation

Access the Documentation to explore all the tools included in this package.

Cite

h5pack is free and open source. If this package contributed to your work, please consider citing it:

@misc{h5pack,
  author = {Esteban Gómez},
  title  = {h5pack},
  year   = 2025,
  url    = {https://github.com/eagomez2/h5pack}
}

This package was developed by Esteban Gómez, member of the Speech Interaction Technology group from Aalto University.

License

For further details about the license of this package, please see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

h5pack-1.1.0.tar.gz (31.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

h5pack-1.1.0-py3-none-any.whl (38.7 kB view details)

Uploaded Python 3

File details

Details for the file h5pack-1.1.0.tar.gz.

File metadata

  • Download URL: h5pack-1.1.0.tar.gz
  • Upload date:
  • Size: 31.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.5

File hashes

Hashes for h5pack-1.1.0.tar.gz
Algorithm Hash digest
SHA256 7901e6ee0c3a7e7e35fdd4f350c413f4e20215f220ead6c69aa19c0decf5b638
MD5 557d889fcea3bc30f30da7db2f95bbcd
BLAKE2b-256 f6c7d2e48302eb34013e5309260762c4f524f483991d337d7e40228d5dae96d3

See more details on using hashes here.

File details

Details for the file h5pack-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: h5pack-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 38.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.5

File hashes

Hashes for h5pack-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 938fdcfb691d756ab634b7213f326e74ec16bde49ec74827fa4d4c84afe749e6
MD5 b80dc2a79a8921b35385a1dbdffeaf15
BLAKE2b-256 1a802cc4ca9d30732d0a3f9f4f66da322b324a43d6e81a2f8a3756d1c87148bb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page