Skip to main content

A tool for dumping datasets from the Hugging Face datasets library

Project description

datasets-dump

Dump embedded datasets to audio folder or images folder.

Get the audio folder / image folder back from parquet files.

usage

Usage

datasets-dump someone/dataset ./dist

Python API:

def dump(
    dataset: Union[str, Dataset, DatasetDict],
    dist: str | Path,
    audio_column: Optional[str] = None,
    image_column: Optional[str] = None,
    metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasets_dump-0.1.3.tar.gz (5.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

datasets_dump-0.1.3-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file datasets_dump-0.1.3.tar.gz.

File metadata

  • Download URL: datasets_dump-0.1.3.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for datasets_dump-0.1.3.tar.gz
Algorithm Hash digest
SHA256 744dd745405c3fe670fae67b4b45f56da85165f92ddeacd13752f5761568efd4
MD5 bcd0a533af8069613acb20e060bb22b6
BLAKE2b-256 6fe488acbd9e99f0e1edcaeb94f955d98e56b69a279a72998304075cbff5db0a

See more details on using hashes here.

File details

Details for the file datasets_dump-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: datasets_dump-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for datasets_dump-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 052d0ff5dbd350673106f87ae11ddb5761238df7e69afb920ff3496ef9a38f7c
MD5 0e624f3c875f1a6f8bda8d561a2c28b3
BLAKE2b-256 5d4e2d9eb43b37d96b0759d1cd3f4767cfd51098853a932599e3bf9e2e151680

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page