Skip to main content

A tool for dumping datasets from the Hugging Face datasets library

Project description

datasets-dump

Dump embedded datasets to audio folder or images folder.

Get the audio folder / image folder back from parquet files.

Usage

datasets-dump someone/dataset ./dist

Python API:

def dump(
    dataset: Union[str, Dataset],
    dist: str | Path,
    audio_column: Optional[str] = None,
    image_column: Optional[str] = None,
    metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasets_dump-0.1.0.tar.gz (4.9 kB view details)

Uploaded Source

Built Distribution

datasets_dump-0.1.0-py3-none-any.whl (4.9 kB view details)

Uploaded Python 3

File details

Details for the file datasets_dump-0.1.0.tar.gz.

File metadata

  • Download URL: datasets_dump-0.1.0.tar.gz
  • Upload date:
  • Size: 4.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for datasets_dump-0.1.0.tar.gz
Algorithm Hash digest
SHA256 2041c0ff3759b88b51af37807ac7b59ac21ca15390c7c320108ffd35229d45e6
MD5 8409a884625b31435cbc376f4a28b222
BLAKE2b-256 f07d1b903657023707b850a5758a724647d23c242f2ff22bf20c830881124d1f

See more details on using hashes here.

File details

Details for the file datasets_dump-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for datasets_dump-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 31c3592a02aed94eea6834e6184e2c304842e0d279cfd6172c55efeb83906da8
MD5 8411938637f9d976bcd5199195326c0b
BLAKE2b-256 84d32a1f3686838204843edfd5a32706aa26a5a6f4e2c5ae70a6450bddec4d80

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page