Skip to main content

A tool for dumping datasets from the Hugging Face datasets library

Project description

datasets-dump

Dump embedded datasets to audio folder or images folder.

Get the audio folder / image folder back from parquet files.

usage

Usage

datasets-dump someone/dataset ./dist

Python API:

def dump(
    dataset: Union[str, Dataset, DatasetDict],
    dist: str | Path,
    audio_column: Optional[str] = None,
    image_column: Optional[str] = None,
    metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasets_dump-0.1.2.tar.gz (5.4 kB view details)

Uploaded Source

Built Distribution

datasets_dump-0.1.2-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file datasets_dump-0.1.2.tar.gz.

File metadata

  • Download URL: datasets_dump-0.1.2.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for datasets_dump-0.1.2.tar.gz
Algorithm Hash digest
SHA256 cdf1bbb9cfcd46929fdb9c5948045a65475431736ace9ce94644e9ead4876b78
MD5 1172304f94a2e843bfed01ab7feafb70
BLAKE2b-256 62240711057aede69469d33e891b397f5ee9ba7863a38592d8e8d197312f0ab3

See more details on using hashes here.

File details

Details for the file datasets_dump-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for datasets_dump-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 dfa7131b6e76c1b154d43f164a6e5a49599416697df17043f4e3b9487769b859
MD5 7fe669accb57b780f8e331bef86bec3d
BLAKE2b-256 0474d209dc67ec2fca193723fbe3422c8e2e764fddb12b99041a10f1df89d2fa

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page