Skip to main content

A tool for dumping datasets from the Hugging Face datasets library

Project description

datasets-dump

Dump embedded datasets to audio folder or images folder.

Get the audio folder / image folder back from parquet files.

Usage

datasets-dump someone/dataset ./dist

Python API:

def dump(
    dataset: Union[str, Dataset],
    dist: str | Path,
    audio_column: Optional[str] = None,
    image_column: Optional[str] = None,
    metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasets_dump-0.1.1.tar.gz (5.2 kB view details)

Uploaded Source

Built Distribution

datasets_dump-0.1.1-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file datasets_dump-0.1.1.tar.gz.

File metadata

  • Download URL: datasets_dump-0.1.1.tar.gz
  • Upload date:
  • Size: 5.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.15

File hashes

Hashes for datasets_dump-0.1.1.tar.gz
Algorithm Hash digest
SHA256 c0a2710f9b3b58f5d5c6c647cf56942471f36d6cbecbd46b6731131e451bbbc9
MD5 e07d04b0bc4d00be9a077f9243c6b9fe
BLAKE2b-256 414a8a5129a8b7cf95a56aa66bb17b42317e8ac0b81e61f5260907a0ffc0c04e

See more details on using hashes here.

File details

Details for the file datasets_dump-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for datasets_dump-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3d055857723bd201824f840d7e0fdb4bd7b1b1c527d27e29ab2be76224f77cc7
MD5 b2aed4128dc230ec509d6e858781dcb3
BLAKE2b-256 09ecee11f723c589f57643568ad1b17468a59a2eda7bd5298195e5db5aa8efc1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page