A tool for dumping datasets from the Hugging Face datasets library
Project description
datasets-dump
Dump embedded datasets to audio folder or images folder.
Get the audio folder / image folder back from parquet files.
Usage
datasets-dump someone/dataset ./dist
Python API:
def dump(
dataset: Union[str, Dataset, DatasetDict],
dist: str | Path,
audio_column: Optional[str] = None,
image_column: Optional[str] = None,
metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
datasets_dump-0.1.3.tar.gz
(5.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file datasets_dump-0.1.3.tar.gz.
File metadata
- Download URL: datasets_dump-0.1.3.tar.gz
- Upload date:
- Size: 5.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
744dd745405c3fe670fae67b4b45f56da85165f92ddeacd13752f5761568efd4
|
|
| MD5 |
bcd0a533af8069613acb20e060bb22b6
|
|
| BLAKE2b-256 |
6fe488acbd9e99f0e1edcaeb94f955d98e56b69a279a72998304075cbff5db0a
|
File details
Details for the file datasets_dump-0.1.3-py3-none-any.whl.
File metadata
- Download URL: datasets_dump-0.1.3-py3-none-any.whl
- Upload date:
- Size: 5.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
052d0ff5dbd350673106f87ae11ddb5761238df7e69afb920ff3496ef9a38f7c
|
|
| MD5 |
0e624f3c875f1a6f8bda8d561a2c28b3
|
|
| BLAKE2b-256 |
5d4e2d9eb43b37d96b0759d1cd3f4767cfd51098853a932599e3bf9e2e151680
|