A tool for dumping datasets from the Hugging Face datasets library
Project description
datasets-dump
Dump embedded datasets to audio folder or images folder.
Get the audio folder / image folder back from parquet files.
Usage
datasets-dump someone/dataset ./dist
Python API:
def dump(
dataset: Union[str, Dataset, DatasetDict],
dist: str | Path,
audio_column: Optional[str] = None,
image_column: Optional[str] = None,
metadata_format: Literal["jsonl", "csv"] = "jsonl",
) -> None
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
datasets_dump-0.1.4.tar.gz
(5.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file datasets_dump-0.1.4.tar.gz.
File metadata
- Download URL: datasets_dump-0.1.4.tar.gz
- Upload date:
- Size: 5.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
804657524a28869dc5f20e3b27272face5d26169c09862eac0f608c23bdf2ac2
|
|
| MD5 |
189c460f3351b071c243fe199054fc1e
|
|
| BLAKE2b-256 |
06c8ead077efcfcb4db3da6d2212ccb9ecafd4d98790fb2529b0a9f34cd2ced9
|
File details
Details for the file datasets_dump-0.1.4-py3-none-any.whl.
File metadata
- Download URL: datasets_dump-0.1.4-py3-none-any.whl
- Upload date:
- Size: 5.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.10.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7f1d1642e162ccdfdfc0c50d2dfee9d346fed3f1db45dc6cf40d3af6fe2bb710
|
|
| MD5 |
2662636dc8a35c6c126a95ea80ee7e9a
|
|
| BLAKE2b-256 |
afa9efb25925b99a50a8d1752fefe65d3f846ebc1d2ef996b43c9092a014406f
|