
A wrapper for Hugging Face datasets with additional utilities

🤗 datasets-plus

A wrapper for Hugging Face datasets with extra utilities! 🚀

🌟 Features

  • 🔧 Simplified dataset loading
  • 🔀 Easy splitting and configuration
  • 📁 Support for local and remote datasets
  • 🧰 Additional utility functions

🚀 Installation

Install datasets-plus using pip:

pip install datasets-plus

📚 Usage

Here's a quick example of how to use datasets-plus:

from datasets_plus import load_dataset

# Load the validation split of TriviaQA's "unfiltered" configuration
dataset = load_dataset("mandarjoshi/trivia_qa:unfiltered:validation")

# Print dataset info
print(f"Loaded dataset with {len(dataset)} examples")
print("First example:", dataset[0])

# Load the train split of a local Hugging Face dataset saved at /path/to/dataset
dataset = load_dataset("/path/to/dataset:train")
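The strings passed to load_dataset above look like colon-separated specifiers: a dataset path, an optional configuration name, and a split. The sketch below shows one way such a specifier could be parsed. It is an illustration inferred from the two examples, not the actual datasets-plus implementation; the names DatasetSpec and parse_spec are hypothetical.

```python
from typing import NamedTuple, Optional


class DatasetSpec(NamedTuple):
    """Hypothetical container for the parts of a specifier string."""
    path: str
    config: Optional[str]
    split: Optional[str]


def parse_spec(spec: str) -> DatasetSpec:
    """Parse "path", "path:split", or "path:config:split".

    Assumption: the format is inferred from the README examples only.
    Paths that themselves contain ":" (e.g. Windows drive letters)
    are not handled by this sketch.
    """
    parts = spec.split(":")
    if not parts[0]:
        raise ValueError(f"missing dataset path in {spec!r}")
    if len(parts) == 1:
        return DatasetSpec(parts[0], None, None)
    if len(parts) == 2:
        # Two parts: treat the second as the split, as in "/path/to/dataset:train".
        return DatasetSpec(parts[0], None, parts[1] or None)
    if len(parts) == 3:
        # Three parts: path, configuration name, split.
        return DatasetSpec(parts[0], parts[1] or None, parts[2] or None)
    raise ValueError(f"expected at most two ':' separators in {spec!r}")


if __name__ == "__main__":
    print(parse_spec("mandarjoshi/trivia_qa:unfiltered:validation"))
    print(parse_spec("/path/to/dataset:train"))
```

With the plain Hugging Face datasets library, the parsed fields would presumably map onto datasets.load_dataset(path, name=config, split=split) for Hub datasets; how datasets-plus resolves local paths internally is not documented here.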

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

🙏 Acknowledgements

Happy data loading! 🎉
