Skip to main content

A wrapper for HuggingFace datasets with additional utilities

Project description

🤗 datasets-plus

A wrapper for Hugging Face datasets with extra utilities! 🚀

🌟 Features

  • 🔧 Simplified dataset loading
  • 🔀 Easy splitting and configuration
  • 📁 Support for local and remote datasets
  • 🧰 Additional utility functions

🚀 Installation

Install datasets-plus using pip:

pip install datasets-plus

📚 Usage

Here's a quick example of how to use datasets-plus:

from datasets_plus import load_dataset

# Load validation fold of TriviaQA's unfiltered subset
dataset = load_dataset("mandarjoshi/trivia_qa:unfiltered:validation")

# Print dataset info
print(f"Loaded dataset with {len(dataset)} examples")
print("First example:", dataset[0])

# Load the train fold of the local hf dataset saved at /path/to/dataset
dataset = load_dataset("/path/to/dataset:train")

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgements

Happy data loading! 🎉

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datasets_plus-0.1.0.tar.gz (5.0 kB view hashes)

Uploaded Source

Built Distribution

datasets_plus-0.1.0-py3-none-any.whl (4.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page