
Prepare your dataset for fine-tuning LLMs

Project description

Dataset Preparation for Transformers Fine-tuning


The Dataset Prep Transformers package (tydataprep) simplifies preparing datasets for training or fine-tuning large language models from the Hugging Face Transformers library. Whether the dataset comes from Hugging Face or is your own, the package handles the data integration step so that the data is ready for training.

Features

  • Integrate your dataset with Hugging Face Transformers models for training or fine-tuning.
  • Specify a model repository ID and a dataset from Hugging Face, and the package fetches and configures the data automatically.
  • Use a custom dataset by passing it to the package as input.
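
This page does not document the package's API, so the sketch below does not call tydataprep. It only illustrates, in plain Python, the kind of work that preparing a custom dataset for fine-tuning usually involves: rendering raw records into a single text field and splitting them into training and validation sets. The names `TEMPLATE`, `format_example`, and `prepare_dataset`, the prompt template, and the `prompt`/`response` record fields are all assumptions made for illustration.

```python
import random

# Hypothetical prompt template; not taken from tydataprep.
TEMPLATE = "### Instruction:\n{prompt}\n\n### Response:\n{response}"


def format_example(record, template=TEMPLATE):
    """Render one raw record into a single training text field."""
    text = template.format(prompt=record["prompt"], response=record["response"])
    return {"text": text}


def prepare_dataset(records, eval_fraction=0.2, seed=0):
    """Filter, format, shuffle, and split records (illustrative helper)."""
    # Drop records whose prompt or response is missing or empty.
    usable = [r for r in records if r.get("prompt") and r.get("response")]
    examples = [format_example(r) for r in usable]
    # Shuffle with a fixed seed so the split is reproducible.
    random.Random(seed).shuffle(examples)
    n_eval = int(len(examples) * eval_fraction)
    return {"train": examples[n_eval:], "validation": examples[:n_eval]}


raw_records = [
    {"prompt": "Translate 'bonjour' to English.", "response": "hello"},
    {"prompt": "What is 2 + 2?", "response": "4"},
    {"prompt": "Name a primary color.", "response": "red"},
    {"prompt": "This record has no answer.", "response": ""},
]
splits = prepare_dataset(raw_records, eval_fraction=0.5)
print(len(splits["train"]), len(splits["validation"]))
```

The resulting lists of `{"text": ...}` dictionaries are in a shape that tokenization and training tools commonly accept, but the exact format tydataprep expects or produces should be checked against the package itself.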

Installation

You can install the package using pip:

pip install tydataprep

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tydataprep-0.0.1.0.tar.gz (3.0 kB)

Uploaded Source

Built Distribution

tydataprep-0.0.1.0-py3-none-any.whl (4.0 kB)

Uploaded Python 3

File details

Details for the file tydataprep-0.0.1.0.tar.gz.

File metadata

  • Download URL: tydataprep-0.0.1.0.tar.gz
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for tydataprep-0.0.1.0.tar.gz
Algorithm    Hash digest
SHA256       594d164080730d6ce6314ae7a7f013579718fe2216780a925c0e97d8ac0dfbe0
MD5          2dda1a0d73cee0a141c04874e4bd7f98
BLAKE2b-256  f4a1b2f5b07cc9c11280b328dff684e3435935d55f6ab0b09ca1787239d877bf

See more details on using hashes here.
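
As a minimal sketch of how a published digest can be used, the snippet below computes the SHA256 of a downloaded file with Python's standard `hashlib` module and compares it with the value listed for the source distribution. The helper names `sha256_of_file` and `verify_sha256` are illustrative, and the local path assumes the archive was saved to the current directory.

```python
import hashlib
import os

# SHA256 digest listed on this page for tydataprep-0.0.1.0.tar.gz.
EXPECTED_SHA256 = "594d164080730d6ce6314ae7a7f013579718fe2216780a925c0e97d8ac0dfbe0"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """Return True if the file's SHA256 matches the expected hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Assumed download location; adjust to wherever the archive was saved.
archive = "tydataprep-0.0.1.0.tar.gz"
if os.path.exists(archive):
    print("hash matches:", verify_sha256(archive, EXPECTED_SHA256))
```

The same check works for the wheel by substituting its file name and the SHA256 digest listed for it.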

File details

Details for the file tydataprep-0.0.1.0-py3-none-any.whl.

File hashes

Hashes for tydataprep-0.0.1.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       cd7e765b1f0c14234b26e2e3d5fe8a0ed116da65ff947037db7dd2d578889902
MD5          bc2aa8569875fb0a25b78650b794c4c8
BLAKE2b-256  76b5b0701afc2562c3b7a61b49ab792bf68eee60b91a35c617f902a7785848c9

