Skip to main content

Load any mixture of text to text data in one line of code

Project description

Image Description

Unitxt is a python library for getting data fired up and set for utilization. In one line of code, it preps a dataset or mixtures-of-datasets into an input-output format for training and evaluation. We aspire to be simple, adaptable and transperant.

Unitxt builds on separation. Separation allows adding a dataset, without knowing anything about the models using it. Separation allows training without caring for preprocessing, switching models without loading the data differently and changing formats (instruction\ICL\etc.) without changing anything else.

version license python tests codecov Read the Docs downloads

Unitxt Flow

Where to start? 🦄

Button Button Button Button Button

Why Unitxt? 🦄

🦄 Simplicity

Everything is unitxt is simple and designed to feel natural and self explenatory.

🦄 Adaptability

Adding new datasets, loading recpepies, instructions and formattors is possible and encoureged!

🦄 Transperancy

The reosurces and formators of Unitxt are stored as shared datasets and therfore can easily reviewed by the crowed. Moreover, when assembling dataset with Unitxt it is very clear to others whats in it.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unitxt-1.0.10.tar.gz (31.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

unitxt-1.0.10-py3-none-any.whl (39.9 kB view details)

Uploaded Python 3

File details

Details for the file unitxt-1.0.10.tar.gz.

File metadata

  • Download URL: unitxt-1.0.10.tar.gz
  • Upload date:
  • Size: 31.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for unitxt-1.0.10.tar.gz
Algorithm Hash digest
SHA256 0d2f5f1858640a12347c3d3cf16186d52aa5d84facd972e0281d1044947c716e
MD5 a64a1ce10d5dd9a143f15268d8084d7a
BLAKE2b-256 cec3d13d529a093821f4f9aac0aacfbda93a31a3c0fb5e4e764434826ded2674

See more details on using hashes here.

File details

Details for the file unitxt-1.0.10-py3-none-any.whl.

File metadata

  • Download URL: unitxt-1.0.10-py3-none-any.whl
  • Upload date:
  • Size: 39.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for unitxt-1.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 8b2ee15bf5e07c2845ddab0853129603e7f7d8ef72a9678d973051a7c596c4b8
MD5 0dea01915b48d0826de688e70d595b2e
BLAKE2b-256 469b479b42629f534961d580476c9e0584b9d7f3847b874edffaeacd177f9751

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page