
Load any mixture of text-to-text data in one line of code

Project description


Unitxt is a Python library for getting data fired up and ready for use. In one line of code, it prepares a dataset or a mixture of datasets into an input-output format for training and evaluation. We aspire to be simple, adaptable and transparent.

Unitxt builds on separation. Separation allows adding a dataset without knowing anything about the models that use it. Separation allows training without caring about preprocessing, switching models without loading the data differently, and changing formats (instruction, ICL, etc.) without changing anything else.
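To make the idea of separation concrete, here is a minimal sketch in plain Python. It does not use Unitxt and is not its API: every name in it (`load_toy_sentiment`, `plain_format`, `instruction_format`, `build`) is hypothetical and exists only for illustration. The point is that the loader knows nothing about formats, and the formats know nothing about where the data came from.

```python
from typing import Callable, Dict, List

# NOTE: all names below are hypothetical illustrations, not Unitxt's API.
Example = Dict[str, str]


def load_toy_sentiment() -> List[Example]:
    """Dataset side: returns raw records and knows nothing about formats or models."""
    return [
        {"text": "great movie", "label": "positive"},
        {"text": "dull plot", "label": "negative"},
    ]


def plain_format(example: Example) -> Example:
    """Format side: pass the raw text through as the model input."""
    return {"source": example["text"], "target": example["label"]}


def instruction_format(example: Example) -> Example:
    """Format side: wrap the same record in an instruction-style prompt."""
    prompt = (
        "Classify the sentiment of the text.\n"
        f"Text: {example['text']}\n"
        "Sentiment:"
    )
    return {"source": prompt, "target": example["label"]}


def build(
    loader: Callable[[], List[Example]],
    formatter: Callable[[Example], Example],
) -> List[Example]:
    """Combine any loader with any formatter into input-output pairs."""
    return [formatter(example) for example in loader()]


if __name__ == "__main__":
    # Switching the format is a one-argument change; the loader is untouched.
    for row in build(load_toy_sentiment, instruction_format):
        print(row)
```

Swapping `instruction_format` for `plain_format` changes the model input while the targets and the loading code stay the same, which is the kind of independence the paragraph above describes.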



Where to start? 🦄


Why Unitxt? 🦄

🦄 Simplicity

Everything in Unitxt is simple and designed to feel natural and self-explanatory.

🦄 Adaptability

Adding new datasets, loading recipes, instructions and formatters is possible and encouraged!

🦄 Transparency

The resources and formatters of Unitxt are stored as shared datasets and can therefore be easily reviewed by the crowd. Moreover, when a dataset is assembled with Unitxt, it is very clear to others what is in it.


Download files


Source Distribution

unitxt-1.0.20.tar.gz (38.6 kB view details)

Uploaded Source

Built Distribution


unitxt-1.0.20-py3-none-any.whl (47.7 kB view details)

Uploaded Python 3

File details

Details for the file unitxt-1.0.20.tar.gz.

File metadata

  • Download URL: unitxt-1.0.20.tar.gz
  • Upload date:
  • Size: 38.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for unitxt-1.0.20.tar.gz
Algorithm Hash digest
SHA256 cf940e8839542246951e0567f64d5f92ec87191e5a31a02a6138913e52a9c22c
MD5 c95af188277232e6c523694bb0f58af1
BLAKE2b-256 50a3a7bb75377e16d1f94c7dcabfb9b31744a05a52b12aada20576e76ab85e5e


File details

Details for the file unitxt-1.0.20-py3-none-any.whl.

File metadata

  • Download URL: unitxt-1.0.20-py3-none-any.whl
  • Upload date:
  • Size: 47.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.17

File hashes

Hashes for unitxt-1.0.20-py3-none-any.whl
Algorithm Hash digest
SHA256 6e07898f2ddcc8b89946cf848dbb3c75797ebd0f2724aee0f65d2eab2004ade4
MD5 7a5fef39e869ed65ca4dc1268b1297bb
BLAKE2b-256 fca311450554e4ff01ac93a6eb342d4e57c6795051e1450c96eebf19e4b590ab
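The SHA256 digests listed for each file can be checked against a local download. The following is a minimal sketch using only Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_expected` and the local file path in the usage comment are assumptions made for illustration.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published one, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage (assumes the source distribution was saved in the current directory):
# matches_expected(
#     "unitxt-1.0.20.tar.gz",
#     "cf940e8839542246951e0567f64d5f92ec87191e5a31a02a6138913e52a9c22c",
# )
```

A result of `False` means the local file differs from the published one and should not be installed.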

