Datasets and models for instruction-tuning
Project description
Datasets and models for instruction-tuning
txtinstruct is a framework for training instruction-tuned models.
The objective of this project is to support open data, open models and integration with your own data. One of the biggest problems today is the lack of licensing clarity with instruction-following datasets and large language models. txtinstruct makes it easy to build your own instruction-following datasets and use those datasets to train instructed-tuned models.
txtinstruct is built with Python 3.7+ and txtai.
Installation
The easiest way to install is via pip and PyPI
pip install txtinstruct
You can also install txtinstruct directly from GitHub. Using a Python Virtual Environment is recommended.
pip install git+https://github.com/neuml/txtinstruct
Python 3.7+ is supported
See this link to help resolve environment-specific install issues.
Examples
The following example notebooks show how to build models with txtinstruct.
Notebook | Description | |
---|---|---|
Introducing txtinstruct | Build instruction-tuned datasets and models |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for txtinstruct-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b807aa380e33c9be22f331093f7d7e5d5da137d31ef054d0570574a7b0e2552f |
|
MD5 | b62547b0c14aff700aa6ab91971cc1f4 |
|
BLAKE2b-256 | e17b9eb3e17901d3db33884bea5e1160fe43444e34b16b62eced1c8229ea66cb |