Atom

Atom is a fine-tuned LLaMA model for creating better LLMs through PyTorch data.

Installation

Clone the repository and install the package in editable mode with pip:

git clone https://github.com/jquesnelle/yarn
cd yarn
pip install -e .
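
After installing, it can be useful to confirm that pip actually registered the package. The following is a minimal sketch using the standard library's importlib.metadata. It assumes the distribution is named atom_torch, as the release file names suggest; the helper name installed_version is hypothetical and not part of the package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str = "atom_torch") -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent.

    The default name "atom_torch" is an assumption taken from the release
    file names; adjust it if the package is registered under another name.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    if found is None:
        print("atom_torch is not installed in this environment")
    else:
        print(f"atom_torch {found} is installed")
```

Returning None instead of raising keeps the check easy to use in setup scripts that need to decide whether to run the install step.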

Training

To train the models, first run accelerate config and enable DeepSpeed acceleration. The configuration file used for training was deepspeed/zero3.json.

# ./train.sh
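
The contents of deepspeed/zero3.json are not reproduced here. As a rough illustration of what a ZeRO stage 3 configuration generally looks like, the sketch below builds one as a Python dictionary and prints it as JSON. The keys are standard DeepSpeed configuration keys, but every value is an assumption rather than the setting used for these models, and the function name build_zero3_config is hypothetical. The "auto" placeholders rely on the Hugging Face integration filling them in from its own settings.

```python
import json


def build_zero3_config() -> dict:
    """Build an illustrative DeepSpeed ZeRO stage 3 configuration.

    These values are assumptions for demonstration only; they are not
    copied from the repository's deepspeed/zero3.json.
    """
    return {
        # Mixed precision; "auto" defers to the launcher's own setting.
        "bf16": {"enabled": "auto"},
        "zero_optimization": {
            # Stage 3 partitions optimizer state, gradients, and parameters.
            "stage": 3,
            "overlap_comm": True,
            "contiguous_gradients": True,
            # Gather full 16-bit weights when saving a checkpoint.
            "stage3_gather_16bit_weights_on_model_save": True,
        },
        "gradient_accumulation_steps": "auto",
        "gradient_clipping": "auto",
        "train_batch_size": "auto",
        "train_micro_batch_size_per_gpu": "auto",
    }


if __name__ == "__main__":
    print(json.dumps(build_zero3_config(), indent=2))
```

For actual training, prefer the configuration file shipped with the repository, since it records the settings that were really used.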

The tokenized training data is available on Hugging Face and was derived from the pg19 dataset.

Evaluation

To reproduce the evaluations, install lm-evaluation-harness:

pip install git+https://github.com/EleutherAI/lm-evaluation-harness

Then run the two provided scripts:

# ./eval.sh
# ./eval-harness.sh

Citation

@misc{peng2023yarn,
      title={YaRN: Efficient Context Window Extension of Large Language Models}, 
      author={Bowen Peng and Jeffrey Quesnelle and Honglu Fan and Enrico Shippole},
      year={2023},
      eprint={2309.00071},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Release history

0.0.2 (this version): Sep 8, 2023
0.0.1: Sep 8, 2023

Download files

Source Distribution: atom_torch-0.0.2.tar.gz (32.2 kB), uploaded Sep 8, 2023
Built Distribution: atom_torch-0.0.2-py3-none-any.whl (33.6 kB, Python 3), uploaded Sep 8, 2023

Hashes for atom_torch-0.0.2.tar.gz

Algorithm	Hash digest
SHA256	`6483a3a259491dbd88348fc14a657adf7c53e4f387bd3ecb7de4d21192a0ab76`
MD5	`ab8e589b013234a4b621ac0aefa20832`
BLAKE2b-256	`c31fa948e5d9423c4b929246d79e49e91cc29132ca714a2cff2baeda30be09d6`

Hashes for atom_torch-0.0.2-py3-none-any.whl

Algorithm	Hash digest
SHA256	`2ebe961645855f9a97a204bc4b0bf3ccdb1a3a643a37ec39edb4f90702c79800`
MD5	`a5223b9049819cd45ba89e1ea5ced8f4`
BLAKE2b-256	`80f46c4c5f1a17aa98a8040e7b40bf27b944c75eab06d3650f7113b3692607c1`
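
The SHA256 digests above can be used to check a downloaded file before installing it. A minimal sketch with the standard library's hashlib follows. The digests are copied from the tables above; the helper names sha256_of and verify are hypothetical, and the script assumes the file sits in the current directory under its original name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the hash tables above.
EXPECTED_SHA256 = {
    "atom_torch-0.0.2.tar.gz":
        "6483a3a259491dbd88348fc14a657adf7c53e4f387bd3ecb7de4d21192a0ab76",
    "atom_torch-0.0.2-py3-none-any.whl":
        "2ebe961645855f9a97a204bc4b0bf3ccdb1a3a643a37ec39edb4f90702c79800",
}


def sha256_of(path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path) -> bool:
    """Return True if the file's digest matches the one listed for its name.

    Raises KeyError if the file name is not one of the listed release files.
    """
    expected = EXPECTED_SHA256[Path(path).name]
    return sha256_of(path) == expected


if __name__ == "__main__":
    for name in EXPECTED_SHA256:
        if Path(name).exists():
            print(name, "OK" if verify(name) else "MISMATCH")
        else:
            print(name, "not found in the current directory")
```

Reading in chunks keeps memory use flat regardless of file size, which matters little for files this small but makes the helper reusable for larger downloads.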