LLM Foundry

These details have not been verified by PyPI

Project links

Homepage

Project description

LLM Foundry

This repository contains code for training, finetuning, evaluating, and deploying LLMs for inference with Composer and the MosaicML platform. Designed to be easy-to-use, efficient and flexible, this codebase is designed to enable rapid experimentation with the latest techniques.

You'll find in this repo:

llmfoundry/ - source code for models, datasets, callbacks, utilities, etc.
scripts/ - scripts to run LLM workloads
- data_prep/ - convert text data from original sources to StreamingDataset format
- train/ - train or finetune HuggingFace and MPT models from 125M - 70B parameters
  - train/benchmarking - profile training throughput and MFU
- inference/ - convert models to HuggingFace or ONNX format, and generate responses
  - inference/benchmarking - profile inference latency and throughput
- eval/ - evaluate LLMs on academic (or custom) in-context-learning tasks
mcli/ - launch any of these workloads using MCLI and the MosaicML platform

MPT

MPT-7B is a GPT-style model, and the first in the MosaicML Foundation Series of models. Trained on 1T tokens of a MosaicML-curated dataset, MPT-7B is open-source, commercially usable, and equivalent to LLaMa 7B on evaluation metrics. The MPT architecture contains all the latest techniques on LLM modeling -- Flash Attention for efficiency, Alibi for context length extrapolation, and stability improvements to mitigate loss spikes. The base model and several variants, including a 64K context length fine-tuned model (!!) are all available:

Model	Context Length	Download	Demo	Commercial use?
MPT-7B	2048	https://huggingface.co/mosaicml/mpt-7b		Yes
MPT-7B-Instruct	2048	https://huggingface.co/mosaicml/mpt-7b-instruct	Demo	Yes
MPT-7B-Chat	2048	https://huggingface.co/mosaicml/mpt-7b-chat	Demo	No
MPT-7B-StoryWriter	65536	https://huggingface.co/mosaicml/mpt-7b-storywriter		Yes

To try out these models locally, follow the instructions in scripts/inference/README.md to prompt HF models using our hf_generate.py or hf_chat.py scripts.

Latest News

Prerequisites

Here's what you need to get started with our LLM stack:

Use a Docker image with PyTorch 1.13+, e.g. MosaicML's PyTorch base image
- Recommended tag: mosaicml/pytorch:1.13.1_cu117-python3.10-ubuntu20.04
- This image comes pre-configured with the following dependencies:
  - PyTorch Version: 1.13.1
  - CUDA Version: 11.7
  - Python Version: 3.10
  - Ubuntu Version: 20.04
  - FlashAttention kernels from HazyResearch
Use a system with NVIDIA GPUs

Installation

To get started, clone this repo and install the requirements:

git clone https://github.com/mosaicml/llm-foundry.git
cd llm-foundry

# Optional: we highly recommend creating and using a virtual environment
python -m venv llmfoundry-venv
source llmfoundry-venv/bin/activate

pip install -e ".[gpu]"  # or pip install -e . if no NVIDIA GPU

Quickstart

Here is an end-to-end workflow for preparing a subset of the C4 dataset, training an MPT-125M model for 10 batches, converting the model to HuggingFace format, evaluating the model on the Winograd challenge, and generating responses to prompts.

If you have a write-enabled HuggingFace auth token, you can optionally upload your model to the Hub! Just export your token like this:

export HUGGING_FACE_HUB_TOKEN=your-auth-token

and uncomment the line containing --hf_repo_for_upload ....

(Remember this is a quickstart just to demonstrate the tools -- To get good quality, the LLM must be trained for longer than 10 batches 😄)

cd scripts

# Convert C4 dataset to StreamingDataset format
python data_prep/convert_dataset_hf.py \
  --dataset c4 --data_subset en \
  --out_root my-copy-c4 --splits train_small val_small \
  --concat_tokens 2048 --tokenizer EleutherAI/gpt-neox-20b --eos_text '<|endoftext|>'

# Train an MPT-125m model for 10 batches
composer train/train.py \
  train/yamls/mpt/125m.yaml \
  data_local=my-copy-c4 \
  train_loader.dataset.split=train_small \
  eval_loader.dataset.split=val_small \
  max_duration=10ba \
  eval_interval=0 \
  save_folder=mpt-125m

# Convert the model to HuggingFace format
python inference/convert_composer_to_hf.py \
  --composer_path mpt-125m/ep0-ba10-rank0.pt \
  --hf_output_path mpt-125m-hf \
  --output_precision bf16 \
  # --hf_repo_for_upload user-org/repo-name

# Evaluate the model on Winograd
python eval/eval.py \
  eval/yamls/hf_eval.yaml \
  icl_tasks=eval/yamls/winograd.yaml \
  model_name_or_path=mpt-125m-hf

# Generate responses to prompts
python inference/hf_generate.py \
  --name_or_path mpt-125m-hf \
  --max_new_tokens 256 \
  --prompts \
    "The answer to life, the universe, and happiness is" \
    "Here's a quick recipe for baking chocolate chip cookies: Start by"

Contact Us

If you run into any problems with the code, please file Github issues directly to this repo.

If you want to train LLMs on the MosaicML platform, reach out to us at demo@mosaicml.com!

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.21.0

May 31, 2025

0.20.0

Apr 29, 2025

0.19.0

Apr 7, 2025

0.18.0

Mar 18, 2025

0.17.1

Feb 21, 2025

0.17.0

Jan 31, 2025

0.16.0

Jan 17, 2025

0.15.1

Dec 5, 2024

0.15.0

Nov 23, 2024

0.14.5

Nov 18, 2024

0.14.4

Nov 7, 2024

0.14.3

Nov 5, 2024

0.14.2

Nov 4, 2024

0.14.1

Nov 2, 2024

0.14.0

Oct 28, 2024

0.13.1

Oct 18, 2024

0.13.0

Oct 15, 2024

0.12.0

Sep 26, 2024

0.11.0

Aug 13, 2024

0.10.0

Jul 3, 2024

0.10.0.dev0 pre-release yanked

Jul 2, 2024

Reason this release was yanked:

Mistaken version

0.9.1

Jun 24, 2024

0.9.0

Jun 8, 2024

0.9.0.dev0 pre-release yanked

May 8, 2024

Reason this release was yanked:

Mistaken version

0.8.0

May 8, 2024

0.7.0

Mar 27, 2024

0.6.0

Mar 14, 2024

0.5.0

Feb 8, 2024

0.4.0

Nov 22, 2023

0.3.0

Sep 27, 2023

0.2.0

Jul 4, 2023

This version

0.1.0

May 10, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm-foundry-0.1.0.tar.gz (106.3 kB view details)

Uploaded May 10, 2023 Source

Built Distribution

llm_foundry-0.1.0-py3-none-any.whl (106.0 kB view details)

Uploaded May 10, 2023 Python 3

File details

Details for the file llm-foundry-0.1.0.tar.gz.

File metadata

Download URL: llm-foundry-0.1.0.tar.gz
Upload date: May 10, 2023
Size: 106.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for llm-foundry-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`7ff9e195d7a8a4d609a87272dda48eca1a952397240808b26cc3f8f7a8dba090`
MD5	`7d80b32343332a691e58e8de61739351`
BLAKE2b-256	`3aae57c3346bd35154ba2f506fc13cb07b38cefc4f5511cb34705c1c34c568f1`

See more details on using hashes here.

File details

Details for the file llm_foundry-0.1.0-py3-none-any.whl.

File metadata

Download URL: llm_foundry-0.1.0-py3-none-any.whl
Upload date: May 10, 2023
Size: 106.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.11.3

File hashes

Hashes for llm_foundry-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c94955cbdb02dc3df25d862a35fd07f12fc7be8107a30d9ee87c88e43a1ef787`
MD5	`71ba2ea5f9f12cf5eaf7af00d5b82faf`
BLAKE2b-256	`d732b1afffb2d76c0be72fcda3184050bc4ebc32695a645ef95bdb35103007ad`

See more details on using hashes here.

llm-foundry 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LLM Foundry

MPT

Latest News

Prerequisites

Installation

Quickstart

Contact Us

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes