LLM post-training library
Project description
Kithara - Easy Finetuning on TPUs
👋 Overview
Kithara is a lightweight library offering building blocks and recipes for tuning popular open source LLMs including Gemma2 and Llama3 on Google TPUs.
It provides:
- Frictionless scaling: Distributed training abstractions intentionally built with simplicity in mind.
- Multihost training support: Integration with Ray, GCE and GKE.
- Async, distributed checkpointing: Multi-host & Multi-device checkpointing via Orbax.
- Distributed, streamed dataloading: Per-process, streamed data loading via Ray.data.
- GPU/TPU fungibility: Same code works for both GPU and TPU out of the box.
- Native integration with HuggingFace: Tune and save models in HuggingFace format.
New to TPUs?
Using TPUs provides significant advantages in terms of performance, cost-effectiveness, and scalability, enabling faster training times and the ability to work with larger models and datasets. Check out our onboarding guide to getting TPUs.
🔗 Key links and resources
| 📚 Documentation | Read Our Docs |
| 💾 Installation | Quick Pip Install |
| ✏️ Get Started | Intro to Kithara |
| 🌟 Supported Models | List of Models |
| 🌐 Supported Datasets | List of Data Formats |
| ⌛️ Performance Optimizations | Our Memory and Throughput Optimizations |
| 📈 Scaling up | Guide for Tuning Large Models |
🌵 Examples
-
Quick Start Colab Notebook: SFT + LoRA with Gemma2-2b
-
SFT + LoRA: Step by Step Example
-
Continued Pretraining: Step by Step Example
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file kithara-0.0.10.tar.gz.
File metadata
- Download URL: kithara-0.0.10.tar.gz
- Upload date:
- Size: 8.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0a82c91d086994b85598c9933fe31ccc4243a1ecdb6bd61bc1a6e9db8f21d1ec
|
|
| MD5 |
00f31b70966768e7167de430f914896a
|
|
| BLAKE2b-256 |
2508788b68de8acc9c504ed0a2681bb88e4e5e38d71740a9af416b6a1036b0dc
|
File details
Details for the file kithara-0.0.10-py3-none-any.whl.
File metadata
- Download URL: kithara-0.0.10-py3-none-any.whl
- Upload date:
- Size: 8.9 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b66fccf73853017e0dfcfe6c53a3318f2976669eed1e81ba9be34af1c3f869c2
|
|
| MD5 |
669ddd0a09cfb9ac4882f50b9cd5dca6
|
|
| BLAKE2b-256 |
1edf77f4c3dbda0b4cc48a7220a361c2584b2255827be410e39037c1cd6a36af
|