felafax

Project description

Felafax -- tune LLaMa3.1 on Google Cloud TPUs for 30% lower cost and scale seamlessly!

RoadRunnerX is a framework for continued training and fine-tuning of open-source LLMs using the XLA runtime. We take care of the necessary runtime setup and provide a Jupyter notebook out of the box so you can get started right away.

  • Easy to use.
  • Easy to configure all aspects of training (designed for ML researchers and hackers).
  • Easy to scale training from a single VM with 8 TPU cores to an entire TPU Pod containing 6000 TPU cores (roughly 750X)!

Goal

Our goal at felafax is to build infra to make it easier to run AI workloads on non-NVIDIA hardware (TPU, AWS Trainium, AMD GPU, and Intel GPU).

Currently supported models

  • LLaMa-3/3.1 8B, 70B on Google Cloud TPUs.
    • Supports LoRA and full-precision training.
    • Tested on TPU v3, v5p.
  • LLaMa-3.1 405B will be available on our cloud platform at felafax.ai -- sign up for the waitlist!
  • Gemma2 2B, 9B, 27B on Cloud TPUs. New!
    • Supports fast full-precision training.
    • Tested on TPU v3, v5p.

Setup

For a hosted version with a seamless workflow, please visit app.felafax.ai 🦊.

If you prefer a self-hosted training version, follow the instructions below. These steps will guide you through launching a TPU VM on your Google Cloud account and starting a Jupyter notebook. With just 3 simple steps, you'll be up and running in under 10 minutes. 🚀

  1. Install the gcloud command-line tool and authenticate your account (SKIP this step if you already have gcloud installed and have used TPUs before! 😎)

     # Download gcloud CLI
     curl https://sdk.cloud.google.com | bash
     source ~/.bashrc
    
     # Authenticate gcloud CLI
     gcloud auth login
    
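     # Create a new project for now (project IDs must be lowercase and globally unique)
     gcloud projects create llama3-tunerx --set-as-default
    
     # Configure SSH: add host entries for your instances to ~/.ssh/config
     gcloud compute config-ssh --quiet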
    
     # Set up default credentials
     gcloud auth application-default login
    
     # Enable Cloud TPU API access
     gcloud services enable compute.googleapis.com tpu.googleapis.com storage-component.googleapis.com aiplatform.googleapis.com
    
  2. Spin up a TPU v5-8 VM 🤠.

    sh ./launch_tuner.sh
    

    Keep an eye on the terminal -- you may be prompted for your SSH key passphrase and for your Hugging Face token.

  3. Open the Jupyter notebook at http://localhost:8888 and start fine-tuning!
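
Before running step 2, it can help to confirm that the tools from step 1 are actually in place. The following is a minimal sketch under stated assumptions, not part of the felafax scripts: the function names require_tool and preflight are hypothetical, and the only gcloud call used is the standard `gcloud config get-value project`, which prints the active project.

```shell
#!/bin/sh
# Hypothetical preflight helper; it is NOT part of felafax or launch_tuner.sh.

# Print "ok: NAME" and return 0 if NAME is on PATH; otherwise print
# "missing: NAME" and return 1.
require_tool() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "ok: $1"
  else
    echo "missing: $1"
    return 1
  fi
}

# Check every tool passed as an argument, then report the active gcloud
# project (if gcloud is available). Returns 1 if any tool is missing.
preflight() {
  preflight_status=0
  for tool in "$@"; do
    require_tool "$tool" || preflight_status=1
  done
  if command -v gcloud >/dev/null 2>&1; then
    # Shows the project selected by --set-as-default in step 1.
    echo "project: $(gcloud config get-value project 2>/dev/null)"
  fi
  return "$preflight_status"
}

# Step 1 relies on curl (to download the SDK) and on gcloud itself.
preflight gcloud curl || echo "Finish step 1 before running launch_tuner.sh."
```

If any line starts with "missing:", rerun the matching part of step 1 before launching the TPU VM.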
