Skip to main content

TRL Jobs.

Project description

TRL Jobs

A convenient wrapper around hfjobs for running TRL (Transformer Reinforcement Learning) specific workflows on Hugging Face infrastructure.

Installation

pip install trl-jobs

Available Commands

SFT (Supervised Fine-Tuning)

Run SFT job with ease:

trl-jobs sft --model MODEL_NAME --dataset DATASET_NAME [OPTIONS]

Required Arguments

  • --model: Model name or path (e.g., Qwen/Qwen3-4B-Base)
  • --dataset: Dataset name or path (e.g., trl-lib/tldr)

Optional Arguments

  • --flavor: Hardware flavor (default: t4-small)
  • -d, --detach: Run job in background and print job ID
  • --token: Hugging Face access token

Examples

trl-jobs sft \
    --model Qwen/Qwen3-4B-Base \
    --dataset trl-lib/tldr

Hardware Flavors

Common hardware flavors you can use:

  • t4-small: NVIDIA T4 GPU (default)
  • t4-medium: NVIDIA T4 GPU with more resources
  • a10g-small: NVIDIA A10G GPU
  • a10g-large: NVIDIA A10G GPU with more resources
  • a100-large: NVIDIA A100 GPU

Authentication

You can provide your Hugging Face token in several ways:

  1. Using huggingface-hub login: huggingface-cli login
  2. Setting the HF_TOKEN environment variable
  3. Using the --token argument

License

MIT License - see LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

trl_jobs-0.1.2.tar.gz (2.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

trl_jobs-0.1.2-py3-none-any.whl (2.6 kB view details)

Uploaded Python 3

File details

Details for the file trl_jobs-0.1.2.tar.gz.

File metadata

  • Download URL: trl_jobs-0.1.2.tar.gz
  • Upload date:
  • Size: 2.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for trl_jobs-0.1.2.tar.gz
Algorithm Hash digest
SHA256 985acb04e0c51f4ba0eea7bb3bafcd814dc444cfd1b1d76d4df09e117dad9686
MD5 676354d8ee6198b135ad69e6941337f1
BLAKE2b-256 63cf77f07a1b285cbd4e527dc37651c08b6d37af6caae943fff53beb2bd91930

See more details on using hashes here.

File details

Details for the file trl_jobs-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: trl_jobs-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 2.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for trl_jobs-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5d035ab47f632eb175c15d9c94f51cea6ba481e492817f4d19e7092928664f0c
MD5 c5e2e938075ce1103c22186ec461d0b3
BLAKE2b-256 5e55fb30319f30f2ad8bd271faf81e9ca931861bc2f52e7c64fe75ef99fc080c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page