Ladder + TTFT LLM Finetuning
Project description
Ladder + TTFT
Custom Ladder implementation on any Complex problem for LLM. a reimplementation of the paper “LADDER: SELF-IMPROVING LLMS THROUGH RECURSIVE PROBLEM DECOMPOSITION” https://arxiv.org/pdf/2503.00735
Finetuned Qwen2-0.5B with Ladder (Model Response)
setup
install from source using PDM
git clone git@github.com:AbdelrahmanAbounida/ladder.git
cd ladder
pdm install
Run
our main usecase (Graph problem)
python src/main.py
TODO
Dataset Generation
- LLM Intelligence ratio Equation
- Custom Verification Method if required (for our Graph Usecase)
- DatasetGenerator > Generate subproblems according to the model intelligence ratio (step3)
- Difficulty Engine should decide the level of difficulty to be generated and what transformations to be applied
- Verification engine should use the small llm to be tuned not the Larger one
- LLM Engine (temperature cycling and persona based prompts for different operations like variant generation)
Ladder
- Ladder Finetuning Process
- GRPO Implementation
- reward functions
TTRL
- TTRL Implementation
- Data Generation in a loop
Others
- General Configurations for all Constants and Hyper Parameters
- implement different interfaced for different models to be used (HF, Ollama, VLLM, deepspeed, LiteLLM,..)
- LLMS Benchmarking
- Metrics and other evaluation methods
- implement more usecases if required for diverse benchmarking
- use accelerate / PEFT / deepspeed and vllm to speed up training process
Production
- Documentation
- packaging
- CICD
- Testing
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ladder_ai-0.1.91.tar.gz
(22.4 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ladder_ai-0.1.91.tar.gz.
File metadata
- Download URL: ladder_ai-0.1.91.tar.gz
- Upload date:
- Size: 22.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: pdm/2.24.3.dev12+ga93208db CPython/3.13.3 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
82ed8fe4a2d80b75554fab984e0b81cf60b6206276d2b70db7abf2cbe1e42d1c
|
|
| MD5 |
22b68a0732c8d8c5f561cd3673a80917
|
|
| BLAKE2b-256 |
6ed9c07a38888331efc0178765fc5d59c3c40d7d39a20d8a2ae70b07cfba888d
|
File details
Details for the file ladder_ai-0.1.91-py3-none-any.whl.
File metadata
- Download URL: ladder_ai-0.1.91-py3-none-any.whl
- Upload date:
- Size: 29.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: pdm/2.24.3.dev12+ga93208db CPython/3.13.3 Linux/6.11.0-1015-azure
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
86592ca976dc73025d56fdf083b913f820d7ba13b58dbb5eaa526b29d1a89f5a
|
|
| MD5 |
4e502f1204166e1641945e918739842f
|
|
| BLAKE2b-256 |
49b74ec771c79b455a5e9ce2066d39046c83475aaa848c36e9df43fb648d3c4d
|