Implementations of post-training algorithms using the Tinker API

These details have been verified by PyPI

Maintainers

danobi derek-tml dylan-tml kshuster myleott nealwu yujia-tml

These details have not been verified by PyPI

Project links

Project description

Tinker Cookbook

We provide two libraries for the broader community to customize their language models: tinker and tinker-cookbook.

tinker is a training SDK for researchers and developers to fine-tune language models. You send API requests to us and we handle the complexities of distributed training.
tinker-cookbook includes realistic examples of fine-tuning language models. It builds on the Tinker API and provides common abstractions to fine-tune language models.

Installation

Sign up for Tinker here.
Once you have access, create an API key from the console and export it as environment variable TINKER_API_KEY.

Install tinker-cookbook (includes the tinker SDK as a dependency):

# Latest stable release from PyPI
uv pip install tinker-cookbook

# Or install the nightly build
uv pip install 'tinker-cookbook @ git+https://github.com/thinking-machines-lab/tinker-cookbook.git@nightly'

Tinker

Here we introduce a few Tinker primitives — the basic components to fine-tune LLMs (see the quickstart guide for more details):

import tinker
service_client = tinker.ServiceClient()
training_client = service_client.create_lora_training_client(
  base_model="meta-llama/Llama-3.2-1B", rank=32,
)
training_client.forward_backward(...)
training_client.optim_step(...)
training_client.save_state(...)
training_client.load_state(...)

sampling_client = training_client.save_weights_and_get_sampling_client()
sampling_client.sample(...)

See tinker_cookbook/recipes/sl_loop.py and tinker_cookbook/recipes/rl_loop.py for minimal examples of using these primitives to fine-tune LLMs.

Tutorials

New to Tinker? The tutorials/ directory contains 20+ progressive marimo notebooks that walk through core concepts — rendering, loss functions, completers, weight management — and advanced topics such as custom RL environments, DPO, RLHF, and weight export. Run any tutorial with marimo edit tutorials/101_hello_tinker.py. See the tutorials README for the full list, or browse rendered versions on the Tinker docs site.

To download the weights of any model:

rest_client = service_client.create_rest_client()
future = rest_client.get_checkpoint_archive_url_from_tinker_path(sampling_client.model_path)
with open(f"model-checkpoint.tar.gz", "wb") as f:
    f.write(future.result())

Tinker Cookbook

Besides these primitives, we also offer Tinker Cookbook (a.k.a. this repo), a library of a wide range of abstractions to help you customize training environments. tinker_cookbook/recipes/sl_basic.py and tinker_cookbook/recipes/rl_basic.py contain minimal examples to configure supervised learning and reinforcement learning.

We also include more complete examples in the tinker_cookbook/recipes/ folder:

Chat SFT: supervised fine-tuning on conversational datasets (e.g., Tulu3).
Math RL: reinforcement learning for mathematical reasoning with verifiable rewards.
Code RL: RL on competitive programming with sandboxed code execution (DeepCoder replication).
Preference learning: DPO and a three-stage RLHF pipeline (SFT, reward model, RL).
Distillation: on-policy and off-policy knowledge distillation with single- and multi-teacher configurations.
Tool use: RL for retrieval-augmented generation (Search-R1 replication).
Multi-agent: multi-agent RL with self-play and cross-play.

The recipes README covers all available recipes, including Harbor RL, rubric-based grading, VLM classification, and SDFT. Each recipe includes a README.md with implementation details, launch commands, and expected results.

Evaluation (experimental)

Tinker Cookbook includes a benchmark framework for evaluating trained models:

from tinker_cookbook.eval.benchmarks import run_benchmarks, BenchmarkConfig

results = await run_benchmarks(
    ["gsm8k", "mmlu_pro", "ifeval"],
    sampling_client, renderer,
    BenchmarkConfig(save_dir="evals/step500"),
)

The framework currently supports 12 benchmarks (GSM8K, MATH-500, MMLU-Pro, MMLU-Redux, GPQA, IFEval, MBPP, C-Eval, SuperGPQA, IFBench, AIME 2025, AIME 2026) with verified scores against published results, plus experimental benchmarks such as LiveCodeBench, Terminal Bench, and SWE-bench. Benchmarks can also serve as inline training evaluators via BenchmarkEvaluator.

Note: Benchmark scores are sensitive to evaluation configuration — system prompts, max_tokens, temperature, and timeout settings can shift results significantly. We document our exact settings alongside all reported scores. This framework is under active development; feedback and contributions are welcome. See the eval README for verified scores, configuration details, and instructions for adding new benchmarks.

Documentation

For the full Tinker documentation, visit tinker-docs.thinkingmachines.ai.

Utilities

Tinker Cookbook also provides reusable building blocks:

renderers — bidirectional conversion between token sequences and structured chat messages
hyperparam_utils — learning rate and hyperparameter scaling for LoRA training
eval — benchmark framework and inline training evaluators (see Evaluation above)

Claude Code Skills

Tinker Cookbook ships with Claude Code skills that teach Claude how to use the Tinker API. Install them so Claude can help you write training code in any project:

/plugin marketplace add thinking-machines-lab/tinker-cookbook

Then install the tinker plugin from the Discover tab (/plugin → Discover). Once installed, two skills are available:

Command	What it does
`/tinker:research`	Plan and run post-training experiments — SFT, RL, DPO, distillation, evaluation, hyperparameters, model selection, and more
`/tinker:debug`	Diagnose slow training, hangs, output mismatches, renderer issues, and errors

Skills also trigger automatically based on context — ask Claude to "set up SFT training" and it will load the right skill without a slash command. Skills update automatically when the repo is updated.

Development Setup

uv sync --extra dev
pre-commit install

This installs dev dependencies and registers pre-commit hooks that run ruff formatting and linting on every commit. CI enforces these checks on all pull requests.

Contributing

This project is built in the spirit of open science and collaborative development. We believe that the best tools emerge through community involvement and shared learning.

We welcome PR contributions after our private beta is over. If you have any feedback, please email us at tinker@thinkingmachines.ai.

Citation

If you use Tinker for your research, please cite it as:

Thinking Machines Lab, 2025. Tinker. https://thinkingmachines.ai/tinker/.

Or use this BibTeX citation:

@misc{tml2025tinker,
  author = {Thinking Machines Lab},
  title = {Tinker},
  year = {2025},
  url = {https://thinkingmachines.ai/tinker/},
}

Project details

These details have been verified by PyPI

Maintainers

danobi derek-tml dylan-tml kshuster myleott nealwu yujia-tml

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.4.1

May 12, 2026

0.4.0

May 8, 2026

0.3.0

Apr 8, 2026

0.2.2

Apr 2, 2026

0.2.1

Mar 29, 2026

0.2.0

Mar 26, 2026

0.1.0

Dec 4, 2025

0.0.0

Aug 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tinker_cookbook-0.4.1.tar.gz (4.5 MB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tinker_cookbook-0.4.1-py3-none-any.whl (867.9 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file tinker_cookbook-0.4.1.tar.gz.

File metadata

Download URL: tinker_cookbook-0.4.1.tar.gz
Upload date: May 12, 2026
Size: 4.5 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for tinker_cookbook-0.4.1.tar.gz
Algorithm	Hash digest
SHA256	`1f9ad977317529bbf796f40ef13de59b2c93a0a257469bd80a7ffcfed5beb8b2`
MD5	`1f1422ee53482b0b0a58e73e89641fd3`
BLAKE2b-256	`e59c37af9804cb3f1d88f5e67512aa1aeafeb49ef9012532d056d92c96194320`

See more details on using hashes here.

File details

Details for the file tinker_cookbook-0.4.1-py3-none-any.whl.

File metadata

Download URL: tinker_cookbook-0.4.1-py3-none-any.whl
Upload date: May 12, 2026
Size: 867.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.13 {"installer":{"name":"uv","version":"0.11.13","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for tinker_cookbook-0.4.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1776e82470534e47d8923378ea41d4c7e47a1ffdfcb1982507dfd0289ba9a70b`
MD5	`8c1891aeee70f0e039f07da6bdbcd800`
BLAKE2b-256	`9841989cc9b1ba67edcaff46fa2d5c9c63e57f3f85cbd0226472660ed01852ce`

See more details on using hashes here.

tinker_cookbook 0.4.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Tinker Cookbook

Installation

Tinker

Tutorials

Tinker Cookbook

Evaluation (experimental)

Documentation

Utilities

Claude Code Skills

Development Setup

Contributing

Citation

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes