Reasoning Models From Scratch

Build A Reasoning Model (From Scratch)

This repository contains the code for developing an LLM reasoning model and is the official code repository for the book Build a Reasoning Model (From Scratch).



(Printed in color.)


In Build a Reasoning Model (From Scratch), you will learn and understand how a reasoning large language model (LLM) works.

Reasoning is one of the most exciting and important recent advances in improving LLMs, but it’s also one of the easiest to misunderstand if you only hear the term reasoning and read about it in theory. This is why this book takes a hands-on approach. We will start with a pre-trained base LLM and then add reasoning capabilities ourselves, step by step in code, so you can see exactly how it works.

The methods described in this book walk you through the process of developing your own small-but-functional reasoning model for educational purposes. They mirror the approaches used in creating large-scale reasoning models such as DeepSeek R1, GPT-5 Thinking, and others. In addition, this book includes code for loading the weights of existing, pre-trained models.



To download a copy of this repository, click on the Download ZIP button or execute the following command in your terminal:

git clone --depth 1 https://github.com/rasbt/reasoning-from-scratch.git

Tip: Chapter 2 provides additional tips on installing Python, managing Python packages, and setting up your coding environment.



Table of Contents (In Progress)

Chapter Title Main Code
Ch 1: Understanding Reasoning Models No code
Ch 2: Generating Text with a Pre-trained LLM - ch02_main.ipynb
- ch02_exercise-solutions.ipynb
Ch 3: Evaluating Reasoning Models - ch03_main.ipynb
- ch03_exercise-solutions.ipynb
Ch 4: Improving Reasoning with Inference-Time Scaling - ch04_main.ipynb
- ch04_exercise-solutions.ipynb
Ch 5: Inference-Time Scaling via Self-Refinement - ch05_main.ipynb
- ch05_exercise-solutions.ipynb
Ch 6: Training Reasoning Models with Reinforcement Learning - ch06_main.ipynb
- ch06_exercise-solutions.ipynb
Ch 7: Improving GRPO for Reinforcement Learning - ch07_main.ipynb
- ch07_exercise-solutions.ipynb
Ch 8: Distilling Reasoning Models for Efficient Reasoning TBA
Appendix A: References and Further Reading No code
Appendix B: Exercise Solutions Code and solutions are in each chapter's subfolder
Appendix C: Qwen3 LLM Source Code - chC_main.ipynb
Appendix D TBA
Appendix E TBA
Appendix F: Common Approaches to LLM Evaluation - chF_main.ipynb

 

The mental model below summarizes the main techniques covered in this book.


 

Companion Book

Please note that Build A Reasoning Model (From Scratch) is a standalone book focused on methods to improve LLM reasoning.

In this book, we work with a pre-trained open-source base LLM (Qwen3), on top of which we code and apply reasoning methods from scratch. This includes inference-time scaling, reinforcement learning, and distillation.

However, if you are interested in understanding how a conventional base LLM is implemented, you may like my previous book, Build a Large Language Model (From Scratch).


 

Hardware Requirements

The code in the main chapters of this book is designed to mostly run on consumer hardware within a reasonable timeframe and does not require specialized server hardware, so a wide audience can work through the material. The code automatically uses a GPU if one is available. Chapters 2-4 run well on both CPUs and GPUs. For chapters 5 and 6, a GPU is recommended if you want to replicate the results in those chapters.

(Please see the setup_tips doc for additional recommendations.)
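
As a rough illustration of what "automatically uses a GPU if one is available" can look like, here is a minimal sketch. It assumes the code is built on PyTorch, which this page does not state explicitly, and the helper name pick_device is hypothetical rather than taken from the book's code. If PyTorch is not installed, the sketch simply falls back to the CPU.

```python
def pick_device():
    """Return the name of the best available compute device.

    Hypothetical helper for illustration; assumes PyTorch as the backend.
    """
    try:
        import torch
    except ImportError:
        # Without PyTorch installed there is nothing to probe.
        return "cpu"

    if torch.cuda.is_available():
        return "cuda"  # NVIDIA GPU

    # Apple-silicon GPUs are exposed through the MPS backend in recent
    # PyTorch versions; getattr keeps this safe on older versions.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"

    return "cpu"


if __name__ == "__main__":
    print(f"Using device: {pick_device()}")
```

The function returns a plain string, so the result can be passed to whatever device argument the surrounding code expects.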

 

Exercises

Each chapter of the book includes several exercises. The solutions are summarized in Appendix B, and the corresponding code notebooks are available in the main chapter folders of this repository (for example, ch02/01_main-chapter-code/ch02_exercise-solutions.ipynb).
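
If you have cloned the repository, a short script can list the exercise-solution notebooks. This is only a sketch: it assumes every chapter follows the same folder layout as the ch02 example path above, and the function name find_exercise_notebooks is made up for this illustration.

```python
from pathlib import Path


def find_exercise_notebooks(repo_root):
    """List exercise-solution notebooks under a local clone of the repository.

    Assumes the layout ch*/01_main-chapter-code/*_exercise-solutions.ipynb,
    inferred from the single ch02 example path; other chapters may differ.
    """
    root = Path(repo_root)
    if not root.is_dir():
        return []
    pattern = "ch*/01_main-chapter-code/*_exercise-solutions.ipynb"
    return sorted(root.glob(pattern))


if __name__ == "__main__":
    # "reasoning-from-scratch" is the default folder name created by git clone.
    for notebook in find_exercise_notebooks("reasoning-from-scratch"):
        print(notebook)
```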

 

Bonus Material

Several folders contain optional materials as a bonus for interested readers:

 

Questions, Feedback, and Contributing to This Repository

For common problems, please see the Troubleshooting Guide.

I welcome all sorts of feedback, best shared via the Manning Discussion Forum or GitHub Discussions. Likewise, if you have any questions or just want to bounce ideas off others, please don't hesitate to post these in the forum as well.

Please note that since this repository contains the code corresponding to a print book, I currently cannot accept contributions that would extend the contents of the main chapter code, as it would introduce deviations from the physical book. Keeping it consistent helps ensure a smooth experience for everyone.

 

Citation

If you find this book or code useful for your research, please consider citing it.

Chicago-style citation:

Raschka, Sebastian. Build A Reasoning Model (From Scratch). Manning, 2025. ISBN: 9781633434677.

BibTeX entry:

@book{build-llms-from-scratch-book,
  author       = {Sebastian Raschka},
  title        = {Build A Reasoning Model (From Scratch)},
  publisher    = {Manning},
  year         = {2025},
  isbn         = {9781633434677},
  url          = {https://mng.bz/lZ5B},
  github       = {https://github.com/rasbt/reasoning-from-scratch}
}

Download files

Source Distribution

reasoning_from_scratch-0.1.16.tar.gz (66.9 kB)

Uploaded Source

Built Distribution

reasoning_from_scratch-0.1.16-py3-none-any.whl (59.3 kB)

Uploaded Python 3

File details

Details for the file reasoning_from_scratch-0.1.16.tar.gz.

File metadata

  • Download URL: reasoning_from_scratch-0.1.16.tar.gz
  • Size: 66.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for reasoning_from_scratch-0.1.16.tar.gz
Algorithm Hash digest
SHA256 13fb2bd67961082a91eadc66fe6aafa265589dc72c5ce6c2e5d7637331569189
MD5 d223c9cdc0e55fffae1df18bc082e11d
BLAKE2b-256 cc822cc68d75380240758b572bd0dce85cc0f2c7ce12da0287e177af773332d4
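
The digests above can be used to check that a downloaded file is intact. The following sketch uses only Python's standard hashlib module. The helper names sha256_of and matches_sha256 are hypothetical, and the script assumes the archive was saved to the current directory under its original file name.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # File name and digest copied from the listing above; the local
    # download location is an assumption.
    archive = Path("reasoning_from_scratch-0.1.16.tar.gz")
    expected = "13fb2bd67961082a91eadc66fe6aafa265589dc72c5ce6c2e5d7637331569189"
    if archive.exists():
        print("SHA256 matches" if matches_sha256(archive, expected) else "SHA256 MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a 66.9 kB archive but makes the helper reusable for larger downloads such as model weights.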

File details

Details for the file reasoning_from_scratch-0.1.16-py3-none-any.whl.

File hashes

Hashes for reasoning_from_scratch-0.1.16-py3-none-any.whl
Algorithm Hash digest
SHA256 9e0fd9a44943d4c3b770b37225ae24951262c38203e01676c3d5ea1f0f04cfa5
MD5 eae3f74c1981babc9845821c3b2ebecf
BLAKE2b-256 651e805f39978ac925e30f15cbc510f332195e54ea193d8ef5fe3607483bda39
