Reasoning Models From Scratch

Build A Reasoning Model (From Scratch)

This repository contains the code for developing an LLM reasoning model and is the official code repository for the book Build A Reasoning Model (From Scratch).




In Build a Reasoning Model (From Scratch), you will learn and understand how a reasoning large language model (LLM) works.

Reasoning is one of the most exciting and important recent advances in improving LLMs, but it’s also one of the easiest to misunderstand if you only hear the term reasoning and read about it in theory. This is why this book takes a hands-on approach. We will start with a pre-trained base LLM and then add reasoning capabilities ourselves, step by step in code, so you can see exactly how it works.

The methods described in this book walk you through the process of developing your own small-but-functional reasoning model for educational purposes. This process mirrors the approaches used in creating large-scale reasoning models such as DeepSeek R1, GPT-5 Thinking, and others. In addition, this book includes code for loading the weights of existing, pretrained models.

  • Link to the official source code repository
  • Link to the book at Manning (the publisher's website) (TBD)
  • Link to the book page on Amazon.com (TBD)
  • ISBN TBD


To download a copy of this repository, click on the Download ZIP button or execute the following command in your terminal:

git clone --depth 1 https://github.com/rasbt/reasoning-from-scratch.git

Tip: Chapter 2 provides additional tips on installing Python, managing Python packages, and setting up your coding environment.



Table of Contents (In Progress)

Chapter Title | Main Code
Ch 1: Understanding reasoning models | No code
Ch 2: Generating text with a pre-trained LLM | ch02_main.ipynb, ch02_exercise-solutions.ipynb
Ch 3: Evaluating reasoning models | TBA
Ch 4: Improving reasoning with inference-time scaling | TBA
Ch 5: Training reasoning models with reinforcement learning | TBA
Ch 6: Distilling reasoning models for efficient reasoning | TBA
Ch 7: Improving the reasoning pipeline and future directions | TBA
Appendix A: References and further reading | No code
Appendix B: Exercise solutions | No code; solutions are in each chapter's subfolder
Appendix C: Qwen3 LLM source code | chC_main.ipynb

 

The mental model below summarizes the main techniques covered in this book.


 

Companion Book

Please note that Build A Reasoning Model (From Scratch) is a standalone book focused on methods to improve LLM reasoning. From chapter 1, the definition of reasoning is:

Reasoning, in the context of LLMs, refers to the model's ability to produce intermediate steps before providing a final answer. This is a process that is often described as chain-of-thought (CoT) reasoning. In CoT reasoning, the LLM explicitly generates a structured sequence of statements or computations that illustrate how it arrives at its conclusion.
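To make the definition above concrete, here is a minimal sketch that separates a model response into its intermediate steps and its final answer. It assumes the intermediate steps are wrapped in `<think>...</think>` tags, a format some reasoning models emit; the tag format, the function name `split_reasoning`, and the example response are illustrative assumptions and are not taken from the book's code.

```python
import re

# Assumed format: intermediate steps sit inside <think>...</think>,
# and the final answer follows the closing tag.
THINK_PATTERN = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(response):
    """Return (list_of_intermediate_steps, final_answer) for one response."""
    match = THINK_PATTERN.search(response)
    if match is None:
        # No visible reasoning trace: treat the whole text as the answer.
        return [], response.strip()
    # One step per non-empty line inside the think block.
    steps = [line.strip() for line in match.group(1).splitlines() if line.strip()]
    answer = response[match.end():].strip()
    return steps, answer


# Hypothetical response, written by hand for illustration.
example = "<think>\n17 + 5 = 22\n22 * 2 = 44\n</think>\nThe answer is 44."
steps, answer = split_reasoning(example)
print(steps)
print(answer)
```

A response without the tags is returned unchanged as the answer, with an empty list of steps.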

In this book, we work with a pre-trained open-source base LLM (Qwen3) on top of which we code and apply reasoning methods from scratch.

However, if you are interested in understanding how a conventional base LLM is implemented, you may like my previous book, Build a Large Language Model (From Scratch).


 

Hardware Requirements

The code in the main chapters of this book is designed to run mostly on consumer hardware within a reasonable timeframe and does not require specialized server hardware. This approach ensures that a wide audience can engage with the material. Additionally, the code automatically uses GPUs if they are available. Chapters 2-4 run well on both CPUs and GPUs. For chapters 5 and 6, a GPU is recommended if you want to replicate the results in the chapter.
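As a rough illustration of what automatic GPU selection can look like, the sketch below picks a device name and falls back to the CPU. It assumes PyTorch as the framework, which this README does not state explicitly; the helper name `pick_device` is hypothetical, and the book's own code may select devices differently.

```python
def pick_device():
    """Return "cuda", "mps", or "cpu" depending on what is available."""
    try:
        import torch  # assumption: PyTorch is the framework in use
    except ImportError:
        # Without PyTorch there is nothing to query, so default to CPU.
        return "cpu"

    if torch.cuda.is_available():
        return "cuda"  # NVIDIA GPU

    # Apple Silicon GPUs are exposed through the MPS backend in recent releases.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"

    return "cpu"


device = pick_device()
print(f"Using device: {device}")
```

The returned string can be passed to `torch.device(...)` or to a tensor's or module's `.to(...)` call.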

(Please see the setup_tips doc for additional recommendations.)

 

Exercises

Each chapter of the book includes several exercises. The solutions are summarized in Appendix B, and the corresponding code notebooks are available in the main chapter folders of this repository (for example, [ch02/01_main-chapter-code/ch02_exercise-solutions.ipynb](ch02/01_main-chapter-code/ch02_exercise-solutions.ipynb)).

 

Questions, Feedback, and Contributing to This Repository

I welcome all sorts of feedback, best shared via the Manning Forum (URL TBD) or GitHub Discussions. Likewise, if you have any questions or just want to bounce ideas off others, please don't hesitate to post these in the forum as well.

Please note that since this repository contains the code corresponding to a print book, I currently cannot accept contributions that would extend the contents of the main chapter code, as it would introduce deviations from the physical book. Keeping it consistent helps ensure a smooth experience for everyone.

 

Citation

If you find this book or code useful for your research, please consider citing it.

Chicago-style citation:

Raschka, Sebastian. Build A Reasoning Model (From Scratch). Manning, 2025. ISBN: TBD.

BibTeX entry:

@book{build-llms-from-scratch-book,
  author       = {Sebastian Raschka},
  title        = {Build A Reasoning Model (From Scratch)},
  publisher    = {Manning},
  year         = {2025},
  isbn         = {TBD},
  url          = {TBD},
  github       = {https://github.com/rasbt/reasoning-from-scratch}
}

Download files

Source Distribution

reasoning_from_scratch-0.1.3.tar.gz (29.5 kB view details)

Uploaded Source

Built Distribution

reasoning_from_scratch-0.1.3-py3-none-any.whl (27.2 kB view details)

Uploaded Python 3
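
After installing one of the distributions listed above, you may want to confirm which version ended up in your environment. The sketch below uses the standard library's `importlib.metadata`. The distribution name is inferred from the file names above, and the helper `installed_version` is a hypothetical name introduced for illustration.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="reasoning_from_scratch"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version()
if found is None:
    print("reasoning_from_scratch is not installed in this environment")
else:
    print(f"reasoning_from_scratch version: {found}")
```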

File details

Details for the file reasoning_from_scratch-0.1.3.tar.gz.

File metadata

  • Download URL: reasoning_from_scratch-0.1.3.tar.gz
  • Upload date:
  • Size: 29.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for reasoning_from_scratch-0.1.3.tar.gz
Algorithm Hash digest
SHA256 7428c234d91bd10a2e0187042334b2f3c1fa78619f35cdb22a2cfbee408e82f3
MD5 cdad999b53d773fc64a7e38df43665ea
BLAKE2b-256 3f4415f8361eeaeb9c1fc6a351d13b4825e321b60b8c9ee0a4ab88d2b78b4295
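
The digests above can be used to check a downloaded file for corruption or tampering. A minimal sketch with the standard library's `hashlib` follows; the helper names and the local file path in the usage comment are assumptions, and the expected value is the SHA256 digest listed above for the source distribution.

```python
import hashlib

# SHA256 digest listed above for reasoning_from_scratch-0.1.3.tar.gz
EXPECTED_SHA256 = "7428c234d91bd10a2e0187042334b2f3c1fa78619f35cdb22a2cfbee408e82f3"


def sha256_of_file(path, chunk_size=1 << 20):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected value."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage (assumes the archive was downloaded to the current directory):
# matches_expected("reasoning_from_scratch-0.1.3.tar.gz")
```

Reading in chunks keeps memory use constant regardless of file size.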

File details

Details for the file reasoning_from_scratch-0.1.3-py3-none-any.whl.

File hashes

Hashes for reasoning_from_scratch-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 a346c6644e0998214ab96fe633f69140077bb673213c628bb538e61c7f17ce2a
MD5 74a160617b19b41e6d6e043d3b4e664d
BLAKE2b-256 e2a66fae58cb9dddf663208df3e896a0427260952cd777691b89038d630db7e1
