Skip to main content

Small language model.

Project description

GitHub Pages PyPI

slangmod

Small language model.

Ever wondered how large language models (LLMs) like ChatGPT, Claude, LLama, Deepseek, etc., actually work, like, really work? I did. And I figured there is only one way to find out: Make one yourself. From scratch.

Of course, I wasn't expecting to beat the big players at their own game, but I wanted to know what you can do on consumer hardware (meaning a state-of-the art gaming PC with a single graphics card supported by PyTorch). So, naturally, it was going to be a small language model. These hardware limitations are reflected in software design choices. Specifically, slangmod does not employ any type of parallelization that would keep multiple GPUs busy at the same time, and all training data are loaded into CPU RAM at once, to be drip-fed to the model on the GPU from there (1 billion tokens take up about 7.5 GB worth of 64-bit integer numbers).

Having said that, slangmod provides everything you need to

  • preprocess and clean your text corpus;
  • chose and train one of the HuggingFace tokenizers;
  • specify a Transformer model including the type of positional encodings and the feedforward block;
  • train your model with a choice of optimizers and learning-rate schedulers, employing early-stopping if you like;
  • monitor convergence and experiment on hyperparameters;
  • explore text-generation algorithms like top-k, top-p or beamsearch;
  • and, finally, chat with your model.

To do all these things, slangmod provides a command-line interface (CLI) with fine-grained configuration options on one hand, and the raw building blocks it is made of on the other hand. Leveraging the foundational functionalities provided by the fiercely functional swak package, any other workflow can thus be quickly coded up.

Installation

  • Create a new virtual environment running at least python 3.12.
  • The easiest way of installing slangmod is from the python package index PyPI, where it is hosted. Simply type
    pip install slangmod
    
    or treat it like any other python package in your dependency management.
  • While it is, in principle, possible to run slangmod on the CPU, this is only intended for debugging purposes. To get any results in finite time, you also need a decent graphics card, and you must have a working installation of PyTorch to make good use of it. Because there is no way of knowing which version of CUDA (or ROC) you have installed on your machine and how you installed it, PyTorch is not an explicit dependency of slangmod. You will have to install it yourself, e.g., following these instructions. If you are using pipenv for dependency management, you can also have a look at the Pipfile in the root of the slangmod repository and taylor it to your needs. Personally, I go
    pipenv sync --categories=cpu
    
    for a CPU-only installation of PyTorch and
    pipenv sync --categories=cuda
    
    if I want GPU support.

Documentation

The documentation for both the CLI and the API of slangmod is hosted on GitHub Pages.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

slangmod-0.0.4.tar.gz (55.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

slangmod-0.0.4-py3-none-any.whl (87.5 kB view details)

Uploaded Python 3

File details

Details for the file slangmod-0.0.4.tar.gz.

File metadata

  • Download URL: slangmod-0.0.4.tar.gz
  • Upload date:
  • Size: 55.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for slangmod-0.0.4.tar.gz
Algorithm Hash digest
SHA256 53f9a1bea3053bd91cd2054b854e7657d867a0c7687e81108e2e73c6e1e1581a
MD5 1ba063dbfbf7dfd34e1cd2cffc149516
BLAKE2b-256 85c0d0395663e0aa229ce29184e418227982a524632132c5bef2b5b49b4db420

See more details on using hashes here.

Provenance

The following attestation bundles were made for slangmod-0.0.4.tar.gz:

Publisher: publish-package.yml on yedivanseven/slangmod

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file slangmod-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: slangmod-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 87.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for slangmod-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 b371a84d0ffdefffe8ece0437d492bb29e0c59ce55ddf73d7f344c3b60695e4f
MD5 dafcf385d3325790baca60e16c08a726
BLAKE2b-256 a5839d833567641188ee1b17bafefd06e69bc93c8597d8211150a7ae1de2e8f7

See more details on using hashes here.

Provenance

The following attestation bundles were made for slangmod-0.0.4-py3-none-any.whl:

Publisher: publish-package.yml on yedivanseven/slangmod

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page