Skip to main content

A 35M-parameter goldfish language model with a 10-second memory

Project description

GlubLM

License

the language model that already forgot this sentence

GlubLM is a 35M-parameter transformer that plays a goldfish with a 10-second memory. Inspired by GuppyLM and Ted Lasso's "be a goldfish" meditation on the happiest animal on earth, GlubLM has a hard 96-token context window - it literally cannot remember what was just said.

Unlike GuppyLM, GlubLM:

  • uses modern transformer components: RoPE + SwiGLU + RMSNorm
  • was trained on an 80K LLM-generated dataset produced by a team of four Claude agents, not hand-authored templates
  • runs in your browser via quantized ONNX (~40 MB) - try the demo

Quick start

Browser

Open the demo. Everything runs client-side - no backend.

Python

pip install glublm
glublm chat \
  --ckpt /path/to/glublm_60k_15ep.pt \
  --tokenizer /path/to/tokenizer_60k.json \
  --prompt "hello"

Or download the model from HuggingFace:

from huggingface_hub import hf_hub_download
ckpt = hf_hub_download("DenSec02/glublm-18m", "model.safetensors")
tok  = hf_hub_download("DenSec02/glublm-18m", "tokenizer.json")

Train from scratch

  1. Clone this repo
  2. pip install -e ".[dev,deploy]"
  3. Generate the dataset (see docs/DATASET.md)
  4. Train: glublm train --data data/glublm_60k.json --epochs 15 --batch-size 64 --lr 3e-4
  5. See docs/TRAINING.md for details

Architecture

  • ~36.1M parameters, 8 decoder-only transformer blocks
  • hidden 640, 10 attention heads, SwiGLU FFN (1280x2), RMSNorm
  • RoPE position encoding
  • Vocabulary: 5,120 BPE
  • Max context: 96 tokens (hard cap - the physical 10-second memory)
  • Test perplexity: 3.28

Details: docs/ARCHITECTURE.md

Comparison vs GuppyLM

See docs/COMPARISONS.md for the empirical comparison. Short version: GlubLM tests the hypothesis that modern ops help at sub-40M scale, which is something GuppyLM explicitly decided against.

Links

Credits

  • GuppyLM by Arman BD - the original tiny fish-persona model
  • Ted Lasso - the "be a goldfish" philosophy
  • Anthropic Claude - the multi-agent dataset generation team

License

AGPL-3.0 - see LICENSE.

Citation

@software{glublm_2026,
  author = {Sepede, Dennis},
  title = {GlubLM: a 35M goldfish language model with a 10-second memory},
  year = {2026},
  url = {https://github.com/Den-Sec/glublm}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

glublm-0.3.1.tar.gz (32.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

glublm-0.3.1-py3-none-any.whl (27.9 kB view details)

Uploaded Python 3

File details

Details for the file glublm-0.3.1.tar.gz.

File metadata

  • Download URL: glublm-0.3.1.tar.gz
  • Upload date:
  • Size: 32.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for glublm-0.3.1.tar.gz
Algorithm Hash digest
SHA256 497b5a006c7c93b6bd7d2b119b40ab29c725431201a3ac1860727aaf60de105b
MD5 fe23a98dd3a4a61babca358ac78710d4
BLAKE2b-256 c44f9839349d929278a71208a8f6a3ddeac7f0e5c3cd3442377198f9c7283794

See more details on using hashes here.

Provenance

The following attestation bundles were made for glublm-0.3.1.tar.gz:

Publisher: release.yml on Den-Sec/glublm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file glublm-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: glublm-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 27.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for glublm-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ffb837395a29436db15ac78eeff233004e59201eab71be67358f93907855b606
MD5 d2098aa9677eba5522027a7105608d31
BLAKE2b-256 df333d123f5788d68bdd06069258cf06f3b40bed609fc6c98df8f8f16457a7ff

See more details on using hashes here.

Provenance

The following attestation bundles were made for glublm-0.3.1-py3-none-any.whl:

Publisher: release.yml on Den-Sec/glublm

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page