A toolkit for managing and testing LM Studio models with automatic context limit discovery

These details have not been verified by PyPI

Project links

Project description

lmstrix

Manage, test, and run local language models through LM Studio from the command line. Its centrepiece is a binary-search algorithm that finds the true maximum context window of any model — so you stop guessing and stop crashing.

Background: what a context window is

A language model can only "see" a fixed number of tokens at once. This is its context window. Feed it more tokens than it can handle and it either crashes, silently truncates your input, or runs out of GPU memory mid-inference.

Every model has a theoretical maximum stated in its documentation. That number is often optimistic. The real limit depends on your hardware, quantisation level, and the LM Studio version. The only way to know for certain is to test it.

What lmstrix does

Scans your LM Studio models directory and builds a registry of available models
Tests models using binary search to find their actual maximum context window
Persists the results to a JSON registry so you never re-test a model you already know
Runs inference via LM Studio's local API with configurable prompts and context sizes
Reports test results and model metadata in formatted terminal tables

Install

pip install lmstrix
# or
uv pip install lmstrix

Requires LM Studio installed and running on localhost:1234 (the default).

Quick start

# Discover all models in your LM Studio directory
lmstrix scan

# List discovered models and their tested context limits
lmstrix list

# Find the true context limit for a specific model
lmstrix test "llama-3.2-3b-instruct"

# Run inference at a specific context size
lmstrix infer "llama-3.2-3b-instruct" --prompt "Explain quantum entanglement" --context 8192

How the context test works

Testing all possible context sizes would take hours. Binary search cuts that down to logarithmic time.

Start with the model's stated maximum (e.g. 131072 tokens).
Try loading the model at that size and running two simple inference checks: "Write 'ninety-six' as a number" and "2+3=".
If it succeeds, record that size as the working maximum.
If it fails (OOM, crash, timeout, zero tokens returned), halve the search space.
Repeat until the boundary is found within a small tolerance.

The test uses dual prompts because a single "say hello" prompt can succeed even when the model is misconfigured — it is too short to stress the context allocation. The two prompts require the model to produce specific, verifiable output.

Results include time-to-first-token (TTFT) and tokens-per-second (TPS) from the successful test run.

The model registry

Scan results and test results are persisted to a JSON file (default: ~/.local/share/lmstrix/models.json on Linux, similar paths on macOS/Windows). Subsequent scan runs update the registry without discarding test results. list reads from the registry without touching LM Studio.

CLI reference

lmstrix scan              Scan LM Studio models directory and update registry
lmstrix list              List all models with context limits and test status
lmstrix test <model-id>   Binary-search for true maximum context window
lmstrix infer <model-id>  Run inference; options: --prompt, --context, --max-tokens

Python API

from lmstrix.api import LMStudioClient
from lmstrix.core.context_tester import ContextTester
from lmstrix.core.scanner import ModelScanner

client = LMStudioClient()
scanner = ModelScanner()
registry = scanner.scan()

tester = ContextTester(client=client, verbose=True)
model = registry.get_model("llama-3.2-3b-instruct")
updated_model = tester.test_model(model, max_context=32768, registry=registry)

print(f"Max working context: {updated_model.tested_max_context}")
print(f"TTFT: {updated_model.ttft_seconds:.2f}s")
print(f"TPS: {updated_model.tps:.1f}")

LM Studio setup

LM Studio must be running with its local API server enabled (Settings → Local Server → Start Server). The default address is http://localhost:1234. Set LMSTUDIO_BASE_URL to override.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.83

Jun 10, 2026

1.0.82

Apr 11, 2026

1.0.78

Mar 6, 2026

1.0.76

Mar 4, 2026

1.0.75

Mar 4, 2026

1.0.74

Mar 4, 2026

1.0.70

Aug 6, 2025

1.0.69

Aug 6, 2025

1.0.68

Aug 6, 2025

1.0.67

Aug 4, 2025

1.0.66

Aug 4, 2025

1.0.65

Aug 4, 2025

1.0.64

Aug 4, 2025

1.0.63

Aug 4, 2025

1.0.62

Aug 3, 2025

1.0.61

Jul 31, 2025

1.0.60

Jul 31, 2025

1.0.59

Jul 31, 2025

1.0.58

Jul 31, 2025

1.0.57

Jul 30, 2025

1.0.56

Jul 30, 2025

1.0.55

Jul 29, 2025

1.0.54

Jul 29, 2025

1.0.53

Jul 29, 2025

1.0.52

Jul 29, 2025

1.0.51

Jul 27, 2025

1.0.50

Jul 27, 2025

1.0.49

Jul 27, 2025

1.0.48

Jul 27, 2025

1.0.47

Jul 26, 2025

1.0.46

Jul 26, 2025

1.0.45

Jul 26, 2025

1.0.44

Jul 26, 2025

1.0.43

Jul 26, 2025

1.0.42

Jul 26, 2025

1.0.41

Jul 25, 2025

1.0.39

Jul 25, 2025

1.0.38

Jul 25, 2025

1.0.37

Jul 25, 2025

1.0.36

Jul 25, 2025

1.0.35

Jul 25, 2025

1.0.34

Jul 25, 2025

1.0.33

Jul 25, 2025

1.0.32

Jul 25, 2025

1.0.31

Jul 25, 2025

1.0.29

Jul 25, 2025

1.0.28

Jul 25, 2025

1.0.27

Jul 25, 2025

1.0.26

Jul 25, 2025

1.0.25

Jul 25, 2025

1.0.24

Jul 25, 2025

1.0.23

Jul 25, 2025

1.0.22

Jul 25, 2025

1.0.21

Jul 25, 2025

1.0.20

Jul 25, 2025

1.0.19

Jul 25, 2025

1.0.18

Jul 24, 2025

1.0.17

Jul 24, 2025

1.0.16

Jul 24, 2025

1.0.15

Jul 24, 2025

1.0.14

Jul 24, 2025

1.0.13

Jul 24, 2025

1.0.12

Jul 24, 2025

1.0.11

Jul 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lmstrix-1.0.83.tar.gz (110.5 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lmstrix-1.0.83-py3-none-any.whl (115.4 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file lmstrix-1.0.83.tar.gz.

File metadata

Download URL: lmstrix-1.0.83.tar.gz
Upload date: Jun 10, 2026
Size: 110.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.17 {"installer":{"name":"uv","version":"0.11.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for lmstrix-1.0.83.tar.gz
Algorithm	Hash digest
SHA256	`833b63560b438d7870d85eb5336539684537a0841035735093295c81a24148b7`
MD5	`11fe556a8479c508d2ca05b6dc1cb03c`
BLAKE2b-256	`2bb3dd23962a4d5dfe7aed155bed25c18dad87a06a221812db72207afa581f38`

See more details on using hashes here.

File details

Details for the file lmstrix-1.0.83-py3-none-any.whl.

File metadata

Download URL: lmstrix-1.0.83-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 115.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.17 {"installer":{"name":"uv","version":"0.11.17","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for lmstrix-1.0.83-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c5fa0d1244b48f7e48c5e7b1490412e55aa1ebd083d82f3f4bd2b28cb05a0c1e`
MD5	`36aea6f1318a04bc1359296ee83f2ceb`
BLAKE2b-256	`7a0d6f847cb477a87d522c9983d7b9e5550aa27b7b91acd99da941a3940515c0`

See more details on using hashes here.

lmstrix 1.0.83

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

lmstrix

Background: what a context window is

What lmstrix does

Install

Quick start

How the context test works

The model registry

CLI reference

Python API

LM Studio setup

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes