A toolkit for managing and testing LM Studio models with automatic context limit discovery

These details have not been verified by PyPI

Project links

Project description

LMStrix: The Unofficial Toolkit for Mastering LM Studio

LMStrix is a professional, installable Python toolkit designed to supercharge your interaction with LM Studio. It provides a powerful command-line interface (CLI) and a clean Python API for managing, testing, and running local language models, with a standout feature: the Adaptive Context Optimizer.

Why LMStrix? The Problem it Solves

Working with local LLMs via LM Studio is powerful, but it comes with challenges:

The Context Window Mystery: What's the true maximum context a model can handle on your machine? Advertised context lengths are often theoretical. The practical limit depends on your hardware, the model's architecture, and LM Studio's own overhead. Finding this limit manually is a tedious, frustrating process of trial and error.
Repetitive Workflows: Managing models, crafting prompts, and running inference often involves repetitive boilerplate code or manual steps in the LM Studio UI.
Lack of Programmatic Control: The UI is great for exploration, but developers building applications on top of local LLMs need a robust, scriptable interface for automation and integration.

LMStrix solves these problems by providing a seamless, developer-friendly toolkit that automates the tedious parts and lets you focus on building.

How It Works: The Adaptive Context Optimizer

The core innovation in LMStrix is its ability to automatically discover the maximum operational context length for any model loaded in LM Studio.

It uses a sophisticated binary search algorithm:

It starts with a wide range for the possible context size.
It sends a specially crafted prompt to the model, progressively increasing the amount of "filler" text.
It analyzes the model's response (or lack thereof) to see if it successfully processed the context.
By repeatedly narrowing the search range, it quickly pinpoints the precise token count where the model's performance degrades or fails.

This gives you a reliable, empirical measurement of the model's true capabilities on your specific hardware, eliminating guesswork and ensuring your applications run with optimal performance.

Key Features

Automatic Context Optimization: Discover the true context limit of any model with the optimize command.
Full Model Management: Programmatically list available models and scan for newly downloaded ones.
Flexible Inference Engine: Run inference with a powerful two-phase prompt templating system that separates prompt structure from its content.
Rich CLI: A beautiful and intuitive command-line interface built with rich and fire, providing formatted tables, progress indicators, and clear feedback.
Modern Python API: An async-first API designed for high-performance, concurrent applications.
Robust and Resilient: Features automatic retries with exponential backoff for network requests and a comprehensive exception hierarchy.
Lightweight and Focused: Built with a minimal set of modern, high-quality dependencies.

Installation

# Using pip
pip install lmstrix

# Using uv (recommended)
uv pip install lmstrix

# For development
git clone https://github.com/twardoch/lmstrix
cd lmstrix
pip install -e .

Quick Start

Command-Line Interface (CLI)

# First, scan for available models in LM Studio
lmstrix scan

# List all models with their test status
lmstrix list

# Test the context limit for a specific model
lmstrix test "model-id-here"

# Test all untested models
lmstrix test --all

# Run inference on a model
lmstrix infer "Your prompt here" --model "model-id" --max-tokens 150

# Run inference with a prompt file
lmstrix infer "@prompts.toml:greeting" --model "model-id"

# Enable verbose output for debugging
lmstrix scan --verbose
lmstrix test "model-id" --verbose

Python API

import asyncio
from lmstrix import LMStrix

async def main():
    # Initialize the client
    lms = LMStrix()
    
    # Scan for available models
    await lms.scan_models()
    
    # List all models
    models = await lms.list_models()
    for model in models:
        print(f"Model: {model.id}")
        print(f"  Context limit: {model.context_limit:,} tokens")
        print(f"  Tested limit: {model.tested_max_context or 'Not tested'}")
        print(f"  Status: {model.context_test_status}")
    
    # Test a specific model's context limits
    model_id = models[0].id if models else None
    if model_id:
        print(f"\nTesting context limits for {model_id}...")
        result = await lms.test_model(model_id)
        print(f"Optimal context: {result.tested_max_context} tokens")
        print(f"Test status: {result.context_test_status}")
    
    # Run inference
    if model_id:
        response = await lms.infer(
            prompt="What is the meaning of life?",
            model_id=model_id,
            max_tokens=100
        )
        print(f"\nInference result:\n{response.content}")

if __name__ == "__main__":
    asyncio.run(main())

Batch Processing Example

from lmstrix.api.client import LMStudioClient
from lmstrix.core.scanner import ModelScanner

# Process multiple models
client = LMStudioClient()
scanner = ModelScanner(client)

# Scan and test all models
for model in scanner.scan():
    if not model.tested_max_context:
        print(f"Testing {model.id}...")
        # Testing happens automatically via CLI or API

Architecture

LMStrix is designed with a clean, modular architecture:

api/: A dedicated client for communicating with the LM Studio local server API.
core/: The heart of the application, containing the core business logic for models, inference, and the context optimization algorithm.
loaders/: Handles loading and managing data for models, prompts, and context files.
cli/: Implements the command-line interface.
utils/: Shared utilities and helper functions.

How Context Testing Works

LMStrix uses an innovative binary search algorithm to find the true operational context limit of each model:

Initial Range: Starts with the model's declared context size as the upper bound
Binary Search: Tests the model with progressively refined context sizes
Validation: Each test sends a simple prompt ("2+2=") padded with filler text to reach the target context size
Result Verification: Only marks a context size as "working" if the model returns the correct answer ("4")
Optimization: Finds the maximum context size that reliably works on your hardware

This process typically takes 30-60 seconds per model and saves the results for future use.

Development

# Clone the repository
git clone https://github.com/twardoch/lmstrix
cd lmstrix

# Install in development mode with all dependencies
pip install -e ".[dev]"

# Run the test suite
pytest

# Run with coverage
pytest --cov=src/lmstrix --cov-report=html

# Format code
black .
ruff format .

# Lint code
ruff check .
mypy src/lmstrix

# Build the package
python -m build

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are highly welcome! Please feel free to submit pull requests or file issues on our GitHub repository.

Requirements

Python 3.10 or higher
LM Studio installed and running locally
At least one model downloaded in LM Studio

Support

For bugs, feature requests, or general questions, please file an issue on our GitHub repository.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.82

Apr 11, 2026

1.0.78

Mar 6, 2026

1.0.76

Mar 4, 2026

1.0.75

Mar 4, 2026

1.0.74

Mar 4, 2026

1.0.70

Aug 6, 2025

1.0.69

Aug 6, 2025

1.0.68

Aug 6, 2025

1.0.67

Aug 4, 2025

1.0.66

Aug 4, 2025

1.0.65

Aug 4, 2025

1.0.64

Aug 4, 2025

1.0.63

Aug 4, 2025

1.0.62

Aug 3, 2025

1.0.61

Jul 31, 2025

1.0.60

Jul 31, 2025

1.0.59

Jul 31, 2025

1.0.58

Jul 31, 2025

1.0.57

Jul 30, 2025

1.0.56

Jul 30, 2025

1.0.55

Jul 29, 2025

1.0.54

Jul 29, 2025

1.0.53

Jul 29, 2025

1.0.52

Jul 29, 2025

1.0.51

Jul 27, 2025

1.0.50

Jul 27, 2025

1.0.49

Jul 27, 2025

1.0.48

Jul 27, 2025

1.0.47

Jul 26, 2025

1.0.46

Jul 26, 2025

1.0.45

Jul 26, 2025

1.0.44

Jul 26, 2025

1.0.43

Jul 26, 2025

1.0.42

Jul 26, 2025

1.0.41

Jul 25, 2025

1.0.39

Jul 25, 2025

1.0.38

Jul 25, 2025

1.0.37

Jul 25, 2025

1.0.36

Jul 25, 2025

1.0.35

Jul 25, 2025

1.0.34

Jul 25, 2025

1.0.33

Jul 25, 2025

1.0.32

Jul 25, 2025

1.0.31

Jul 25, 2025

1.0.29

Jul 25, 2025

1.0.28

Jul 25, 2025

1.0.27

Jul 25, 2025

1.0.26

Jul 25, 2025

1.0.25

Jul 25, 2025

This version

1.0.24

Jul 25, 2025

1.0.23

Jul 25, 2025

1.0.22

Jul 25, 2025

1.0.21

Jul 25, 2025

1.0.20

Jul 25, 2025

1.0.19

Jul 25, 2025

1.0.18

Jul 24, 2025

1.0.17

Jul 24, 2025

1.0.16

Jul 24, 2025

1.0.15

Jul 24, 2025

1.0.14

Jul 24, 2025

1.0.13

Jul 24, 2025

1.0.12

Jul 24, 2025

1.0.11

Jul 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lmstrix-1.0.24.tar.gz (45.7 kB view details)

Uploaded Jul 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lmstrix-1.0.24-py3-none-any.whl (33.6 kB view details)

Uploaded Jul 25, 2025 Python 3

File details

Details for the file lmstrix-1.0.24.tar.gz.

File metadata

Download URL: lmstrix-1.0.24.tar.gz
Upload date: Jul 25, 2025
Size: 45.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for lmstrix-1.0.24.tar.gz
Algorithm	Hash digest
SHA256	`0ee1bb6f93e65ab67a54dc53140cb847a8b064843fb8c7773334ead95ba64616`
MD5	`d7cf76ce48d9518920e8136d10949b5c`
BLAKE2b-256	`f1dde1349bf9bdc54a88ac11048e81be092349d806da783de1730c7259e19c3b`

See more details on using hashes here.

File details

Details for the file lmstrix-1.0.24-py3-none-any.whl.

File metadata

Download URL: lmstrix-1.0.24-py3-none-any.whl
Upload date: Jul 25, 2025
Size: 33.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for lmstrix-1.0.24-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b9f4f9faef91a1e38082e7e6438b396b382e6cd33ea0acad5d8ea5bba01c8b68`
MD5	`299ed2fe7e45118ae093f67d2405eeb3`
BLAKE2b-256	`3ccc26469e06ef48cc37fbefd9c3e4506b036ef3bd7d8af652ba215f23219548`

See more details on using hashes here.

lmstrix 1.0.24

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LMStrix: The Unofficial Toolkit for Mastering LM Studio

Why LMStrix? The Problem it Solves

How It Works: The Adaptive Context Optimizer

Key Features

Installation

Quick Start

Command-Line Interface (CLI)

Python API

Batch Processing Example

Architecture

How Context Testing Works

Development

License

Contributing

Requirements

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes