A toolkit for managing and testing LM Studio models with automatic context limit discovery

These details have not been verified by PyPI

Project links

Project description

LMStrix

LMStrix is a professional Python toolkit designed to supercharge your interaction with LM Studio. It provides a powerful command-line interface (CLI) and Python API for managing, testing, and running local language models, with a standout feature: Adaptive Context Optimization.

Key Features

🔍 Automatic Context Discovery: Binary search algorithm to find the true operational context limit of any model
📊 Beautiful Verbose Logging: Enhanced stats display with emojis showing inference metrics, timing, and token usage
🚀 Smart Model Management: Models persist between calls to reduce loading overhead
🎯 Flexible Inference Engine: Run inference with powerful prompt templating and percentage-based output control
📋 Comprehensive Model Registry: Track models, their context limits, and test results with JSON persistence
🛡️ Safety Controls: Configurable thresholds and fail-safes to prevent system crashes
💻 Rich CLI Interface: Beautiful terminal output with progress indicators and formatted tables
📈 Compact Test Output: Live-updating tables show test progress without verbose logging clutter

Installation

# Using pip
pip install lmstrix

# Using uv (recommended)
uv pip install lmstrix

Quick Start

Command-Line Interface

# Scan for available models in LM Studio
lmstrix scan

# List all models with their context limits and test status
lmstrix list

# Test context limit for a specific model
lmstrix test llama-3.2-3b-instruct

# Test all untested models with safety threshold
lmstrix test --all --threshold 102400

# Run inference with enhanced verbose logging
lmstrix infer "What is the capital of Poland?" -m llama-3.2-3b-instruct --verbose

# Run inference with percentage-based output tokens
lmstrix infer "Explain quantum computing" -m llama-3.2-3b-instruct --out_ctx "25%"

# Use file-based prompts with templates
lmstrix infer summary -m llama-3.2-3b-instruct --file_prompt adam.toml --text_file document.txt

# Direct text input for prompts
lmstrix infer "Summarize: {{text}}" -m llama-3.2-3b-instruct --text "Your content here"

Enhanced Verbose Output

When using --verbose, LMStrix provides comprehensive statistics:

════════════════════════════════════════════════════════════
🤖 MODEL: llama-3.2-3b-instruct
🔧 CONFIG: maxTokens=26214, temperature=0.7
📝 PROMPT (1 lines, 18 chars): Capital of Poland?
════════════════════════════════════════════════════════════
⠸ Running inference...
════════════════════════════════════════════════════════════
📊 INFERENCE STATS
⚡ Time to first token: 0.82s
⏱️  Total inference time: 11.66s
🔢 Predicted tokens: 338
📝 Prompt tokens: 5
🎯 Total tokens: 343
🚀 Tokens/second: 32.04
🛑 Stop reason: eosFound
════════════════════════════════════════════════════════════

Python API

from lmstrix.loaders.model_loader import load_model_registry
from lmstrix.core.inference_manager import InferenceManager

# Load model registry
registry = load_model_registry()

# List available models
models = registry.list_models()
print(f"Available models: {len(models)}")

# Run inference
manager = InferenceManager(verbose=True)
result = manager.infer(
    model_id="llama-3.2-3b-instruct",
    prompt="What is the meaning of life?",
    out_ctx=100,
    temperature=0.7
)

if result["succeeded"]:
    print(f"Response: {result['response']}")
    print(f"Tokens used: {result['tokens_used']}")
    print(f"Time: {result['inference_time']:.2f}s")

Context Testing & Optimization

LMStrix uses a sophisticated binary search algorithm to discover true model context limits:

Safety Features

Threshold Protection: Configurable maximum context size to prevent system crashes
Progressive Testing: Starts with small contexts and increases safely
Persistent Results: Saves test results to avoid re-testing

Testing Commands

# Test specific model
lmstrix test llama-3.2-3b-instruct

# Test all models with custom threshold
lmstrix test --all --threshold 65536

# Test at specific context size
lmstrix test --all --ctx 32768

# Reset and re-test a model
lmstrix test llama-3.2-3b-instruct --reset

# Test with custom prompt
lmstrix test llama-3.2-3b-instruct --prompt "What is 2+2?"

# Test with file-based prompt
lmstrix test llama-3.2-3b-instruct --file_prompt test_prompt.txt

Compact Output (Default)

When running without --verbose, tests display a clean, live-updating table:

Model                                   Context      Status
llama-3.2-3b-instruct                   32,768      Testing...
→
Model                                   Context      Status  
llama-3.2-3b-instruct                   32,768      ✓ Success

For batch testing with --all, a progress column is added to track multiple models.

Model Management

Registry Commands

# Scan for new models
lmstrix scan --verbose

# List models with different sorting
lmstrix list --sort size        # Sort by size
lmstrix list --sort ctx         # Sort by tested context
lmstrix list --show json        # Export as JSON

# Check system health
lmstrix health --verbose

Model Persistence

Models stay loaded between inference calls for improved performance:

When no explicit context is specified, models remain loaded
Last-used model is remembered for subsequent calls
Explicit context changes trigger model reloading

Prompt Templating

LMStrix supports flexible prompt templating with TOML files:

# adam.toml
[aps]
prompt = """
You are an AI assistant skilled in Abstractive Proposition Segmentation.
Convert the following text: {{text}}
"""

[summary] 
prompt = "Create a comprehensive summary: {{text}}"

Use with CLI:

lmstrix infer aps --file_prompt adam.toml --text "Your text here"
lmstrix infer summary --file_prompt adam.toml --text_file document.txt

Development

# Clone repository
git clone https://github.com/twardoch/lmstrix.git
cd lmstrix

# Install for development
pip install -e ".[dev]"

# Run tests
pytest

# Run linting
hatch run lint:all

Project Structure

src/lmstrix/
├── cli/main.py              # CLI interface
├── core/
│   ├── inference_manager.py # Unified inference engine
│   ├── models.py            # Model registry
│   └── context_tester.py    # Context limit testing
├── api/client.py            # LM Studio API client
├── loaders/                 # Data loading utilities
└── utils/                   # Helper utilities

Features in Detail

Adaptive Context Optimizer

Binary search algorithm for efficient context limit discovery
Safety thresholds to prevent system crashes
Automatic persistence of test results
Resume capability for interrupted tests

Enhanced Logging

Beautiful emoji-rich output in verbose mode
Comprehensive inference statistics
Progress indicators for long operations
Clear error messages with context

Smart Model Management

Automatic model discovery from LM Studio
Persistent registry with JSON storage
Model state tracking (loaded/unloaded)
Batch operations for multiple models

Requirements

Python 3.11+
LM Studio installed and configured
Models downloaded in LM Studio

License

MIT License - see LICENSE file for details.

Contributing

Contributions welcome! Please read our contributing guidelines and submit pull requests for any improvements.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.82

Apr 11, 2026

1.0.78

Mar 6, 2026

1.0.76

Mar 4, 2026

1.0.75

Mar 4, 2026

1.0.74

Mar 4, 2026

This version

1.0.70

Aug 6, 2025

1.0.69

Aug 6, 2025

1.0.68

Aug 6, 2025

1.0.67

Aug 4, 2025

1.0.66

Aug 4, 2025

1.0.65

Aug 4, 2025

1.0.64

Aug 4, 2025

1.0.63

Aug 4, 2025

1.0.62

Aug 3, 2025

1.0.61

Jul 31, 2025

1.0.60

Jul 31, 2025

1.0.59

Jul 31, 2025

1.0.58

Jul 31, 2025

1.0.57

Jul 30, 2025

1.0.56

Jul 30, 2025

1.0.55

Jul 29, 2025

1.0.54

Jul 29, 2025

1.0.53

Jul 29, 2025

1.0.52

Jul 29, 2025

1.0.51

Jul 27, 2025

1.0.50

Jul 27, 2025

1.0.49

Jul 27, 2025

1.0.48

Jul 27, 2025

1.0.47

Jul 26, 2025

1.0.46

Jul 26, 2025

1.0.45

Jul 26, 2025

1.0.44

Jul 26, 2025

1.0.43

Jul 26, 2025

1.0.42

Jul 26, 2025

1.0.41

Jul 25, 2025

1.0.39

Jul 25, 2025

1.0.38

Jul 25, 2025

1.0.37

Jul 25, 2025

1.0.36

Jul 25, 2025

1.0.35

Jul 25, 2025

1.0.34

Jul 25, 2025

1.0.33

Jul 25, 2025

1.0.32

Jul 25, 2025

1.0.31

Jul 25, 2025

1.0.29

Jul 25, 2025

1.0.28

Jul 25, 2025

1.0.27

Jul 25, 2025

1.0.26

Jul 25, 2025

1.0.25

Jul 25, 2025

1.0.24

Jul 25, 2025

1.0.23

Jul 25, 2025

1.0.22

Jul 25, 2025

1.0.21

Jul 25, 2025

1.0.20

Jul 25, 2025

1.0.19

Jul 25, 2025

1.0.18

Jul 24, 2025

1.0.17

Jul 24, 2025

1.0.16

Jul 24, 2025

1.0.15

Jul 24, 2025

1.0.14

Jul 24, 2025

1.0.13

Jul 24, 2025

1.0.12

Jul 24, 2025

1.0.11

Jul 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lmstrix-1.0.70.tar.gz (72.5 kB view details)

Uploaded Aug 6, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lmstrix-1.0.70-py3-none-any.whl (65.4 kB view details)

Uploaded Aug 6, 2025 Python 3

File details

Details for the file lmstrix-1.0.70.tar.gz.

File metadata

Download URL: lmstrix-1.0.70.tar.gz
Upload date: Aug 6, 2025
Size: 72.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for lmstrix-1.0.70.tar.gz
Algorithm	Hash digest
SHA256	`9275325ab5a371ee2b949133f4dd222d17dc76fc0bc712af2db2a2025e484e75`
MD5	`52bf1824e3159043d775e9d441927deb`
BLAKE2b-256	`c346716b8cb2a8c0a1975a6ab61176c0165abc7fd7860a526b70f963f4198abe`

See more details on using hashes here.

File details

Details for the file lmstrix-1.0.70-py3-none-any.whl.

File metadata

Download URL: lmstrix-1.0.70-py3-none-any.whl
Upload date: Aug 6, 2025
Size: 65.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for lmstrix-1.0.70-py3-none-any.whl
Algorithm	Hash digest
SHA256	`51e71cf882663f2571d69a5b208cd011f13937ec7451190dba36db02e01fab48`
MD5	`0d19e166b7651c844c286aba43d1636b`
BLAKE2b-256	`911e12a2a366d73105b897ea7039f16b2b62417ceab1d9e61e308b633b011563`

See more details on using hashes here.

lmstrix 1.0.70

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

LMStrix

Key Features

Installation

Quick Start

Command-Line Interface

Enhanced Verbose Output

Python API

Context Testing & Optimization

Safety Features

Testing Commands

Compact Output (Default)

Model Management

Registry Commands

Model Persistence

Prompt Templating

Development

Project Structure

Features in Detail

Adaptive Context Optimizer

Enhanced Logging

Smart Model Management

Requirements

License

Contributing

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes