Serve MESA models locally

These details have not been verified by PyPI

Project description

MESA local

Serve MESA models locally.

⬇️ Downloads weights from S3
📦 Unpacks
🚀 Serves via a local OpenAI-compatible server

Prerequisites

Software

Python 3.12

Hardware

A GPU with >=24GB VRAM (tested on NVIDIA A30)

Configuration

Create a file called .env in the directory where you intend to run this package. Populate it with the details you have been provided with in the following format:

MODEL_NAME=
WEIGHTS_ID=
WEIGHTS_KEY=

(Alternative) S3 URI

Download weights directly from an S3 bucket:

MODEL_NAME=
WEIGHTS_URI=
WEIGHTS_REGION=  # optional, defaults to eu-west-2

(Optional) Caching

Download weights and cache to S3 for faster subsequent downloads:

MODEL_NAME=
WEIGHTS_ID=
WEIGHTS_KEY=
WEIGHTS_URI=
WEIGHTS_REGION=  # optional, defaults to eu-west-2

With this configuration:

First run: Downloads weights and uploads to S3 cache
Subsequent runs: Downloads directly from S3 cache (faster)

vLLM configuration

The package provides a set of vLLM configuration files for running a specific model on a specific GPU. In addition to MODEL_NAME, this can be specified by adding GPU to the .env.

Individual vLLM settings can also be overridden by adding them to the .env file:

Setting	Alias	Type	Default
`MODEL`	`MODEL_NAME`	`str`	`mesalocal`
`GPU`		`str`	`None`
`MAX_MODEL_LEN`	`MODEL_LENGTH`	`int`	`41152`
`ENFORCE_EAGER`		`bool`	`False`
`ENABLE_CHUNKED_PREFILL`		`bool`	`True`
`ENABLE_PREFIX_CACHING`		`bool`	`True`
`GPU_MEMORY_UTILIZATION`		`float`	`0.9`
`MAX_NUM_SEQS`		`int`	`256`
`MAX_NUM_BATCHED_TOKENS`		`int`	`None`
`ENABLE_LOG_REQUESTS`		`bool`	`False`
`UVICORN_LOG_LEVEL`		`str`	`warning`
`HTTP_TIMEOUT_KEEP_ALIVE`		`int`	`30`

Installation

(Recommended) Create a virtual environment and activate it:
```
python -m venv .venv
source .venv/bin/activate
```
Install this package: pip install londonaicentre-mesa-local.

Usage

CLI (primary)

Note command line arguments:

Argument Description

-v, --verbose Enable debug output (optional)
Start the server as follows: mesalocal [args].

Argument	Description
-v, --verbose	Enable debug output (optional)

Library (secondary)

Import and use the logic of this package as a library:

import asyncio
from mesalocal.weights import Weights
from mesalocal.inferrer import VLLM
vllm_config: VLLMConfig = VLLMConfig() # VLLMConfig(model_name="foo", gpu="bar") to use a vLLM config without a .env file
weights: Weights = Weights(vllm_config.model)
if weights.unpack():
    vllm: VLLM = VLLM(weights.get_model_folder(), vllm_config)
    async def run():
        async for output in vllm.generate(prompt):
            print(output.outputs[0].text)
    asyncio.run(run())

Clients

OpenAI (example with Oncollama)

Interact with the server using the OpenAI client in python:

from openai import OpenAI
from oncoschema.prompt_builder import PromptBuilder # pip install londonaicentre-oncoschema

client = OpenAI(
    base_url="http://localhost:5000/v1",
    api_key="blank" 
)

response = client.chat.completions.create(
    model="oncollama3betav01",
    messages=[
        {"role": "system", "content": PromptBuilder().build_main_prompt()},
        {"role": "user", "content": "Diagnosis 01/01/26..."}
    ]
)

print(response.choices[0].message.content)

License

This project uses a proprietary license (see LICENSE).

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.7.1

Apr 30, 2026

2.7.0

Apr 29, 2026

2.6.0

Apr 29, 2026

2.5.3

Apr 29, 2026

This version

2.5.2

Apr 29, 2026

2.5.1

Apr 29, 2026

2.5.0

Apr 24, 2026

2.4.1

Mar 31, 2026

2.4.0

Mar 28, 2026

2.3.3

Mar 27, 2026

2.3.2

Mar 27, 2026

2.3.1

Mar 25, 2026

2.3.0

Mar 24, 2026

2.2.0

Mar 18, 2026

2.1.0

Mar 11, 2026

2.0.0

Mar 9, 2026

1.4.5

Mar 5, 2026

1.4.4

Mar 5, 2026

1.4.3

Mar 5, 2026

1.4.2

Mar 1, 2026

1.4.1

Feb 27, 2026

1.4.0

Feb 27, 2026

1.3.1

Feb 16, 2026

1.3.0

Jan 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

londonaicentre_mesa_local-2.5.2.tar.gz (27.2 kB view details)

Uploaded Apr 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

londonaicentre_mesa_local-2.5.2-py3-none-any.whl (24.4 kB view details)

Uploaded Apr 29, 2026 Python 3

File details

Details for the file londonaicentre_mesa_local-2.5.2.tar.gz.

File metadata

Download URL: londonaicentre_mesa_local-2.5.2.tar.gz
Upload date: Apr 29, 2026
Size: 27.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Amazon Linux","version":"2023","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for londonaicentre_mesa_local-2.5.2.tar.gz
Algorithm	Hash digest
SHA256	`20f5ee8e24329444bd5756a3b35f3d3da0442d02608f41e5c4cc06c0a6e61ecd`
MD5	`e86834cdc5189b178faeb289105dfa76`
BLAKE2b-256	`83f5ad0bc68a210b2503b1f3a3a4ad95c2f7fc815972021f364559f9543d9aec`

See more details on using hashes here.

File details

Details for the file londonaicentre_mesa_local-2.5.2-py3-none-any.whl.

File metadata

Download URL: londonaicentre_mesa_local-2.5.2-py3-none-any.whl
Upload date: Apr 29, 2026
Size: 24.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.8 {"installer":{"name":"uv","version":"0.11.8","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Amazon Linux","version":"2023","id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for londonaicentre_mesa_local-2.5.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a9f40ac97fef9c6b302692744f916757c5460a11f7f7a92404e983fa91807cd8`
MD5	`b6e1934d3074661a8c826429c10ee6bc`
BLAKE2b-256	`58b25af3cdd240e23104568d34796bfe0c08f677a0627d7bed5b613e70acf110`

See more details on using hashes here.

londonaicentre-mesa-local 2.5.2

Navigation

Verified details

Owner

Unverified details

Meta

Project description

MESA local

Prerequisites

Software

Hardware

Configuration

(Alternative) S3 URI

(Optional) Caching

vLLM configuration

Installation

Usage

CLI (primary)

Library (secondary)

Clients

OpenAI (example with Oncollama)

License

Project details

Verified details

Owner

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes