A model aggregator service for multiple LLM backends.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Wuodan

Project description

LLM Aggregator

LLM Aggregator keeps a live list of every model exposed by your local OpenAI-compatible servers.

Features

Polls models from configured LLM provider servers (/v1/models).
Enriches model information with a helper LLM.
Optionally hands model information from external websites to helper LLM.
Ships with a minimal UI showing providers, models, and host RAM.
The builtin UI can easily be replaced.

Web Interface

The builtin UI shows a single table plus a small RAM widget, so you immediately see what is running:

Model	Base URL	Types	Family	Context	Quant	Params	Summary
llama3.1:8b	`http://10.7.2.100:11434/v1`	llm	Llama 3.1	8K	Q4_K_M	8B	General chat tuned for balance
qwen2.5:14b	`http://10.7.2.100:8080/v1`	llm,embed	Qwen 2.5	32K	Q5_0	14B	Multilingual reasoning focused

Columns:

Model – identifier reported by the provider.
Base URL – where the model is served.
Types – capabilities (LLM, VLM, embedder, etc.).
Family – base architecture inferred by the helper LLM.
Context – approximate context window in tokens.
Quant – quantization hinted by the model name or docs.
Params – estimated parameter count.
Summary – one-line description generated by the helper LLM.

Installation

Prerequisites

Python 3.10 or higher
LLM servers (Ollama, llama.cpp, nexa, etc.) with OpenAI-compatible APIs

Install from PyPI

pip install llm-aggregator

Usage

Set the LLM_AGGREGATOR_CONFIG environment variable to point at your config.yaml and the service will load it on startup.

Starting the Service

export LLM_AGGREGATOR_CONFIG=/path/to/config.yaml
llm-aggregator

Or run directly:

export LLM_AGGREGATOR_CONFIG=/path/to/config.yaml
python -m llm_aggregator

By default, the web interface will be available at http://localhost:8888.

Configuration

All runtime behavior is controlled through the YAML file pointed to by the LLM_AGGREGATOR_CONFIG environment variable. Use config.yaml as a reference template.

UI modes

Use static_enabled and custom_static_path to set one of three modes:

static_enabled: true (default) serves the built-in UI.
static_enabled: true and custom_static_path: /path/to/dir serves your files instead of the built-in UI.
static_enabled: false serves no UI at all. Provide your own UI using the REST endpoints.

Configuration Options

host / port – Where the FastAPI server and static frontend bind.
log_level – Logging verbosity (DEBUG, INFO, WARNING, ERROR, CRITICAL). Defaults to INFO if omitted.
log_format – Optional logging format string. When omitted the service leaves existing logging configuration untouched.
logger_overrides – Map of logger names to override their logging level (e.g., httpx: WARNING).
brain – Settings for the enrichment LLM:
- base_url – HTTP endpoint of the enrichment provider.
- id – Model identifier passed to the provider.
- api_key – Optional API-Key.
- max_batch_size – Number of models to enrich at once (defaults to 1).
providers – Map of provider name to an OpenAI-compatible backend to query:
- base_url – Public URL returned via the REST API.
- internal_base_url – Optional internal URL used for server-to-server calls; defaults to base_url when omitted.
- api_key – Optional API-Key for that provider.
- files_size_gatherer – Optional block to report on-disk model size:
  - path – Script or executable invoked as <path> <base_path> <full_model_name>.
  - base_path – Filesystem root passed to the script.
  - timeout_seconds – Optional per-provider timeout (default: 15s).
model_info_sources – Optional external websites where model information is fetched from for enrichment. Each entry requires a human-readable name (shown to the LLM) and a url_template that contains {model_id}.
time – Background scheduling knobs (all in seconds):
- fetch_models_interval
- fetch_models_timeout
- enrich_models_timeout
- enrich_idle_sleep
- website_markdown_cache_ttl – TTL for cached markdown scraped from external sources.
ui – Optional static UI:
- static_enabled – true: static web frontend is served at /index.html and assets at /static.
- custom_static_path – Optional directory to replace the bundled UI; must contain a readable index.html and asset files.
brain_prompts – LLM instructions kept separate so the block can live at the end of the YAML:
- system – System message injected ahead of every enrichment request.
- user – Main user instruction describing the enrichment JSON contract.
- model_info_prefix_template – Optional prefix template applied to fetched markdown snippets; receives {model_id} and {provider_label} placeholders.

REST API

GET /v1/models – OpenAI ListModelsResponse plus a meta object on each data item with the enriched metadata. Example:

{
  "object": "list",
  "data": [
    {
      "id": "llama3.1:8b",
      "object": "model",
      "created": 1,
      "owned_by": "ollama",
      "meta": {
        "base_url": "http://127.0.0.1:11434/v1",
        "types": ["llm"],
        "model_family": "Llama 3.1",
        "context_size": "8K",
        "quant": "Q4_K_M",
        "param": "8B",
        "size": 481406976,
        "summary": "General chat tuned for balance"
      }
    }
  ]
}

GET /api/stats – Returns an array of recent RAM usage percentages sampled for the Chart.js widget in the UI
```
[57.5,57.6,57.6]
```
POST /api/clear – Empty request; clears model cache and restarts model information collection.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Wuodan

Release history Release notifications | RSS feed

0.1.12

Nov 28, 2025

0.1.11

Nov 28, 2025

0.1.10

Nov 28, 2025

This version

0.1.9

Nov 28, 2025

0.1.8

Nov 28, 2025

0.1.7

Nov 28, 2025

0.1.6

Nov 27, 2025

0.1.5

Nov 20, 2025

0.1.4

Nov 20, 2025

0.1.3

Nov 19, 2025

0.0.3

Nov 17, 2025

0.0.2

Nov 17, 2025

0.0.1

Nov 17, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_aggregator-0.1.9.tar.gz (44.9 kB view details)

Uploaded Nov 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_aggregator-0.1.9-py3-none-any.whl (36.4 kB view details)

Uploaded Nov 28, 2025 Python 3

File details

Details for the file llm_aggregator-0.1.9.tar.gz.

File metadata

Download URL: llm_aggregator-0.1.9.tar.gz
Upload date: Nov 28, 2025
Size: 44.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_aggregator-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`9bf3abd0b9f3e4306a6325c8f56435fc72e43de194b69448ae553d2bc7e767d6`
MD5	`de1de51b401f2e4a974375003db53d88`
BLAKE2b-256	`5efd2964ed44be92a999f6d03570f3a5122e97d1b6022f88866cc7d606f13446`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_aggregator-0.1.9.tar.gz:

Publisher: ci.yml on Wuodan/llm-aggregator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_aggregator-0.1.9.tar.gz
- Subject digest: 9bf3abd0b9f3e4306a6325c8f56435fc72e43de194b69448ae553d2bc7e767d6
- Sigstore transparency entry: 730031505
- Sigstore integration time: Nov 28, 2025
Source repository:
- Permalink: Wuodan/llm-aggregator@6c6a84b5e4c2065fa31adf35a7bd593569670996
- Branch / Tag: refs/tags/0.1.9
- Owner: https://github.com/Wuodan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@6c6a84b5e4c2065fa31adf35a7bd593569670996
- Trigger Event: push

File details

Details for the file llm_aggregator-0.1.9-py3-none-any.whl.

File metadata

Download URL: llm_aggregator-0.1.9-py3-none-any.whl
Upload date: Nov 28, 2025
Size: 36.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llm_aggregator-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bc3640aeb39bd4ca8df878f2ec9849d56098f5a18fed904739b7a9b6168d120f`
MD5	`30e3cf348f64e2b97e7ba3c961215471`
BLAKE2b-256	`b541578710d74c2f45096988700808dadba5ff1fee6ba954a66596df36ca7204`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llm_aggregator-0.1.9-py3-none-any.whl:

Publisher: ci.yml on Wuodan/llm-aggregator

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llm_aggregator-0.1.9-py3-none-any.whl
- Subject digest: bc3640aeb39bd4ca8df878f2ec9849d56098f5a18fed904739b7a9b6168d120f
- Sigstore transparency entry: 730031506
- Sigstore integration time: Nov 28, 2025
Source repository:
- Permalink: Wuodan/llm-aggregator@6c6a84b5e4c2065fa31adf35a7bd593569670996
- Branch / Tag: refs/tags/0.1.9
- Owner: https://github.com/Wuodan
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@6c6a84b5e4c2065fa31adf35a7bd593569670996
- Trigger Event: push

llm-aggregator 0.1.9

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

LLM Aggregator

Features

Web Interface

Installation

Prerequisites

Install from PyPI

Usage

Starting the Service

Configuration

UI modes

Configuration Options

REST API

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance