Wrap a call to any remote LLM model and expose it as an MCP server tool to allow your main model to communicate with other models.

Project description

LLM Wrapper MCP Server

"Allow any MCP-capable LLM agent to communicate with or delegate tasks to any other LLM available through the OpenRouter.ai API."

A Model Context Protocol (MCP) server wrapper designed to facilitate seamless interaction with various Large Language Models (LLMs) through a standardized interface. This project enables developers to integrate LLM capabilities into their applications by providing a robust and flexible server that handles LLM calls, tool execution, and result processing.

Features

Implements the Model Context Protocol (MCP) specification for standardized LLM interactions.
Provides a FastAPI-based server for handling LLM requests and responses.
Supports advanced features like tool calls and results through the MCP protocol.
Configurable to use various LLM providers (e.g., OpenRouter, local models).
Designed for extensibility, allowing easy integration of new LLM backends.
Integrates with llm-accounting for robust logging, rate limiting, and audit functionalities, enabling monitoring of remote LLM usage, inference costs, and inspection of queries/responses for debugging or legal purposes.

Dependencies

This project relies on the following key dependencies:

Core Dependencies:

fastapi: A modern, fast (high-performance) web framework for building APIs with Python 3.7+.
uvicorn: An ASGI server, used to run FastAPI applications.
pydantic: Data validation and settings management using Python type hints.
pydantic-settings: Pydantic's settings management for environment variables and configuration.
python-dotenv: Reads key-value pairs from a .env file and sets them as environment variables.
requests: An elegant and simple HTTP library for Python.
tiktoken: A fast BPE tokeniser for use with OpenAI's models.
llm_accounting: For robust logging, rate limiting, and audit functionalities, enabling monitoring of remote LLM usage, inference costs, and inspection of queries/responses for debugging or legal purposes.

Development Dependencies:

pytest: A mature full-featured Python testing framework.
black: An uncompromising Python code formatter.
isort: A Python utility / library to sort imports alphabetically, and automatically separate into sections and by type.
mypy: An optional static type checker for Python.
pytest-mock: A pytest plugin that provides a mocker fixture for easier mocking.

Installation

Create and activate a virtual environment:

python -m venv .venv
source .venv/bin/activate  # On Windows: .venv\Scripts\activate

Install the package:

pip install -e .

Configuration

Create a .env file in the project root with the following variables:

OPENROUTER_API_KEY=your_openrouter_api_key_here
# Optional: Override default model
OPENROUTER_MODEL=your_preferred_model

The server is configured to use OpenRouter by default with the following settings:

API Base URL: https://openrouter.ai/api/v1
Default Model: perplexity/llama-3.1-sonar-small-128k-online

Usage

Running the Server

To run the server, execute the following command:

python -m llm_wrapper_mcp_server

The server will start on http://localhost:8000 by default.

API Endpoints

POST /ask: Main endpoint for LLM requests
GET /health: Health check endpoint

Client Code Examples

The llm-wrapper-mcp-server package can be used by client applications to create their own MCP servers and interact with remote LLM models. Here's an example of how to set up a basic client:

from llm_wrapper_mcp_server.llm_mcp_server import LLMMCPWrapperServer
from llm_wrapper_mcp_server.llm_client import LLMClient

# Initialize the LLM MCP Wrapper Server
# This server will handle communication with the actual LLM provider
llm_server = LLMMCPWrapperServer()

# Initialize the LLM Client
# This client can be used by your application to send requests to the LLM server
llm_client = LLMClient(server_url="http://localhost:8000") # Assuming your server is running locally

async def main():
    # Example: Ask the LLM a question
    response = await llm_client.ask("What is the capital of France?")
    print(f"LLM Response: {response}")

    # Example: Use a tool (if supported by the LLM and configured)
    # This is a simplified example, actual tool usage depends on your MCP server's capabilities
    tool_response = await llm_client.use_tool("calculator", {"expression": "2+2"})
    print(f"Tool Response: {tool_response}")

if __name__ == "__main__":
    import asyncio
    asyncio.run(main())

CLI Mode Usage

The llm-wrapper-mcp-server can also be used directly from the command line for quick interactions or testing.

Basic Query:

python -m llm_wrapper_mcp_server --query "Tell me a short story about a robot."

Query with Model Specification:

python -m llm_wrapper_mcp_server --query "What is the square root of 144?" --model "perplexity/llama-3.1-sonar-small-128k-online"

Query with Tool Call (if configured):

python -m llm_wrapper_mcp_server --query "Calculate 15 * 3." --tool "calculator" --tool-args '{"expression": "15 * 3"}'

Development

For a detailed overview of the project's directory and file structure, refer to docs/STRUCTURE.md. This document is useful for understanding the codebase during development.

Install development dependencies:

pip install -e ".[dev]"

License

MIT License

Project details

Release history Release notifications | RSS feed

This version

0.1.3

May 29, 2025

0.1.2

May 29, 2025

0.1.1

May 29, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_wrapper_mcp_server-0.1.3.tar.gz (18.7 kB view details)

Uploaded May 29, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_wrapper_mcp_server-0.1.3-py3-none-any.whl (14.9 kB view details)

Uploaded May 29, 2025 Python 3

File details

Details for the file llm_wrapper_mcp_server-0.1.3.tar.gz.

File metadata

Download URL: llm_wrapper_mcp_server-0.1.3.tar.gz
Upload date: May 29, 2025
Size: 18.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for llm_wrapper_mcp_server-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`0b49c228775a23c9ae80441635fd63060f5a1e6280ad86ee8a54d781e72eaaad`
MD5	`3743c1d3c4944fcf745a96efa65e4b3d`
BLAKE2b-256	`160a9fa799c3dd0860583b07c4ca8ae19af187ba597c830312d674fba7961886`

See more details on using hashes here.

File details

Details for the file llm_wrapper_mcp_server-0.1.3-py3-none-any.whl.

File metadata

Download URL: llm_wrapper_mcp_server-0.1.3-py3-none-any.whl
Upload date: May 29, 2025
Size: 14.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.3

File hashes

Hashes for llm_wrapper_mcp_server-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3fb68b3a1017eb39ded5cf090c1206d50641f720707ef18fe0a63727fae02f71`
MD5	`0785cdd12a2ba81de770c26d4297f11b`
BLAKE2b-256	`783d94ace907d1ff2f1b05dbdc7881be0cb828f977c13f5c964dfb5db471e0f9`

See more details on using hashes here.

llm-wrapper-mcp-server 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

LLM Wrapper MCP Server

Features

Dependencies

Core Dependencies:

Development Dependencies:

Installation

Configuration

Usage

Running the Server

API Endpoints

Client Code Examples

CLI Mode Usage

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes