Multi-server MCP client for LLM tool orchestration
Casual MCP
Casual MCP is a Python framework for building, evaluating, and serving LLMs with tool-calling capabilities using Model Context Protocol (MCP). It includes:
- A multi-server MCP client using FastMCP
- Provider support for OpenAI and Ollama (powered by casual-llm)
- A recursive tool-calling chat loop
- Usage statistics tracking (tokens, tool calls, LLM calls)
- System prompt templating with Jinja2
- A basic API exposing a chat endpoint
Features
- Plug-and-play multi-server tool orchestration
- OpenAI and Ollama LLM providers (via casual-llm)
- Usage statistics tracking (tokens, tool calls, LLM calls)
- Prompt templating with Jinja2
- Configurable via JSON
- CLI and API access
- Extensible architecture
Installation
Uv
uv add casual-mcp
Pip
pip install casual-mcp
Or for development:
git clone https://github.com/casualgenius/casual-mcp.git
cd casual-mcp
uv sync --group dev
System Prompt Templates
System prompts are defined as Jinja2 templates in the prompt-templates/ directory.
They are used in the config file to specify a system prompt to use per model.
This allows you to define custom prompts for each model, which is useful when using models that do not natively support tools. Templates are passed the tool list in the tools variable.
# prompt-templates/example_prompt.j2
Here is a list of functions in JSON format that you can invoke:
[
{% for tool in tools %}
{
"name": "{{ tool.name }}",
"description": "{{ tool.description }}",
"parameters": {
{% for param_name, param in tool.inputSchema.items() %}
"{{ param_name }}": {
"description": "{{ param.description }}",
"type": "{{ param.type }}"{% if param.default is defined %},
"default": "{{ param.default }}"{% endif %}
}{% if not loop.last %},{% endif %}
{% endfor %}
}
}{% if not loop.last %},{% endif %}
{% endfor %}
]
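Casual MCP renders these templates for you at chat time, but you can preview the output by rendering the template with Jinja2 directly. A minimal sketch, assuming a single hypothetical tool shaped like the objects the template expects:
# Standalone sketch: render the template yourself to see what the model receives.
# The tool below is a hypothetical stand-in for what the MCP client would provide.
from jinja2 import Environment, FileSystemLoader

env = Environment(loader=FileSystemLoader("prompt-templates"))
template = env.get_template("example_prompt.j2")

tools = [
    {
        "name": "get_time",
        "description": "Get the current time for a city",
        "inputSchema": {
            "city": {"description": "The city name", "type": "string"},
        },
    },
]

print(template.render(tools=tools))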
Configuration File (casual_mcp_config.json)
See the Programmatic Usage section to build configs and messages with typed models.
The CLI and API can be configured using a casual_mcp_config.json file that defines:
- Available models and their providers
- Available MCP tool servers
- Optional tool namespacing behavior
Example
{
"models": {
"gpt-4.1": {
"provider": "openai",
"model": "gpt-4.1"
},
"lm-qwen-3": {
"provider": "openai",
"endpoint": "http://localhost:1234/v1",
"model": "qwen3-8b",
"template": "lm-studio-native-tools"
},
"ollama-qwen": {
"provider": "ollama",
"endpoint": "http://localhost:11434",
"model": "qwen2.5:7b-instruct"
}
},
"servers": {
"time": {
"command": "python",
"args": ["mcp-servers/time/server.py"]
},
"weather": {
"url": "http://localhost:5050/mcp"
}
}
}
models
Each model has:
- provider: "openai" or "ollama"
- model: the model name (e.g., gpt-4.1, qwen2.5:7b-instruct)
- endpoint: optional custom endpoint
  - For OpenAI: custom OpenAI-compatible backends (e.g., LM Studio at http://localhost:1234/v1)
  - For Ollama: defaults to http://localhost:11434 if not specified
- template: optional Jinja2 template name for custom system prompt formatting (useful for models without native tool support)
servers
Servers can either be local (over stdio) or remote.
Local config:
- command: the command to run the server, e.g. python, npm
- args: the arguments to pass to the server as a list, e.g. ["time/server.py"]
- Optional: env for subprocess environment variables, system_prompt to override the server prompt
Remote config:
- url: the URL of the MCP server
- Optional: transport: the transport type, one of http, sse, streamable-http. Defaults to http
Environment Variables
- OPENAI_API_KEY: required when using the openai provider (can be any string when using local OpenAI-compatible APIs)
- TOOL_RESULT_FORMAT: adjusts the format of tool results returned to the LLM
  - Options: result, function_result, function_args_result
  - Default: result
- MCP_TOOL_CACHE_TTL: tool cache TTL in seconds (default: 30, set to 0 for indefinite caching)
- LOG_LEVEL: logging level (default: INFO)
You can set them using export or by creating a .env file.
CLI Reference
casual-mcp serve
Start the API server.
Options:
- --host: host to bind (default 0.0.0.0)
- --port: port to serve on (default 8000)
casual-mcp servers
Loads the config and outputs the list of MCP servers you have configured.
Example Output
$ casual-mcp servers
Name      Type    Command / Url                   Env
math      local   mcp-servers/math/server.py
time      local   mcp-servers/time-v2/server.py
weather   local   mcp-servers/weather/server.py
words     remote  https://localhost:3000/mcp
casual-mcp models
Loads the config and outputs the list of models you have configured.
Example Output
$ casual-mcp models
Name           Provider  Model                     Endpoint
lm-phi-4-mini  openai    phi-4-mini-instruct       http://kovacs:1234/v1
lm-hermes-3    openai    hermes-3-llama-3.2-3b     http://kovacs:1234/v1
lm-groq        openai    llama-3-groq-8b-tool-use  http://kovacs:1234/v1
gpt-4o-mini    openai    gpt-4o-mini
gpt-4.1-nano   openai    gpt-4.1-nano
gpt-4.1-mini   openai    gpt-4.1-mini
gpt-4.1        openai    gpt-4.1
Programmatic Usage
You can import and use the core framework in your own Python code.
Exposed Interfaces
McpToolChat
Orchestrates LLM interaction with tools using a recursive loop.
Accepts any provider that implements the LLMProvider protocol from casual-llm. This means you can use casual-llm's built-in providers (OpenAI, Ollama) or create your own custom provider.
from casual_llm import LLMProvider, SystemMessage, UserMessage
from casual_mcp import McpToolChat
from casual_mcp.tool_cache import ToolCache
# provider can be any object implementing the LLMProvider protocol
# mcp_client is the multi-server FastMCP client (see load_mcp_client below)
tool_cache = ToolCache(mcp_client)
chat = McpToolChat(mcp_client, provider, system_prompt, tool_cache=tool_cache)
# The generate method takes a user prompt
response = await chat.generate("What time is it in London?")
# Generate method with session
response = await chat.generate("What time is it in London?", "my-session-id")
# The chat method takes a list of chat messages
# Note: if the messages include a system message, there is no need to pass a system prompt to the constructor
chat = McpToolChat(mcp_client, provider, tool_cache=tool_cache)
messages = [
SystemMessage(content="You are a cool dude who likes to help the user"),
UserMessage(content="What time is it in London?")
]
response = await chat.chat(messages)
# Get usage statistics from the last call
stats = chat.get_stats()
if stats:
print(f"Tokens used: {stats.tokens.total_tokens}")
print(f"Tool calls: {stats.tool_calls.total}")
print(f"LLM calls: {stats.llm_calls}")
Usage Statistics
After calling chat() or generate(), you can retrieve usage statistics via get_stats():
response = await chat.chat(messages)
stats = chat.get_stats()
# Token usage (accumulated across all LLM calls in the agentic loop)
stats.tokens.prompt_tokens # Input tokens
stats.tokens.completion_tokens # Output tokens
stats.tokens.total_tokens # Total (computed)
# Tool call stats
stats.tool_calls.by_tool # Dict of tool name -> call count, e.g. {"math_add": 2}
stats.tool_calls.by_server # Dict of server name -> call count, e.g. {"math": 2}
stats.tool_calls.total # Total tool calls (computed)
# LLM call count
stats.llm_calls # Number of LLM calls made (1 = no tools, 2+ = tool loop)
Stats are reset at the start of each new chat() or generate() call. Returns None if no calls have been made yet.
ProviderFactory
Instantiates LLM providers (from casual-llm) based on the selected model config.
from casual_mcp import ProviderFactory, load_config

config = load_config("casual_mcp_config.json")
provider_factory = ProviderFactory()
provider = provider_factory.get_provider("lm-qwen-3", config.models["lm-qwen-3"])
The factory returns an LLMProvider from casual-llm that can be used with McpToolChat.
Note: Tool catalogues are cached to avoid repeated ListTools calls. The cache refreshes every 30 seconds by default. Override this with the MCP_TOOL_CACHE_TTL environment variable (set to 0 or a negative value to cache indefinitely).
load_config
Loads your casual_mcp_config.json into a validated config object.
from casual_mcp import load_config
config = load_config("casual_mcp_config.json")
load_mcp_client
Creates a multi-server FastMCP client from the config object.
from casual_mcp import load_mcp_client
mcp_client = load_mcp_client(config)
Model and Server Configs
Exported from casual_mcp.models:
- StdioServerConfig
- RemoteServerConfig
- OpenAIModelConfig
- OllamaModelConfig
- ChatStats
- TokenUsageStats
- ToolCallStats
Use these types to build valid configs:
from casual_mcp.models import OpenAIModelConfig, OllamaModelConfig, StdioServerConfig
openai_model = OpenAIModelConfig(provider="openai", model="gpt-4.1")
ollama_model = OllamaModelConfig(provider="ollama", model="qwen2.5:7b-instruct", endpoint="http://localhost:11434")
server = StdioServerConfig(command="python", args=["time/server.py"])
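The optional server fields described in the Configuration File section (env, transport, system_prompt) can be set the same way. A sketch, assuming the typed configs accept those fields under the documented names; the WEATHER_API_KEY value is only a placeholder:
from casual_mcp.models import RemoteServerConfig, StdioServerConfig

# Local server run over stdio, with extra environment variables for the subprocess
weather_server = StdioServerConfig(
    command="python",
    args=["mcp-servers/weather/server.py"],
    env={"WEATHER_API_KEY": "your-key-here"},
)

# Remote server with an explicit transport (defaults to http when omitted)
words_server = RemoteServerConfig(url="http://localhost:3000/mcp", transport="sse")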
Chat Messages
Exported from casual_llm (re-exported from casual_mcp.models for backwards compatibility):
- AssistantMessage
- SystemMessage
- ToolResultMessage
- UserMessage
- ChatMessage
Use these types to build message chains:
from casual_llm import SystemMessage, UserMessage
messages = [
SystemMessage(content="You are a friendly tool calling assistant."),
UserMessage(content="What is the time?")
]
Example
from casual_llm import SystemMessage, UserMessage
from casual_mcp import McpToolChat, ProviderFactory, load_config, load_mcp_client
model = "gpt-4.1-nano"
messages = [
SystemMessage(content="""You are a tool calling assistant.
You have access to up-to-date information through the tools.
Respond naturally and confidently, as if you already know all the facts."""),
UserMessage(content="Will I need to take my umbrella to London today?")
]
# Load the Config from the File
config = load_config("casual_mcp_config.json")
# Setup the MCP Client
mcp_client = load_mcp_client(config)
# Get the Provider for the Model
provider_factory = ProviderFactory()
provider = provider_factory.get_provider(model, config.models[model])
# Perform the Chat and Tool calling
chat = McpToolChat(mcp_client, provider)
response_messages = await chat.chat(messages)
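The last entry in the returned list is the assistant's final reply (see Response Structure below), so the quickest way to inspect the result is:
# Print the assistant's final answer
print(response_messages[-1].content)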
๐๏ธ Architecture Overview
Casual MCP orchestrates a flow between LLMs and MCP tool servers:
- MCP Client connects to multiple tool servers (local via stdio or remote via HTTP/SSE)
- Tool Cache fetches and caches available tools from all connected servers
- Tool Conversion converts MCP tools to casual-llm's Tool format automatically
- ProviderFactory creates LLM providers from casual-llm based on model config
- McpToolChat orchestrates the recursive loop:
  - Sends messages + tools to the LLM provider
  - The LLM returns a response (potentially with tool calls)
  - Executes tool calls via the MCP client
  - Feeds results back to the LLM
  - Repeats until the LLM provides a final answer
+-------------+      +------------+      +----------------+
| MCP Servers |----->| Tool Cache |----->| Tool Converter |
+-------------+      +------------+      +----------------+
       |                                          |
       v                                          v
   +----------------------------------------------------+
   |                  McpToolChat Loop                   |
   |                                                     |
   |    LLM ---> Tool Calls ---> MCP                     |
   |     ^                        |                      |
   |     +------- Results <-------+                      |
   +----------------------------------------------------+
Tool Conversion
MCP tools are automatically converted from MCP's format to casual-llm's Tool format using the convert_tools module. This happens transparently in McpToolChat.chat() via tools_from_mcp().
Response Structure
The chat() and generate() methods return a list of ChatMessage objects (from casual-llm):
response_messages = await chat.chat(messages)
# Returns: list[ChatMessage]
# Each message can be:
# - AssistantMessage: LLM's response (content + optional tool_calls)
# - ToolResultMessage: Result from tool execution
# Access the final response:
final_answer = response_messages[-1].content
# Check for tool calls in any message:
for msg in response_messages:
if hasattr(msg, 'tool_calls') and msg.tool_calls:
# Message contains tool calls
for tool_call in msg.tool_calls:
print(f"Called: {tool_call.function.name}")
Common Patterns
Using Templates for Models Without Native Tool Support
Some models don't natively support tool calling. Use Jinja2 templates to format tools in the system prompt:
{
"models": {
"custom-model": {
"provider": "ollama",
"model": "some-model:7b",
"template": "custom-tool-format"
}
}
}
Create prompt-templates/custom-tool-format.j2:
You are a helpful assistant with access to these tools:
{% for tool in tools %}
- {{ tool.name }}: {{ tool.description }}
Parameters: {{ tool.inputSchema.properties | tojson }}
{% endfor %}
To use a tool, respond with JSON: {"tool": "tool_name", "args": {...}}
Formatting Tool Results
Control how tool results are presented to the LLM using TOOL_RESULT_FORMAT:
# Just the raw result
export TOOL_RESULT_FORMAT=result
# Function name → result
export TOOL_RESULT_FORMAT=function_result
# Example: "get_weather → Temperature: 72°F"
# Function with args → result
export TOOL_RESULT_FORMAT=function_args_result
# Example: "get_weather(location='London') → Temperature: 15°C"
Session Management
Important: Sessions are for testing/development only. In production, manage sessions in your own application.
Sessions are stored in-memory and cleared on server restart:
# Using sessions for development/testing
response = await chat.generate("What's the weather?", session_id="test-123")
response = await chat.generate("How about tomorrow?", session_id="test-123")
# For production: manage your own message history
messages = []
messages.append(UserMessage(content="What's the weather?"))
response_msgs = await chat.chat(messages)
messages.extend(response_msgs)
# Next turn
messages.append(UserMessage(content="How about tomorrow?"))
response_msgs = await chat.chat(messages)
Troubleshooting
Tool Not Found
If you see errors about tools not being found:
- Check MCP servers are running: casual-mcp servers
- List available tools: casual-mcp tools
- Check the tool cache TTL: tools are cached for 30 seconds by default. Wait or restart if you just added a server.
- Verify the server config: ensure command, args, or url are correct in your config
Provider Initialization Issues
OpenAI Provider:
# Ensure API key is set (even for local APIs)
export OPENAI_API_KEY=your-key-here
# For local OpenAI-compatible APIs (LM Studio, etc):
export OPENAI_API_KEY=dummy-key # Can be any string
Ollama Provider:
# Check Ollama is running
curl http://localhost:11434/api/version
# Ensure model is pulled
ollama pull qwen2.5:7b-instruct
Cache Refresh Behavior
Tools are cached with a 30-second TTL by default. If you add/remove MCP servers:
- Option 1: Wait 30 seconds for automatic refresh
- Option 2: Restart the application
- Option 3: Set MCP_TOOL_CACHE_TTL=0 for indefinite caching (refresh only on restart)
- Option 4: Set a shorter TTL like MCP_TOOL_CACHE_TTL=5 for 5-second refresh
Common Configuration Errors
// Wrong: missing required fields
{
"models": {
"my-model": {
"provider": "openai"
// Missing "model" field!
}
}
}
// Correct
{
"models": {
"my-model": {
"provider": "openai",
"model": "gpt-4.1"
}
}
}
// Wrong: invalid provider
{
"models": {
"my-model": {
"provider": "anthropic", // Not supported!
"model": "claude-3"
}
}
}
// Correct: supported providers
{
"models": {
"openai-model": {
"provider": "openai",
"model": "gpt-4.1"
},
"ollama-model": {
"provider": "ollama",
"model": "qwen2.5:7b"
}
}
}
API Usage
Start the API Server
casual-mcp serve --host 0.0.0.0 --port 8000
Chat
Endpoint: POST /chat
Request Body:
- model: the LLM model to use
- messages: list of chat messages (system, assistant, user, etc.) that you pass to the API, allowing you to keep your own chat session in the client calling the API
- include_stats: (optional, default: false) include usage statistics in the response
Example:
{
"model": "gpt-4.1-nano",
"messages": [
{
"role": "user",
"content": "can you explain what the word consistent means?"
}
],
"include_stats": true
}
Response with stats:
{
"messages": [...],
"response": "Consistent means...",
"stats": {
"tokens": {
"prompt_tokens": 150,
"completion_tokens": 75,
"total_tokens": 225
},
"tool_calls": {
"by_tool": {"words_define": 1},
"by_server": {"words": 1},
"total": 1
},
"llm_calls": 2
}
}
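For reference, the endpoint can also be called from Python. A minimal sketch, assuming the server is running locally on the default port; httpx is used only for illustration and is not a casual-mcp dependency:
import httpx

payload = {
    "model": "gpt-4.1-nano",
    "messages": [
        {"role": "user", "content": "can you explain what the word consistent means?"}
    ],
    "include_stats": True,
}

# POST to the chat endpoint and print the final answer
resp = httpx.post("http://localhost:8000/chat", json=payload, timeout=60)
resp.raise_for_status()
print(resp.json()["response"])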
Generate
The generate endpoint allows you to send a user prompt as a string.
It also supports sessions, which keep a record of all messages in the session and feed them back to the LLM for context. Sessions are stored in memory, so they are cleared when the server is restarted.
Endpoint: POST /generate
Request Body:
- model: the LLM model to use
- prompt: the user prompt
- session_id: an optional ID that stores all the messages from the session and provides them back to the LLM for context
- include_stats: (optional, default: false) include usage statistics in the response
Example:
{
"session_id": "my-session",
"model": "gpt-4o-mini",
"prompt": "can you explain what the word consistent means?",
"include_stats": true
}
Get Session
Get all the messages from a session
Endpoint: GET /generate/session/{session_id}
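As with /chat, these endpoints can be exercised with any HTTP client. A sketch assuming a local server on the default port, and assuming the /generate response mirrors the /chat response shape shown above; httpx is used only for illustration:
import httpx

base = "http://localhost:8000"

# Send a prompt, keeping context under a session ID
resp = httpx.post(
    f"{base}/generate",
    json={
        "session_id": "my-session",
        "model": "gpt-4o-mini",
        "prompt": "can you explain what the word consistent means?",
    },
    timeout=60,
)
print(resp.json()["response"])

# Retrieve the full message history for the session
history = httpx.get(f"{base}/generate/session/my-session").json()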
License
This software is released under the MIT License