flowllm

FlowLLM: Simplifying LLM-based HTTP/MCP Service Development

These details have not been verified by PyPI

Project links

Project description

FlowLLM Logo

FlowLLM: Simplifying LLM-based HTTP/MCP Service Development
_{If you find it useful, please give us a ⭐ Star. Your support drives our continuous improvement.}

English | 简体中文

📖 Introduction

FlowLLM encapsulates LLM, Embedding, and vector_store capabilities as HTTP/MCP services. It is suitable for AI assistants, RAG applications, and workflow services, and can be integrated into MCP-compatible client tools.

🏗️ Architecture Overview

FlowLLM Framework

🌟 Applications Based on FlowLLM

Project Name	Description
ReMe	Memory management toolkit for agents

📢 Recent Updates

Date	Update Content
2025-11-15	Added File Tool Op feature with 13 file operation tools, supporting file reading, writing, editing, searching, directory operations, system command execution, and task management
2025-11-14	Added Token counting capability, supporting accurate calculation of token counts for messages and tools via `self.token_count()` method, with support for multiple backends (base, openai, hf). See configuration examples in default.yaml

📚 Learning Resources

Project developers will share their latest learning materials here.

Date	Title	Description
2025-11-24	Mem-PAL: Memory-Augmented Personalized Assistant	Mem-PAL: Memory-Augmented Personalized Assistant with Log-based Structured Memory
2025-11-14	HaluMem Analysis	HaluMem: Evaluating Hallucinations in Memory Systems of Agents Analysis
2025-11-13	Gemini CLI Context Management Mechanism	Multi-layer Context Management Strategy for Gemini CLI
2025-11-10	Context Management Guide	Context Management Guide
2025-11-10	LangChain&Manus Video Materials	LangChain & Manus Context Management Video

⭐ Core Features

Simple Op Development: Inherit from BaseOp or BaseAsyncOp and implement your business logic. FlowLLM provides lazy-initialized LLM, Embedding models, and vector stores accessible via self.llm, self.embedding_model, and self.vector_store. It also offers prompt template management through prompt_format() and get_prompt() methods. Additionally, FlowLLM includes built-in token counting capabilities. Use self.token_count() to accurately calculate token counts for messages and tools, supporting multiple backends (base, openai, hf, etc.).
Flexible Flow Orchestration: Compose Ops into Flows via YAML configuration. >> denotes serial composition; | denotes parallel composition. For example, SearchOp() >> (AnalyzeOp() | TranslateOp()) >> FormatOp() builds complex workflows. Define input/output schemas and start the service with flowllm config=your_config.
Automatic Service Generation: FlowLLM automatically generates HTTP, MCP, and CMD services. The HTTP service provides RESTful APIs with synchronous JSON and HTTP Stream responses. The MCP service registers as Model Context Protocol tools for MCP-compatible clients. The CMD service executes a single Op in command-line mode for quick testing and debugging.

⚡ Quick Start

📦 Step0 Installation

📥 From PyPI

pip install flowllm

🔧 From Source

git clone https://github.com/flowllm-ai/flowllm.git
cd flowllm
pip install -e .

For detailed installation and configuration, refer to the Installation Guide.

⚙️ Configuration

Create a .env file and configure your API keys. Copy from example.env and modify:

cp example.env .env

Configure your API keys in the .env file:

FLOW_LLM_API_KEY=sk-xxxx
FLOW_LLM_BASE_URL=https://xxxx/v1
FLOW_EMBEDDING_API_KEY=sk-xxxx
FLOW_EMBEDDING_BASE_URL=https://xxxx/v1

For detailed configuration, refer to the Configuration Guide.

🛠️ Step1 Build Op

from flowllm.core.context import C
from flowllm.core.op import BaseAsyncOp
from flowllm.core.schema import Message
from flowllm.core.enumeration import Role

@C.register_op()
class SimpleChatOp(BaseAsyncOp):
    async def async_execute(self):
        query = self.context.get("query", "")
        messages = [Message(role=Role.USER, content=query)]

        # Use token_count method to calculate token count
        token_num = self.token_count(messages)
        print(f"Input tokens: {token_num}")

        response = await self.llm.achat(messages=messages)
        self.context.response.answer = response.content.strip()

For details, refer to the Simple Op Guide, LLM Op Guide, and Advanced Op Guide (including Embedding, VectorStore, and concurrent execution).

📝 Step2 Configure Config

The following example demonstrates building an MCP (Model Context Protocol) service. Create a configuration file my_mcp_config.yaml:

backend: mcp

mcp:
  transport: sse
  host: "0.0.0.0"
  port: 8001

flow:
  demo_mcp_flow:
    flow_content: MockSearchOp()
    description: "Search results for a given query."
    input_schema:
      query:
        type: string
        description: "User query"
        required: true

llm:
  default:
    backend: openai_compatible
    model_name: qwen3-30b-a3b-instruct-2507
    params:
      temperature: 0.6
    token_count: # Optional, configure token counting backend
      model_name: Qwen/Qwen3-30B-A3B-Instruct-2507
      backend: hf  # Supports base, openai, hf, etc.
      params:
        use_mirror: true

🚀 Step3 Start MCP Service

flowllm \
  config=my_mcp_config \
  backend=mcp \  # Optional, overrides config
  mcp.transport=sse \  # Optional, overrides config
  mcp.port=8001 \  # Optional, overrides config
  llm.default.model_name=qwen3-30b-a3b-thinking-2507  # Optional, overrides config

After the service starts, refer to the Client Guide to use the service and obtain the tool_call required by the model.

📚 Detailed Documentation

🚀 Getting Started

🔧 Op Development

🔀 Flow Orchestration

Flow Guide

🌐 Service Usage

🤝 Contributing

Contributions of all forms are welcome! For participation methods, refer to the Contribution Guide.

📄 License

This project is licensed under the Apache 2.0 license.

Star 历史

GitHub • Documentation • PyPI

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2.0.10

Jan 7, 2026

0.2.0.9

Jan 6, 2026

0.2.0.8

Dec 12, 2025

0.2.0.7

Dec 9, 2025

0.2.0.6

Dec 9, 2025

0.2.0.5

Nov 27, 2025

0.2.0.4

Nov 26, 2025

0.2.0.3

Nov 22, 2025

0.2.0.2

Nov 17, 2025

0.2.0.1

Nov 13, 2025

0.2.0.0

Nov 10, 2025

0.1.11.6

Oct 28, 2025

0.1.11.5

Oct 27, 2025

0.1.11.4

Oct 27, 2025

0.1.11.3

Oct 23, 2025

0.1.11.2

Oct 22, 2025

0.1.11.1

Oct 21, 2025

0.1.11

Oct 21, 2025

0.1.10

Sep 16, 2025

0.1.9

Sep 16, 2025

0.1.8

Sep 8, 2025

0.1.7

Sep 8, 2025

0.1.6

Sep 7, 2025

0.1.5

Sep 7, 2025

0.1.3

Aug 31, 2025

0.1.2

Aug 27, 2025

0.1.1

Aug 20, 2025

0.1.0

Aug 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flowllm-0.2.0.10.tar.gz (183.2 kB view details)

Uploaded Jan 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

flowllm-0.2.0.10-py3-none-any.whl (222.4 kB view details)

Uploaded Jan 7, 2026 Python 3

File details

Details for the file flowllm-0.2.0.10.tar.gz.

File metadata

Download URL: flowllm-0.2.0.10.tar.gz
Upload date: Jan 7, 2026
Size: 183.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for flowllm-0.2.0.10.tar.gz
Algorithm	Hash digest
SHA256	`5f9aeb3bbe9d52440490d91303a3970a66d880371b9a22265c8b9d719429ae64`
MD5	`fed97d0da10b8010132afd0e76721eda`
BLAKE2b-256	`a6cd4a8607f186905d1072ff5f9cc94f542fb8be03ad757ce44f220c02670a61`

See more details on using hashes here.

File details

Details for the file flowllm-0.2.0.10-py3-none-any.whl.

File metadata

Download URL: flowllm-0.2.0.10-py3-none-any.whl
Upload date: Jan 7, 2026
Size: 222.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for flowllm-0.2.0.10-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ea095ea55a404087cb1294548c19c83f32bb5bb25c18d8ea3d7f1c8162d69b73`
MD5	`f92d307156a04b4363d1da722150c901`
BLAKE2b-256	`4e3361bd83bcb8230df20e12e5e3a1d099c984e2904646b08b5523dbfbe4220c`

See more details on using hashes here.

flowllm 0.2.0.10

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

📖 Introduction

🏗️ Architecture Overview

🌟 Applications Based on FlowLLM

📢 Recent Updates

📚 Learning Resources

⭐ Core Features

⚡ Quick Start

📦 Step0 Installation

📥 From PyPI

🔧 From Source

⚙️ Configuration

🛠️ Step1 Build Op

📝 Step2 Configure Config

🚀 Step3 Start MCP Service

📚 Detailed Documentation

🚀 Getting Started

🔧 Op Development

🔀 Flow Orchestration

🌐 Service Usage

🤝 Contributing

📄 License

Star 历史

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes