Integrate RAG into AI Agents via MCP Server. Supports multiple Vector database technologies.

Vector Database - A2A | AG-UI | MCP

Version: 1.11.1

Overview

This is an MCP server implementation that provides a standardized collection management system across vector database technologies.

It was heavily inspired by the RAG implementation in Microsoft's Autogen V1 framework, reworked here as an MCP server.

AI Agents can:

Run hybrid (lexical and vector) searches for document information
Create collections from documents stored on the local filesystem or at URLs
Add documents to a collection
Use a collection for retrieval-augmented generation (RAG)
Delete a collection
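
Hybrid search has to merge a lexical ranking with a vector ranking. This README does not say how vector-mcp combines the two, so the sketch below uses reciprocal rank fusion (RRF), a common general-purpose technique, purely as an illustration. The function name, the `k` constant, and the document IDs are assumptions and not part of the package.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Merge several ranked lists of document IDs into one list.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. Illustrative only: not vector-mcp's own method.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by document ID for determinism.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical result lists from a lexical search and a vector search.
lexical_hits = ["doc_a", "doc_b", "doc_c"]
vector_hits = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([lexical_hits, vector_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, which is the behavior a hybrid search is generally after.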

Supports:

ChromaDB
PGVector
Couchbase
Qdrant
MongoDB

This repository is actively maintained - Contributions and bug reports are welcome!

Automated tests are planned

MCP

MCP Tools

Function Name	Description	Tag(s)
`create_collection`	Creates a new collection or retrieves an existing one in the vector database.	`collection_management`
`semantic_search`	Retrieves and gathers related knowledge from the vector database instance using the question variable.	`semantic_search`
`add_documents`	Adds documents to an existing collection in the vector database. This can be used to extend collections with additional documents.	`collection_management`
`delete_collection`	Deletes a collection from the vector database.	`collection_management`
`list_collections`	Lists all collections in the vector database.	`collection_management`
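
At the protocol level, an MCP client invokes any of these tools with a JSON-RPC 2.0 `tools/call` request. The sketch below only builds that request body. The tool name comes from the table above, but the argument names (`collection_name`, `document_directory`) are assumptions; the real input schema should be read from the server's `tools/list` response.

```python
import json


def build_tool_call(tool_name, arguments, request_id=1):
    """Return an MCP `tools/call` request as a JSON-RPC 2.0 dictionary."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Argument names are hypothetical; consult the tool's published schema.
request = build_tool_call(
    "create_collection",
    {
        "collection_name": "zapdos",
        "document_directory": "/home/user/Documents/Chroma",
    },
)
print(json.dumps(request, indent=2))
```

In practice an MCP client library sends this request for you; the payload is shown only to make the tool table concrete.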

A2A Agent

Architecture:

---
config:
  layout: dagre
---
flowchart TB
 subgraph subGraph0["Agent Capabilities"]
        C["Agent"]
        B["A2A Server - Uvicorn/FastAPI"]
        D["MCP Tools"]
        F["Agent Skills"]
  end
    C --> D & F
    A["User Query"] --> B
    B --> C
    D --> E["Platform API"]

     C:::agent
     B:::server
     A:::server
    classDef server fill:#f9f,stroke:#333
    classDef agent fill:#bbf,stroke:#333,stroke-width:2px
    style B stroke:#000000,fill:#FFD600
    style D stroke:#000000,fill:#BBDEFB
    style F fill:#BBDEFB
    style A fill:#C8E6C9
    style subGraph0 fill:#FFF9C4

Component Interaction Diagram

sequenceDiagram
    participant User
    participant Server as A2A Server
    participant Agent as Agent
    participant Skill as Agent Skills
    participant MCP as MCP Tools

    User->>Server: Send Query
    Server->>Agent: Invoke Agent
    Agent->>Skill: Analyze Skills Available
    Skill->>Agent: Provide Guidance on Next Steps
    Agent->>MCP: Invoke Tool
    MCP-->>Agent: Tool Response Returned
    Agent-->>Agent: Return Results Summarized
    Agent-->>Server: Final Response
    Server-->>User: Output

Usage

MCP CLI

Short Flag	Long Flag	Description
-h	--help	Display help information
-t	--transport	Transport method: 'stdio', 'http', or 'sse' [legacy] (default: stdio)
-s	--host	Host address for HTTP transport (default: 0.0.0.0)
-p	--port	Port number for HTTP transport (default: 8000)
	--auth-type	Authentication type: 'none', 'static', 'jwt', 'oauth-proxy', 'oidc-proxy', 'remote-oauth' (default: none)
	--token-jwks-uri	JWKS URI for JWT verification
	--token-issuer	Issuer for JWT verification
	--token-audience	Audience for JWT verification
	--oauth-upstream-auth-endpoint	Upstream authorization endpoint for OAuth Proxy
	--oauth-upstream-token-endpoint	Upstream token endpoint for OAuth Proxy
	--oauth-upstream-client-id	Upstream client ID for OAuth Proxy
	--oauth-upstream-client-secret	Upstream client secret for OAuth Proxy
	--oauth-base-url	Base URL for OAuth Proxy
	--oidc-config-url	OIDC configuration URL
	--oidc-client-id	OIDC client ID
	--oidc-client-secret	OIDC client secret
	--oidc-base-url	Base URL for OIDC Proxy
	--remote-auth-servers	Comma-separated list of authorization servers for Remote OAuth
	--remote-base-url	Base URL for Remote OAuth
	--allowed-client-redirect-uris	Comma-separated list of allowed client redirect URIs
	--eunomia-type	Eunomia authorization type: 'none', 'embedded', 'remote' (default: none)
	--eunomia-policy-file	Policy file for embedded Eunomia (default: mcp_policies.json)
	--eunomia-remote-url	URL for remote Eunomia server

Using as an MCP Server

The MCP server is typically run in one of two modes: stdio (for local testing) or http (for networked access). A legacy sse transport is also available. To start the server, use one of the following commands:

Run in stdio mode (default):

vector-mcp --transport "stdio"

Run in HTTP mode:

vector-mcp --transport "http" --host "0.0.0.0" --port "8000"
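
When the server is launched from a script rather than by hand, the same commands can be assembled programmatically. The following is a minimal sketch that uses only flags from the CLI table above; the helper name `build_server_command` is hypothetical, and applying `--host` and `--port` to the legacy sse transport as well as http is an assumption.

```python
import shlex

VALID_TRANSPORTS = ("stdio", "http", "sse")


def build_server_command(transport="stdio", host="0.0.0.0", port=8000):
    """Build the vector-mcp argument list from the documented CLI flags."""
    if transport not in VALID_TRANSPORTS:
        raise ValueError(f"unsupported transport: {transport!r}")
    command = ["vector-mcp", "--transport", transport]
    # Host and port are only added for non-stdio transports (assumption).
    if transport != "stdio":
        command += ["--host", host, "--port", str(port)]
    return command


command = build_server_command("http", port=8000)
print(shlex.join(command))
```

The returned list can be passed directly to `subprocess.Popen(command)`, which avoids shell quoting issues.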

Creating Collection

AI Prompt:

Create a collection called zapdos with the documents in this directory:
/home/user/Documents/Chroma

AI Response:

The collection named "zapdos" has been successfully created or retrieved from the vector database
using the documents in the directory /home/user/Documents/Chroma.
Let me know if you'd like to perform any further actions, such as querying the collection or adding more documents!

Retrieving from Collection

AI Prompt:

Create a collection called zapdos with the documents in this directory:
/home/user/Documents/Chroma

AI Response:

The collection named "zapdos" has been successfully created or retrieved from the vector database
using the documents in the directory /home/user/Documents/Chroma.
Let me know if you'd like to perform any further actions, such as querying the collection or adding more documents!

Deleting Collection

AI Prompt:

Delete the collection called memory

AI Response:

The collection named "memory" has been successfully deleted.
Let me know if you'd like to create a new collection or perform any other actions!

A2A CLI

Endpoints

Web UI: http://localhost:8000/ (if enabled)
A2A: http://localhost:8000/a2a (Discovery: /a2a/.well-known/agent.json)
AG-UI: http://localhost:8000/ag-ui (POST)
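
The endpoint paths above are fixed relative to the server's base URL, so a client can derive them from one setting. This is a small sketch under that assumption; the helper name and the dictionary keys are hypothetical. Pass whichever host and port the server is actually bound to, since the `--port` default in the flag table differs from the port used in the list above.

```python
from urllib.parse import urljoin


def agent_endpoints(base_url):
    """Derive the documented endpoint URLs from a server base URL."""
    base = base_url.rstrip("/") + "/"
    return {
        "web_ui": base,
        "a2a": urljoin(base, "a2a"),
        "agent_card": urljoin(base, "a2a/.well-known/agent.json"),
        "ag_ui": urljoin(base, "ag-ui"),
    }


endpoints = agent_endpoints("http://localhost:8000")
for name, url in endpoints.items():
    print(f"{name}: {url}")
```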

Short Flag	Long Flag	Description
-h	--help	Display help information
	--host	Host to bind the server to (default: 0.0.0.0)
	--port	Port to bind the server to (default: 9000)
	--reload	Enable auto-reload
	--provider	LLM Provider: 'openai', 'anthropic', 'google', 'huggingface'
	--model-id	LLM Model ID (default: nvidia/nemotron-3-super)
	--base-url	LLM Base URL (for OpenAI compatible providers)
	--api-key	LLM API Key
	--mcp-url	MCP Server URL (default: http://localhost:8000/mcp)
	--web	Enable Pydantic AI Web UI

Deploy MCP Server as a Service

The MCP server can be deployed using Docker, with configurable authentication, middleware, and Eunomia authorization.

Using Docker Run

docker pull knucklessg1/vector-mcp:latest

docker run -d \
  --name vector-mcp \
  -p 8004:8004 \
  -e HOST=0.0.0.0 \
  -e PORT=8004 \
  -e TRANSPORT=http \
  -e AUTH_TYPE=none \
  -e EUNOMIA_TYPE=none \
  knucklessg1/vector-mcp:latest

For advanced authentication (e.g., JWT, OAuth Proxy, OIDC Proxy, Remote OAuth) or Eunomia, add the relevant environment variables:

docker run -d \
  --name vector-mcp \
  -p 8004:8004 \
  -e HOST=0.0.0.0 \
  -e PORT=8004 \
  -e TRANSPORT=http \
  -e AUTH_TYPE=oidc-proxy \
  -e OIDC_CONFIG_URL=https://provider.com/.well-known/openid-configuration \
  -e OIDC_CLIENT_ID=your-client-id \
  -e OIDC_CLIENT_SECRET=your-client-secret \
  -e OIDC_BASE_URL=https://your-server.com \
  -e ALLOWED_CLIENT_REDIRECT_URIS=http://localhost:*,https://*.example.com/* \
  -e EUNOMIA_TYPE=embedded \
  -e EUNOMIA_POLICY_FILE=/app/mcp_policies.json \
  knucklessg1/vector-mcp:latest
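
The environment variables in these examples appear to mirror the long CLI flags: drop the leading dashes, replace the remaining dashes with underscores, and uppercase the result. That mapping is inferred from the examples here and is not confirmed by the project, so treat the sketch below as a convenience for generating `docker run` arguments rather than a statement of how the image reads its configuration. Both helper names are hypothetical.

```python
def flag_to_env(flag):
    """Convert a long flag such as --oidc-config-url to OIDC_CONFIG_URL."""
    return flag.lstrip("-").replace("-", "_").upper()


def docker_env_args(settings):
    """Turn {flag: value} pairs into `-e NAME=value` arguments for docker run."""
    args = []
    for flag, value in settings.items():
        args += ["-e", f"{flag_to_env(flag)}={value}"]
    return args


settings = {
    "--transport": "http",
    "--auth-type": "oidc-proxy",
    "--oidc-config-url": "https://provider.com/.well-known/openid-configuration",
}
env_args = docker_env_args(settings)
print(" ".join(env_args))
```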

Using Docker Compose

Create a docker-compose.yml file:

services:
  vector-mcp:
    image: knucklessg1/vector-mcp:latest
    environment:
      - HOST=0.0.0.0
      - PORT=8004
      - TRANSPORT=http
      - AUTH_TYPE=none
      - EUNOMIA_TYPE=none
    ports:
      - 8004:8004

For advanced setups with authentication and Eunomia:

services:
  vector-mcp:
    image: knucklessg1/vector-mcp:latest
    environment:
      - HOST=0.0.0.0
      - PORT=8004
      - TRANSPORT=http
      - AUTH_TYPE=oidc-proxy
      - OIDC_CONFIG_URL=https://provider.com/.well-known/openid-configuration
      - OIDC_CLIENT_ID=your-client-id
      - OIDC_CLIENT_SECRET=your-client-secret
      - OIDC_BASE_URL=https://your-server.com
      - ALLOWED_CLIENT_REDIRECT_URIS=http://localhost:*,https://*.example.com/*
      - EUNOMIA_TYPE=embedded
      - EUNOMIA_POLICY_FILE=/app/mcp_policies.json
    ports:
      - 8004:8004
    volumes:
      - ./mcp_policies.json:/app/mcp_policies.json

Run the service:

docker-compose up -d

Configure `mcp.json` for AI Integration

All three `env` entries are optional.

{
  "mcpServers": {
    "vector_mcp": {
      "command": "uv",
      "args": [
        "run",
        "--with",
        "vector-mcp",
        "vector-mcp"
      ],
      "env": {
        "DATABASE_TYPE": "chromadb",
        "COLLECTION_NAME": "memory",
        "DOCUMENT_DIRECTORY": "/home/user/Documents/"
      },
      "timeout": 300000
    }
  }
}
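
Strict JSON parsers reject comments and trailing commas, so one way to be sure the file parses is to generate it. The sketch below writes the same structure with `json.dumps` and leaves out any optional `env` entry that is not set. The function name and its parameters are hypothetical; the keys and values are taken from the configuration above.

```python
import json


def build_mcp_config(database_type=None, collection_name=None, document_directory=None):
    """Return the mcp.json structure, omitting unset optional env entries."""
    env = {
        "DATABASE_TYPE": database_type,
        "COLLECTION_NAME": collection_name,
        "DOCUMENT_DIRECTORY": document_directory,
    }
    return {
        "mcpServers": {
            "vector_mcp": {
                "command": "uv",
                "args": ["run", "--with", "vector-mcp", "vector-mcp"],
                "env": {key: value for key, value in env.items() if value is not None},
                "timeout": 300000,
            }
        }
    }


config = build_mcp_config(database_type="chromadb", collection_name="memory")
text = json.dumps(config, indent=2)
print(text)
```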

Security & Governance

This project is built on agent-utilities, inheriting enterprise-grade security and governance features.

Authentication & Authorization

Feature	Description
OIDC Token Delegation	RFC 8693 token exchange for user-context propagation from A2A → MCP
Eunomia Policies	Fine-grained, policy-driven tool authorization (`none`, `embedded`, `remote`)
Scoped Credentials	Tools execute with the caller's scoped identity where possible
3LO / OAuth / API Token	Multiple auth strategies with graceful fallback

Eunomia Policy Enforcement

Eunomia provides a policy enforcement point for all tool calls:

Embedded mode: Load local mcp_policies.json for role-based access, sensitivity gating, and audit logging
Remote mode: Forward authorization decisions to a central Eunomia policy server for multi-agent governance
Enable via CLI: --eunomia-type embedded --eunomia-policy-file mcp_policies.json
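
Each Eunomia mode needs a different companion flag. A minimal sketch of selecting the right arguments follows, using only the flags and defaults from the CLI table. The helper name is hypothetical, the rule that remote mode requires a URL is an assumption, and the example URL is a placeholder.

```python
def eunomia_args(eunomia_type="none", policy_file="mcp_policies.json", remote_url=None):
    """Return the Eunomia-related CLI arguments for the chosen mode."""
    if eunomia_type == "none":
        return ["--eunomia-type", "none"]
    if eunomia_type == "embedded":
        # Embedded mode reads policies from a local file.
        return ["--eunomia-type", "embedded", "--eunomia-policy-file", policy_file]
    if eunomia_type == "remote":
        # Assumption: remote mode is not usable without a server URL.
        if not remote_url:
            raise ValueError("remote mode needs a Eunomia server URL")
        return ["--eunomia-type", "remote", "--eunomia-remote-url", remote_url]
    raise ValueError(f"unknown Eunomia type: {eunomia_type!r}")


embedded = eunomia_args("embedded")
remote = eunomia_args("remote", remote_url="https://eunomia.example.com")
print(embedded)
print(remote)
```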

Runtime Protections

Protection	Description
Tool Guard	Sensitivity detection with human-in-the-loop approval gating
Prompt Injection Defense	Input scanning and repetition/loop guards
Content Filtering	Output schema enforcement and cost budget controls
Stuck Loop Detection	Automatic detection and recovery from agent loops
Context Limit Warnings	Proactive alerts before context window exhaustion

Graph Agent Architecture

The A2A agent uses pydantic-graph orchestration with:

RouterNode: Lightweight classifier that routes queries to specialized domains
DomainNode: Focused executor with only relevant tools loaded, preventing tool hallucination
Approval Gates: Policy-driven approval workflows before sensitive operations
Usage Guards: Budget and rate limiting enforcement
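
The router and domain split can be pictured with a few lines of plain Python. This is a conceptual sketch only: it does not use pydantic-graph, the keyword classifier stands in for whatever classifier RouterNode really uses, and the names `DOMAIN_TOOLS`, `SEARCH_KEYWORDS`, `route`, and `tools_for` are hypothetical. The tool names are taken from this README's tool tables.

```python
DOMAIN_TOOLS = {
    "collection_management": [
        "create_collection",
        "add_documents",
        "delete_collection",
        "list_collections",
    ],
    "search": ["semantic_search", "lexical_search", "search"],
}

# Hypothetical trigger words; a real router would likely use a model.
SEARCH_KEYWORDS = ("find", "search", "what", "who", "how", "why")


def route(query):
    """Classify a query into a domain (stand-in for RouterNode)."""
    words = [word.strip("?.,!") for word in query.lower().split()]
    if any(word in SEARCH_KEYWORDS for word in words):
        return "search"
    return "collection_management"


def tools_for(query):
    """Expose only the routed domain's tools (stand-in for DomainNode)."""
    return DOMAIN_TOOLS[route(query)]


print(tools_for("Delete the collection called memory"))
print(tools_for("What does the report say about latency?"))
```

Because each domain sees only its own tools, a search query can never trigger `delete_collection`, which is the point of loading a restricted tool set.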

Production Recommendation: Enable --eunomia-type embedded (or remote) + OIDC delegation + containerized deployment. See agent-utilities documentation for full policy configuration.

Install Python Package

python -m pip install vector-mcp

PGVector dependencies

python -m pip install "vector-mcp[postgres]"

All

python -m pip install "vector-mcp[all]"

uv pip install --upgrade "vector-mcp[all]"

Repository Owners

Special shoutouts to Microsoft Autogen V1 ♥️

MCP Configuration Examples

stdio (recommended for local development)

{
  "mcpServers": {
    "vector": {
      "command": ".venv/bin/vector-mcp",
      "args": [],
      "env": {}
    }
  }
}

Streamable HTTP (recommended for production)

{
  "mcpServers": {
    "vector": {
      "url": "http://localhost:8080/vector-mcp/mcp"
    }
  }
}

Available MCP Tools

This server implements an action-routed dynamic tool architecture, consolidating operations into categorized tools.

Tool Name	Action	Description
`vector_collection_management`	`create_collection`	Executes `create_collection` within the `collection_management` category.
`vector_collection_management`	`add_documents`	Executes `add_documents` within the `collection_management` category.
`vector_collection_management`	`delete_collection`	Executes `delete_collection` within the `collection_management` category.
`vector_collection_management`	`list_collections`	Executes `list_collections` within the `collection_management` category.
`vector_search`	`semantic_search`	Executes `semantic_search` within the `search` category.
`vector_search`	`lexical_search`	Executes `lexical_search` within the `search` category.
`vector_search`	`search`	Executes `search` within the `search` category.
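
Action routing means a single tool accepts an `action` argument and forwards the call to the matching handler. The sketch below shows that pattern with an in-memory dictionary in place of a vector database. The handlers, their parameters, and their return values are hypothetical; only the tool and action names come from the table above.

```python
COLLECTIONS = {}  # in-memory stand-in for a vector database


def create_collection(name):
    COLLECTIONS.setdefault(name, [])
    return f"collection {name!r} ready"


def add_documents(name, documents):
    COLLECTIONS[name].extend(documents)
    return f"{len(documents)} document(s) added to {name!r}"


def delete_collection(name):
    existed = COLLECTIONS.pop(name, None) is not None
    return f"collection {name!r} deleted" if existed else f"collection {name!r} not found"


def list_collections():
    return sorted(COLLECTIONS)


ACTIONS = {
    "create_collection": create_collection,
    "add_documents": add_documents,
    "delete_collection": delete_collection,
    "list_collections": list_collections,
}


def vector_collection_management(action, **params):
    """Single tool entry point that routes to a handler by action name."""
    try:
        handler = ACTIONS[action]
    except KeyError:
        raise ValueError(f"unknown action: {action!r}") from None
    return handler(**params)


vector_collection_management("create_collection", name="zapdos")
vector_collection_management("add_documents", name="zapdos", documents=["a.txt"])
names = vector_collection_management("list_collections")
print(names)
```

Consolidating actions this way keeps the number of tools shown to the model small while still covering every operation.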
