
WET - Web Extended Toolkit MCP Server

mcp-name: io.github.n24q02m/wet-mcp

Open-source MCP Server for web search, content extraction, library docs & multimodal analysis.


Features

  • Web Search -- Embedded SearXNG metasearch (Google, Bing, DuckDuckGo, Brave) with filters, semantic reranking, query expansion, and snippet enrichment
  • Academic Research -- Search Google Scholar, Semantic Scholar, arXiv, PubMed, CrossRef, BASE
  • Library Docs -- Auto-discover and index documentation with FTS5 hybrid search, HyDE-enhanced retrieval, and version-specific docs
  • Content Extract -- Clean content extraction (Markdown/Text), structured data extraction (LLM + JSON Schema), batch processing (up to 50 URLs), deep crawling, site mapping
  • Local File Conversion -- Convert PDF, DOCX, XLSX, CSV, HTML, EPUB, PPTX to Markdown
  • Media -- List, download, and analyze images, videos, audio files
  • Anti-bot -- Stealth mode bypasses bot protections on Cloudflare-fronted sites, Medium, LinkedIn, Twitter
  • Zero Config -- Built-in local Qwen3 embedding + reranking, no API keys needed. Optional cloud providers (Jina AI, Gemini, OpenAI, Cohere)
  • Sync -- Cross-machine sync of indexed docs via Google Drive (OAuth Device Code, no browser redirect)

Quick Start

Claude Code Plugin (Recommended)

Via marketplace (includes skills: /fact-check, /compare):

/plugin marketplace add n24q02m/claude-plugins
/plugin install wet-mcp@n24q02m-plugins

Configure env vars in ~/.claude/settings.local.json or shell profile. See Environment Variables.

Gemini CLI Extension

gemini extensions install https://github.com/n24q02m/wet-mcp

Codex CLI

Add to ~/.codex/config.toml:

[mcp_servers.wet]
command = "uvx"
args = ["--python", "3.13", "wet-mcp"]

MCP Server

Python 3.13 required -- Python 3.14+ is not supported due to SearXNG incompatibility. You must specify --python 3.13 when using uvx.

On first run, the server automatically installs SearXNG, Playwright chromium, and starts the embedded search engine.

Option 1: uvx

{
  "mcpServers": {
    "wet": {
      "command": "uvx",
      "args": ["--python", "3.13", "wet-mcp@latest"]
    }
  }
}

Option 2: Docker

{
  "mcpServers": {
    "wet": {
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "--name", "mcp-wet",
        "-v", "wet-data:/data",
        "-e", "API_KEYS",
        "-e", "GITHUB_TOKEN",
        "-e", "SYNC_ENABLED",
        "n24q02m/wet-mcp:latest"
      ]
    }
  }
}

Configure env vars in ~/.claude/settings.local.json or your shell profile. See Environment Variables below.

Tools

| Tool | Actions | Description |
| --- | --- | --- |
| search | search, research, docs, similar | Web search (with filters, reranking, expand/enrich), academic research, library docs (HyDE), find similar |
| extract | extract, batch, crawl, map, convert, extract_structured | Content extraction, batch processing (up to 50 URLs), deep crawling, site mapping, local file conversion, structured data extraction (JSON Schema) |
| media | list, download, analyze | Media discovery, download, and analysis |
| config | status, set, cache_clear, docs_reindex | Server configuration and cache management |
| setup | warmup, setup_sync | Pre-download models, configure cloud sync |
| help | -- | Full documentation for any tool |
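As a sketch of how these map to tool calls (the action names come from the table above; the parameter names here are illustrative, not the server's exact schema):

```
search(action="docs", query="configure request retries", library="httpx")
extract(action="batch", urls=["https://example.com/a", "https://example.com/b"])
```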

MCP Prompts

| Prompt | Parameters | Description |
| --- | --- | --- |
| research_topic | topic | Research a topic using academic search |
| library_docs | library, question | Find library documentation |

Zero-Config Setup

No environment variables needed. On first start, the server opens a setup page in your browser:

  1. Start the server (via plugin, uvx, or Docker)
  2. A setup URL appears -- open it in any browser
  3. Fill in your credentials on the guided form
  4. Credentials are encrypted and stored locally

Your credentials never leave your machine. The relay server only sees encrypted data.

For CI/automation, you can still use environment variables (see below).

Configuration

Pre-install (optional)

Use the setup MCP tool to warm up models and install dependencies:

# Via MCP tool call (recommended):
setup(action="warmup")

# With cloud embedding configured, warmup validates API keys
# and skips local model download if cloud models are available.

The warmup action pre-downloads SearXNG, Playwright, and the embedding/reranker models (~1.1 GB total) so the first real connection does not time out.

Sync setup

Sync uses Google Drive with OAuth Device Code flow (no browser redirect needed):

  1. Configure: Set SYNC_ENABLED=true, GOOGLE_DRIVE_CLIENT_ID, and GOOGLE_DRIVE_CLIENT_SECRET
  2. First sync: Run setup(action="setup_sync") -- visit URL and enter code
  3. Token saved: OAuth token is stored locally at ~/.wet-mcp/tokens/ (600 permissions)
  4. Subsequent runs: Token is loaded automatically, auto-refreshed when expired

Example settings:

{
  "SYNC_ENABLED": "true",
  "GOOGLE_DRIVE_CLIENT_ID": "your-client-id.apps.googleusercontent.com",
  "GOOGLE_DRIVE_CLIENT_SECRET": "your-client-secret"
}

Environment Variables

| Variable | Required | Default | Description |
| --- | --- | --- | --- |
| API_KEYS | No | -- | API keys for cloud providers (format: ENV_VAR:key,...). Enables cloud embedding + reranking |
| COHERE_API_KEY | No | -- | Cohere API key (embedding + reranking) |
| JINA_AI_API_KEY | No | -- | Jina AI API key (embedding + reranking) |
| GEMINI_API_KEY | No | -- | Google Gemini API key (LLM + embedding) |
| OPENAI_API_KEY | No | -- | OpenAI API key (LLM + embedding) |
| GITHUB_TOKEN | No | auto-detect | GitHub token for docs discovery (60 -> 5000 req/hr). Auto-detected from gh auth token |
| EMBEDDING_BACKEND | No | auto-detect | cloud or local (Qwen3). Auto: API_KEYS -> cloud, else local |
| EMBEDDING_MODEL | No | auto-detect | Cloud embedding model name |
| EMBEDDING_DIMS | No | 0 (auto=768) | Embedding dimensions |
| RERANK_ENABLED | No | true | Enable reranking after search |
| RERANK_BACKEND | No | auto-detect | cloud or local. Auto: Cohere/Jina key -> cloud, else local |
| RERANK_MODEL | No | auto-detect | Cloud rerank model name |
| RERANK_TOP_N | No | 10 | Return top N results after reranking |
| LLM_MODELS | No | gemini-3-flash-preview | LLM model for media analysis (google-genai or openai) |
| WET_AUTO_SEARXNG | No | true | Auto-start embedded SearXNG subprocess |
| WET_SEARXNG_PORT | No | 41592 | SearXNG port |
| SEARXNG_URL | No | http://localhost:41592 | External SearXNG URL (when auto disabled) |
| SEARXNG_TIMEOUT | No | 30 | SearXNG request timeout in seconds |
| CONVERT_MAX_FILE_SIZE | No | 104857600 | Max file size for local conversion in bytes (100 MB) |
| CONVERT_ALLOWED_DIRS | No | -- | Comma-separated paths to restrict local file conversion |
| CACHE_DIR | No | ~/.wet-mcp | Data directory for cache, docs, downloads |
| DOCS_DB_PATH | No | ~/.wet-mcp/docs.db | Docs database location |
| DOWNLOAD_DIR | No | ~/.wet-mcp/downloads | Media download directory |
| TOOL_TIMEOUT | No | 120 | Tool execution timeout in seconds (0 = no timeout) |
| WET_CACHE | No | true | Enable/disable web cache |
| SYNC_ENABLED | No | false | Enable Google Drive sync |
| GOOGLE_DRIVE_CLIENT_ID | No | -- | OAuth client ID (required for sync) |
| GOOGLE_DRIVE_CLIENT_SECRET | No | -- | OAuth client secret (required for sync) |
| SYNC_FOLDER | No | wet-mcp | Google Drive folder name |
| SYNC_INTERVAL | No | 300 | Auto-sync interval in seconds (0 = manual) |
| LOG_LEVEL | No | INFO | Logging level |
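The API_KEYS variable packs several provider keys into a single value using the documented ENV_VAR:key,... format. A minimal sketch of parsing that format (assumed semantics: comma-separated ENV_VAR:key pairs, the server's actual parser may differ):

```python
def parse_api_keys(value: str) -> dict[str, str]:
    """Split an "ENV_VAR:key,ENV_VAR:key" string into an {env_var: key} mapping."""
    pairs: dict[str, str] = {}
    for item in value.split(","):
        if not item.strip():
            continue  # tolerate trailing commas
        name, _, key = item.partition(":")
        pairs[name.strip()] = key.strip()
    return pairs

# Example with placeholder keys:
print(parse_api_keys("JINA_AI_API_KEY:jina_abc123,COHERE_API_KEY:co_xyz"))
```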

Embedding & Reranking

Both embedding and reranking are always available -- local models are built-in and require no configuration.

  • Jina AI (recommended): A single JINA_AI_API_KEY enables both embedding and reranking
  • Embedding priority: Jina AI > Gemini > OpenAI > Cohere. Local Qwen3 fallback always available
  • Reranking priority: Jina AI > Cohere. Local Qwen3 fallback always available
  • GPU auto-detection: CUDA/DirectML auto-detected, uses GGUF models for better performance
  • All embeddings stored at 768 dims. Switching providers never breaks the vector table
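The priority rules above can be sketched as a simple first-match lookup (an illustrative model of the documented behavior, not the server's actual source):

```python
# Documented priority: embedding Jina AI > Gemini > OpenAI > Cohere;
# reranking Jina AI > Cohere; local Qwen3 as the always-available fallback.
EMBEDDING_PRIORITY = ["JINA_AI_API_KEY", "GEMINI_API_KEY", "OPENAI_API_KEY", "COHERE_API_KEY"]
RERANK_PRIORITY = ["JINA_AI_API_KEY", "COHERE_API_KEY"]

def pick_backend(priority: list[str], env: dict[str, str]) -> str:
    """Return the first configured cloud provider, else the local fallback."""
    for var in priority:
        if env.get(var):
            return f"cloud:{var}"
    return "local:qwen3"

env = {"OPENAI_API_KEY": "sk-example"}
print(pick_backend(EMBEDDING_PRIORITY, env))  # cloud:OPENAI_API_KEY
print(pick_backend(RERANK_PRIORITY, env))     # local:qwen3 (OpenAI is not a rerank provider)
```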

LLM Configuration (2-Mode Architecture)

| Priority | Mode | Config | Use case |
| --- | --- | --- | --- |
| 1 | SDK | GEMINI_API_KEY or OPENAI_API_KEY | Direct API access (google-genai, openai) |
| 2 | Disabled | Nothing needed | Offline, embedding/rerank only (no LLM) |

SearXNG Configuration (2-Mode)

| Mode | Config | Description |
| --- | --- | --- |
| Embedded (default) | WET_AUTO_SEARXNG=true | Auto-installs and manages SearXNG as a subprocess |
| External | WET_AUTO_SEARXNG=false + SEARXNG_URL=http://host:port | Connects to a pre-existing SearXNG instance |

Security

  • SSRF prevention -- URL validation on crawl targets
  • Graceful fallbacks -- Cloud → Local embedding, multi-tier crawling
  • Error sanitization -- No credentials in error messages
  • File conversion sandboxing -- Optional CONVERT_ALLOWED_DIRS restriction
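To make "SSRF prevention" concrete, here is a minimal sketch of the kind of URL validation a crawler applies before fetching a target (illustrative only; the server's actual checks are not shown here, and a production guard would also resolve hostnames before testing the address):

```python
import ipaddress
from urllib.parse import urlparse

def is_safe_url(url: str) -> bool:
    """Reject crawl targets whose host is a private, loopback, or link-local IP."""
    host = urlparse(url).hostname
    if host is None:
        return False  # no host at all, e.g. a relative URL
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        return True  # hostname, not an IP literal; a real guard resolves it first
    return not (addr.is_private or addr.is_loopback or addr.is_link_local)
```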

Build from Source

git clone https://github.com/n24q02m/wet-mcp.git
cd wet-mcp
uv sync
uv run wet-mcp

License

MIT -- See LICENSE.
