
WET - Web Extended Toolkit MCP Server

mcp-name: io.github.n24q02m/wet-mcp

Open-source MCP Server for web search, content extraction, library docs & multimodal analysis.


Features

  • Web Search -- Embedded SearXNG metasearch (Google, Bing, DuckDuckGo, Brave) with filters, semantic reranking, query expansion, and snippet enrichment
  • Academic Research -- Search Google Scholar, Semantic Scholar, arXiv, PubMed, CrossRef, BASE
  • Library Docs -- Auto-discover and index documentation with FTS5 hybrid search, HyDE-enhanced retrieval, and version-specific docs
  • Content Extract -- Clean content extraction (Markdown/Text), structured data extraction (LLM + JSON Schema), batch processing (up to 50 URLs), deep crawling, site mapping
  • Local File Conversion -- Convert PDF, DOCX, XLSX, CSV, HTML, EPUB, PPTX to Markdown
  • Media -- List, download, and analyze images, videos, audio files
  • Anti-bot -- Stealth mode bypasses Cloudflare, Medium, LinkedIn, Twitter
  • Zero Config -- Built-in local Qwen3 embedding + reranking, no API keys needed. Optional cloud providers (Jina AI, Gemini, OpenAI, Cohere)
  • Sync -- Cross-machine sync of indexed docs via rclone (Google Drive, S3, Dropbox)

Quick Start

Claude Code Plugin (Recommended)

Via marketplace (includes skills: /fact-check, /compare):

/plugin marketplace add n24q02m/claude-plugins
/plugin install wet-mcp@n24q02m-plugins

Configure env vars in ~/.claude/settings.local.json or shell profile. See Environment Variables.

Gemini CLI Extension

gemini extensions install https://github.com/n24q02m/wet-mcp

Codex CLI

Add to ~/.codex/config.toml:

[mcp_servers.wet]
command = "uvx"
args = ["--python", "3.13", "wet-mcp"]

MCP Server

Python 3.13 required -- Python 3.14+ is not supported due to SearXNG incompatibility. You must specify --python 3.13 when using uvx.

On first run, the server automatically installs SearXNG, Playwright chromium, and starts the embedded search engine.

Option 1: uvx

{
  "mcpServers": {
    "wet": {
      "command": "uvx",
      "args": ["--python", "3.13", "wet-mcp@latest"]
    }
  }
}

Option 2: Docker

{
  "mcpServers": {
    "wet": {
      "command": "docker",
      "args": [
        "run", "-i", "--rm",
        "--name", "mcp-wet",
        "-v", "wet-data:/data",
        "-e", "API_KEYS",
        "-e", "GITHUB_TOKEN",
        "-e", "SYNC_ENABLED",
        "n24q02m/wet-mcp:latest"
      ]
    }
  }
}

Configure env vars in ~/.claude/settings.local.json or your shell profile. See Environment Variables below.

Tools

| Tool | Actions | Description |
| --- | --- | --- |
| search | search, research, docs, similar | Web search (with filters, reranking, expand/enrich), academic research, library docs (HyDE), find similar |
| extract | extract, batch, crawl, map, convert, extract_structured | Content extraction, batch processing (up to 50 URLs), deep crawling, site mapping, local file conversion, structured data extraction (JSON Schema) |
| media | list, download, analyze | Media discovery, download, and analysis |
| config | status, set, cache_clear, docs_reindex | Server configuration and cache management |
| setup | warmup, setup_sync | Pre-download models, configure cloud sync |
| help | -- | Full documentation for any tool |
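Under the hood, an MCP client invokes these tools with a standard JSON-RPC `tools/call` request. A sketch of what that looks like for the search tool (the `action` and `query` argument names are assumptions inferred from the table above; the help tool returns the authoritative schema):

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "tools/call",
  "params": {
    "name": "search",
    "arguments": {
      "action": "search",
      "query": "fastapi dependency injection"
    }
  }
}
```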

MCP Prompts

| Prompt | Parameters | Description |
| --- | --- | --- |
| research_topic | topic | Research a topic using academic search |
| library_docs | library, question | Find library documentation |

Zero-Config Setup

No environment variables needed. On first start, the server opens a setup page in your browser:

  1. Start the server (via plugin, uvx, or Docker)
  2. A setup URL appears -- open it in any browser
  3. Fill in your credentials on the guided form
  4. Credentials are encrypted and stored locally

Your credentials never leave your machine. The relay server only sees encrypted data.

For CI/automation, you can still use environment variables (see below).

Configuration

Pre-install (optional)

Use the setup MCP tool to warmup models and install dependencies:

# Via MCP tool call (recommended):
setup(action="warmup")

# With cloud embedding configured, warmup validates API keys
# and skips local model download if cloud models are available.

The warmup action pre-downloads SearXNG, Playwright, and the embedding/reranker models (~1.1 GB total) so the first real connection does not time out.

Sync setup

Sync is fully automatic. Just set SYNC_ENABLED=true and the server handles everything:

  1. First sync: rclone is auto-downloaded, a browser opens for OAuth authentication
  2. Token saved: OAuth token is stored locally at ~/.wet-mcp/tokens/ (600 permissions)
  3. Subsequent runs: Token is loaded automatically -- no manual steps needed

For non-Google Drive providers, set SYNC_PROVIDER and SYNC_REMOTE:

{
  "SYNC_ENABLED": "true",
  "SYNC_PROVIDER": "dropbox",
  "SYNC_REMOTE": "dropbox"
}

Environment Variables

| Variable | Required | Default | Description |
| --- | --- | --- | --- |
| API_KEYS | No | -- | API keys for cloud providers (format: `ENV_VAR:key,...`). Enables cloud embedding + reranking |
| COHERE_API_KEY | No | -- | Cohere API key (embedding + reranking) |
| JINA_AI_API_KEY | No | -- | Jina AI API key (embedding + reranking) |
| GEMINI_API_KEY | No | -- | Google Gemini API key (LLM + embedding) |
| OPENAI_API_KEY | No | -- | OpenAI API key (LLM + embedding) |
| GITHUB_TOKEN | No | auto-detect | GitHub token for docs discovery (60 -> 5000 req/hr). Auto-detected from `gh auth token` |
| EMBEDDING_BACKEND | No | auto-detect | `cloud` or `local` (Qwen3). Auto: API_KEYS -> cloud, else local |
| EMBEDDING_MODEL | No | auto-detect | Cloud embedding model name |
| EMBEDDING_DIMS | No | 0 (auto=768) | Embedding dimensions |
| RERANK_ENABLED | No | true | Enable reranking after search |
| RERANK_BACKEND | No | auto-detect | `cloud` or `local`. Auto: Cohere/Jina key -> cloud, else local |
| RERANK_MODEL | No | auto-detect | Cloud rerank model name |
| RERANK_TOP_N | No | 10 | Return top N results after reranking |
| LLM_MODELS | No | gemini-3-flash-preview | LLM model for media analysis (google-genai or openai) |
| WET_AUTO_SEARXNG | No | true | Auto-start embedded SearXNG subprocess |
| WET_SEARXNG_PORT | No | 41592 | SearXNG port |
| SEARXNG_URL | No | http://localhost:41592 | External SearXNG URL (when auto-start is disabled) |
| SEARXNG_TIMEOUT | No | 30 | SearXNG request timeout in seconds |
| CONVERT_MAX_FILE_SIZE | No | 104857600 | Max file size for local conversion in bytes (100 MB) |
| CONVERT_ALLOWED_DIRS | No | -- | Comma-separated paths to restrict local file conversion |
| CACHE_DIR | No | ~/.wet-mcp | Data directory for cache, docs, downloads |
| DOCS_DB_PATH | No | ~/.wet-mcp/docs.db | Docs database location |
| DOWNLOAD_DIR | No | ~/.wet-mcp/downloads | Media download directory |
| TOOL_TIMEOUT | No | 120 | Tool execution timeout in seconds (0 = no timeout) |
| WET_CACHE | No | true | Enable/disable web cache |
| SYNC_ENABLED | No | false | Enable rclone sync |
| SYNC_PROVIDER | No | drive | rclone provider type (drive, dropbox, s3, etc.) |
| SYNC_REMOTE | No | gdrive | rclone remote name |
| SYNC_FOLDER | No | wet-mcp | Remote folder name |
| SYNC_INTERVAL | No | 300 | Auto-sync interval in seconds (0 = manual) |
| LOG_LEVEL | No | INFO | Logging level |
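To make the documented `API_KEYS` format (`ENV_VAR:key,...`) concrete, here is a minimal parsing sketch. This is illustrative only; wet-mcp's actual parser may handle edge cases differently:

```python
def parse_api_keys(value: str) -> dict[str, str]:
    """Split the documented API_KEYS format (ENV_VAR:key,...) into a mapping."""
    keys: dict[str, str] = {}
    for entry in value.split(","):
        entry = entry.strip()
        if not entry:
            continue
        # Split on the first colon only, since key material may itself contain colons.
        name, _, secret = entry.partition(":")
        if name and secret:
            keys[name] = secret
    return keys

# Two providers in one variable, as the table above describes:
parse_api_keys("JINA_AI_API_KEY:jina_abc123,COHERE_API_KEY:co_xyz")
```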

Embedding & Reranking

Both embedding and reranking are always available -- local models are built-in and require no configuration.

  • Jina AI (recommended): A single JINA_AI_API_KEY enables both embedding and reranking
  • Embedding priority: Jina AI > Gemini > OpenAI > Cohere. Local Qwen3 fallback always available
  • Reranking priority: Jina AI > Cohere. Local Qwen3 fallback always available
  • GPU auto-detection: CUDA/DirectML auto-detected, uses GGUF models for better performance
  • All embeddings stored at 768 dims. Switching providers never breaks the vector table
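One simple way to keep every provider's output at a fixed 768 dims, as described above, is to truncate or zero-pad vectors before storage. A minimal sketch under that assumption (not wet-mcp's actual implementation):

```python
TARGET_DIMS = 768  # the fixed storage width noted above

def normalize_dims(vector: list[float], dims: int = TARGET_DIMS) -> list[float]:
    """Truncate or zero-pad an embedding so vectors from any provider share one width."""
    if len(vector) >= dims:
        return vector[:dims]
    return vector + [0.0] * (dims - len(vector))

# A larger cloud model and a smaller local model both land at 768 dims:
assert len(normalize_dims([0.1] * 1024)) == 768
assert len(normalize_dims([0.1] * 384)) == 768
```

With a fixed width like this, switching embedding providers never requires rebuilding the vector table.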

LLM Configuration (2-Mode Architecture)

| Priority | Mode | Config | Use case |
| --- | --- | --- | --- |
| 1 | SDK | GEMINI_API_KEY or OPENAI_API_KEY | Direct API access (google-genai, openai) |
| 2 | Disabled | Nothing needed | Offline, embedding/rerank only (no LLM) |

SearXNG Configuration (2-Mode)

| Mode | Config | Description |
| --- | --- | --- |
| Embedded (default) | WET_AUTO_SEARXNG=true | Auto-installs and manages SearXNG as a subprocess |
| External | WET_AUTO_SEARXNG=false + SEARXNG_URL=http://host:port | Connects to a pre-existing SearXNG instance |
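Following the style of the sync example above, the external mode corresponds to an env block like this (the URL is a placeholder for your own SearXNG instance):

```json
{
  "WET_AUTO_SEARXNG": "false",
  "SEARXNG_URL": "http://localhost:8888"
}
```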

Security

  • SSRF prevention -- URL validation on crawl targets
  • Graceful fallbacks -- Cloud → Local embedding, multi-tier crawling
  • Error sanitization -- No credentials in error messages
  • File conversion sandboxing -- Optional CONVERT_ALLOWED_DIRS restriction

Build from Source

git clone https://github.com/n24q02m/wet-mcp.git
cd wet-mcp
uv sync
uv run wet-mcp

License

MIT -- See LICENSE.
