Skip to main content

Open-source MCP Server thay thế Tavily - Web search, extract, crawl với SearXNG

Project description

WET - Web ExTract MCP Server

PyPI version License: MIT

Open-source MCP Server replacing Tavily for web scraping & multimodal extraction

Zero-install experience: just uvx wet-mcp - automatically setups and manages SearXNG container.

Features

Feature Description
Web Search Search via SearXNG (metasearch: Google, Bing, DuckDuckGo, Brave)
Content Extract Extract clean content (Markdown/Text/HTML)
Deep Crawl Crawl multiple pages from a root URL with depth control
Site Map Discover website URL structure
Media List and download images, videos, audio files
Anti-bot Stealth mode bypasses Cloudflare, Medium, LinkedIn, Twitter

Quick Start

Prerequisites

  • Docker daemon running (for SearXNG)
  • Python 3.13+ (or use uvx)

MCP Client Configuration

Claude Desktop / Cursor / Windsurf / Antigravity:

{
  "mcpServers": {
    "wet": {
      "command": "uvx",
      "args": ["wet-mcp"]
    }
  }
}

That's it! When the MCP client calls wet-mcp for the first time:

  1. Automatically installs Playwright chromium
  2. Automatically pulls SearXNG Docker image
  3. Starts wet-searxng container
  4. Runs the MCP server

Without uvx

pip install wet-mcp
wet-mcp

Tools

Tool Actions Description
web search, extract, crawl, map Web operations
media list, download Media discovery & download
help - Full documentation

Examples

# Search
{"action": "search", "query": "python web scraping", "max_results": 10}

# Extract content
{"action": "extract", "urls": ["https://example.com"]}

# Crawl with depth
{"action": "crawl", "urls": ["https://docs.python.org"], "depth": 2}

# Map site structure
{"action": "map", "urls": ["https://example.com"]}

# List media
{"action": "list", "url": "https://github.com/python/cpython"}

# Download media
{"action": "download", "media_urls": ["https://example.com/image.png"]}

Tech Stack

Component Technology
Language Python 3.13
MCP Framework FastMCP
Web Search SearXNG (auto-managed Docker)
Web Crawling Crawl4AI
Docker Management python-on-whales

How It Works

┌─────────────────────────────────────────────────────────┐
│                    MCP Client                           │
│            (Claude, Cursor, Windsurf)                   │
└─────────────────────┬───────────────────────────────────┘
                      │ MCP Protocol
                      ▼
┌─────────────────────────────────────────────────────────┐
│                   WET MCP Server                        │
│  ┌──────────┐  ┌──────────┐  ┌──────────────────────┐   │
│  │   web    │  │  media   │  │        help          │   │
│  │ (search, │  │ (list,   │  │  (full documentation)│   │
│  │ extract, │  │ crawl,   │  └──────────────────────┘   │
│  │ crawl,   │  │ download)│                             │
│  │ map)     │  └────┬─────┘                             │
│  └────┬─────┘       │                                   │
│       │             │                                   │
│       ▼             ▼                                   │
│  ┌──────────┐  ┌──────────┐                             │
│  │ SearXNG  │  │ Crawl4AI │                             │
│  │ (Docker) │  │(Playwright)│                           │
│  └──────────┘  └──────────┘                             │
└─────────────────────────────────────────────────────────┘

Configuration

Environment variables:

Variable Default Description
WET_AUTO_DOCKER true Auto-manage SearXNG container
WET_SEARXNG_PORT 8080 SearXNG container port
SEARXNG_URL http://localhost:8080 External SearXNG URL
LOG_LEVEL INFO Logging level

Container Management

# View SearXNG logs
docker logs wet-searxng

# Stop SearXNG
docker stop wet-searxng

# Remove container (will be recreated on next run)
docker rm wet-searxng

# Reset auto-setup (forces re-install Playwright)
rm ~/.wet-mcp/.setup-complete

License

MIT License

Project details


Release history Release notifications | RSS feed

This version

1.1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

wet_mcp-1.1.0.tar.gz (13.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

wet_mcp-1.1.0-py3-none-any.whl (18.2 kB view details)

Uploaded Python 3

File details

Details for the file wet_mcp-1.1.0.tar.gz.

File metadata

  • Download URL: wet_mcp-1.1.0.tar.gz
  • Upload date:
  • Size: 13.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for wet_mcp-1.1.0.tar.gz
Algorithm Hash digest
SHA256 88b38533e418a41cc7404403f3d79c24ca7f0de3dae4e9ecd52810d4e2a2a7f8
MD5 947225ea1e565a02d116a3ca88ee393d
BLAKE2b-256 f6130b78fda3cc253f8467ad7c5541b737ad7b3c2aa2e707f5d03f49670813ef

See more details on using hashes here.

File details

Details for the file wet_mcp-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: wet_mcp-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 18.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.28 {"installer":{"name":"uv","version":"0.9.28","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for wet_mcp-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4ff2855755c17423b40ce0ff88a31a90cc71ea0d21c844b2849b561f170b62de
MD5 7e0b7b83e363556f38231edade0f49a7
BLAKE2b-256 f8046080b03c38d3f2bbb4ab8f3f2264fcb6c8ded70aebfde281c334d747a21d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page