Skip to main content

A Model Context Protocol server providing tools to execute spider crawler

Project description

mcp-server-spider: A spider MCP server

Overview

A Model Context Protocol server for Spider crawler interaction and automation. This server provides tools to crawl and scrape web pages.

Please note that mcp-server-spider is currently in early develpoment. There might be bugs and features added in the future.

Tools

  1. crawl
    • Crawls the given url and returns the list of URLs that were found
    • Input:
      • url: The url to crawl
      • headers: Additional headers passed along with crawl requests
      • user_agent: User agent to use for the crawl requests
      • depth: The depth of link traversal
      • blacklist: A list of regural expression to blacklist URLs from the crawling process
      • whitelist: A list of regular expression to whitelist URLS from the crawling process
      • respect_robots_txt: Whether to respect robots.txt file
      • accept_invalid_certs: Whether to accept invalid certifcates or not
    • Returns: List of URLs found
  2. scrape
    • Scrapes the given url and returns a list of JSON objects that contain the url, links and content of each page discovered
    • Input: Same as crawl
    • Returns: A list of JSON objects (as a string) that contain the url, links and content of each page discovered

Installation

Using uv (recommended)

When using uv no specific installation is needed. We will use uvx to directly run mcp-server-spider.

Using PIP

Alternatively you can install mcp-server-spider via pip:

pip install mcp-server-spider

After installation, you can run it as a script using:

python -m mcp_server_spider

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_server_spider-0.1.0.tar.gz (10.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mcp_server_spider-0.1.0-py3-none-any.whl (5.1 kB view details)

Uploaded Python 3

File details

Details for the file mcp_server_spider-0.1.0.tar.gz.

File metadata

  • Download URL: mcp_server_spider-0.1.0.tar.gz
  • Upload date:
  • Size: 10.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.6.14

File hashes

Hashes for mcp_server_spider-0.1.0.tar.gz
Algorithm Hash digest
SHA256 513526e9acaa3fb03f8dfb3ccdadaad5b6178acabb9d87b14ed6e5a7d4641e20
MD5 5f6d2d34d3d224038df8dc039495debe
BLAKE2b-256 116494a56effb489a1c844747b93e54b9ce80c9cbc957956578215240031e286

See more details on using hashes here.

File details

Details for the file mcp_server_spider-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for mcp_server_spider-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 504d6491abd1003a7fc2ed4ea24f2544a518b7efff0d67ce99218fe6c52841e1
MD5 771ed5863b56ae86daf31eb0cc336043
BLAKE2b-256 7effa269381f5ff15d0d605448592a0624bf54f01b77cfd3cf528a9de2309f76

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page