Context-aware web fetching MCP server that respects token limits

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Smart WebFetch MCP Server

Context-aware web fetching for LLMs. Prevents context window flooding by checking page size before fetching and providing surgical extraction tools.

The Problem

Standard web fetch tools dump entire pages into the context window, often:

Exceeding token limits
Wasting context on navigation, footers, ads
Flooding the model with irrelevant content

The Solution

Smart WebFetch provides 7 tools for intelligent web fetching:

Tool	Purpose
`web_preflight`	Check page size before fetching
`web_smart_fetch`	Fetch with automatic truncation
`web_fetch_code`	Extract only code blocks
`web_fetch_section`	Fetch specific heading/section
`web_fetch_chunked`	Paginated fetching for large docs
`web_fetch_links`	Extract all links from a page
`web_fetch_tables`	Extract tables as markdown

Installation

# Install from PyPI
pip install smart-webfetch-mcp

# Or with uvx (recommended for MCP)
uvx smart-webfetch-mcp

Configuration

Claude Code

claude mcp add --transport stdio smart-webfetch -- uvx smart-webfetch-mcp

OpenCode

Add to your opencode.json:

{
  "mcp": {
    "smart-webfetch": {
      "type": "local",
      "command": ["uvx", "smart-webfetch-mcp"],
      "enabled": true
    }
  }
}

Claude Desktop

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "smart-webfetch": {
      "command": "uvx",
      "args": ["smart-webfetch-mcp"]
    }
  }
}

Usage Examples

Check before fetching

Use web_preflight to check https://docs.python.org/3/library/asyncio.html

Response:

{
  "url": "https://docs.python.org/3/library/asyncio.html",
  "estimated_tokens": 45000,
  "safe_for_context": false,
  "recommendation": "Very large page (~45,000 tokens). Use web_fetch_section or web_fetch_chunked."
}

Fetch with automatic truncation

Use web_smart_fetch on https://example.com/docs with max_tokens=4000

Extract only code examples

Use web_fetch_code on https://docs.python.org/3/library/asyncio-task.html

Get specific section

Use web_fetch_section on https://docs.python.org/3/library/asyncio.html 
with heading="Running an asyncio Program"

Paginated reading

Use web_fetch_chunked on https://large-docs.com/api with chunk=0, chunk_size=4000

Then continue with chunk=1, chunk=2, etc.

Tool Reference

web_preflight

Check page metadata before fetching.

Parameters:

url (required): URL to check

Returns:

estimated_tokens: Approximate token count
content_type: MIME type
is_html: Whether content is HTML
title: Page title (if HTML)
safe_for_context: Boolean (true if < 8000 tokens)
recommendation: Human-readable advice

web_smart_fetch

Fetch with automatic truncation for large pages.

Parameters:

url (required): URL to fetch
max_tokens (optional, default 8000): Maximum tokens to return
strategy (optional, default "auto"): "auto" finds natural break points, "truncate" hard cuts

Returns: Markdown content with metadata header

web_fetch_code

Extract only code blocks from a page.

Parameters:

url (required): URL to extract code from

Returns: Code blocks with language annotations and context

web_fetch_section

Fetch content under a specific heading.

Parameters:

url (required): URL to fetch from
heading (required): Heading text to find (case-insensitive)

Returns: Section content or list of available sections if not found

web_fetch_chunked

Fetch large documents in chunks.

Parameters:

url (required): URL to fetch
chunk (optional, default 0): Chunk index (0-based)
chunk_size (optional, default 4000): Tokens per chunk

Returns: Chunk content with navigation metadata

web_fetch_links

Extract all links from a page.

Parameters:

url (required): URL to extract links from
filter_pattern (optional): Regex to filter link URLs
external_only (optional, default false): Only return external links

Returns: Markdown list of links with text and URL

web_fetch_tables

Extract tables from a page as markdown.

Parameters:

url (required): URL to extract tables from
table_index (optional): Specific table index (0-based), returns all if not specified

Returns: Markdown formatted tables

Development

# Clone and install dev dependencies
git clone https://github.com/mathisto/smart-webfetch-mcp
cd smart-webfetch-mcp
pip install -e ".[dev]"

# Run tests
pytest

# Format code
ruff format .
ruff check --fix .

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

mathisto

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

Nov 28, 2025

0.2.0

Nov 28, 2025

0.1.0

Nov 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

smart_webfetch_mcp-0.3.0.tar.gz (23.5 kB view details)

Uploaded Nov 28, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

smart_webfetch_mcp-0.3.0-py3-none-any.whl (22.3 kB view details)

Uploaded Nov 28, 2025 Python 3

File details

Details for the file smart_webfetch_mcp-0.3.0.tar.gz.

File metadata

Download URL: smart_webfetch_mcp-0.3.0.tar.gz
Upload date: Nov 28, 2025
Size: 23.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for smart_webfetch_mcp-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`c1f70d9f031085e4914d903aad818e494d29dd8f868d6b33b6af9038b98386fd`
MD5	`dcb44644b8c0ff3a2ab9b76013c06910`
BLAKE2b-256	`dff181a2255a5416517154c80a3809c66af58c4bc20f0ec43e22c3e64034013d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for smart_webfetch_mcp-0.3.0.tar.gz:

Publisher: publish.yml on mathisto/smart-webfetch-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: smart_webfetch_mcp-0.3.0.tar.gz
- Subject digest: c1f70d9f031085e4914d903aad818e494d29dd8f868d6b33b6af9038b98386fd
- Sigstore transparency entry: 730421455
- Sigstore integration time: Nov 28, 2025
Source repository:
- Permalink: mathisto/smart-webfetch-mcp@6e4c80bb108b3d1ab6942f9b047c7695d0dbd106
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/mathisto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6e4c80bb108b3d1ab6942f9b047c7695d0dbd106
- Trigger Event: push

File details

Details for the file smart_webfetch_mcp-0.3.0-py3-none-any.whl.

File metadata

Download URL: smart_webfetch_mcp-0.3.0-py3-none-any.whl
Upload date: Nov 28, 2025
Size: 22.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for smart_webfetch_mcp-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`283cba72f56f68a90026d0d13a8da91c3283ee85745d67e933f02eeb3af598da`
MD5	`3fa87df9932a53550ee7c5501890b31c`
BLAKE2b-256	`03e1001fb3b4c9444a754f1a4833b2bcfc788a9f18f344fc4908aaf1ee6154ea`

See more details on using hashes here.

Provenance

The following attestation bundles were made for smart_webfetch_mcp-0.3.0-py3-none-any.whl:

Publisher: publish.yml on mathisto/smart-webfetch-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: smart_webfetch_mcp-0.3.0-py3-none-any.whl
- Subject digest: 283cba72f56f68a90026d0d13a8da91c3283ee85745d67e933f02eeb3af598da
- Sigstore transparency entry: 730421464
- Sigstore integration time: Nov 28, 2025
Source repository:
- Permalink: mathisto/smart-webfetch-mcp@6e4c80bb108b3d1ab6942f9b047c7695d0dbd106
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/mathisto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6e4c80bb108b3d1ab6942f9b047c7695d0dbd106
- Trigger Event: push

smart-webfetch-mcp 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Smart WebFetch MCP Server

The Problem

The Solution

Installation

Configuration

Claude Code

OpenCode

Claude Desktop

Usage Examples

Check before fetching

Fetch with automatic truncation

Extract only code examples

Get specific section

Paginated reading

Tool Reference

web_preflight

web_smart_fetch

web_fetch_code

web_fetch_section

web_fetch_chunked

web_fetch_links

web_fetch_tables

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance