MCP server to search across NVIDIA blogs and releases to empower LLMs to better answer NVIDIA specific queries

These details have not been verified by PyPI

Project links

Project description

mcp-nvidia

MCP server to search across NVIDIA blogs and releases to empower LLMs to better answer NVIDIA-specific queries.

Overview

This Model Context Protocol (MCP) server enables Large Language Models (LLMs) to search across multiple NVIDIA domains to find relevant information about NVIDIA technologies, products, and services.

Supported Domains (16 default, customizable)

The server searches across the following NVIDIA domains by default. You can customize this list using the MCP_NVIDIA_DOMAINS environment variable or the domains parameter in search queries:

blogs.nvidia.com - NVIDIA blog posts and articles
build.nvidia.com - NVIDIA AI Foundation models and services
catalog.ngc.nvidia.com - NGC catalog of GPU-accelerated software
developer.download.nvidia.com - Developer downloads and resources
developer.nvidia.com - Developer resources, SDKs, and technical documentation
docs.api.nvidia.com - API documentation
docs.nvidia.com - Comprehensive technical documentation
docs.omniverse.nvidia.com - Omniverse documentation
forums.developer.nvidia.com - Developer forums
forums.nvidia.com - Community forums
gameworksdocs.nvidia.com - GameWorks documentation
ngc.nvidia.com - NVIDIA GPU Cloud
nvidia.github.io - NVIDIA GitHub Pages documentation
nvidianews.nvidia.com - Official NVIDIA news and press releases
research.nvidia.com - NVIDIA research publications
resources.nvidia.com - NVIDIA resources and whitepapers

Note: For security, only nvidia.com domains and nvidia.github.io are allowed.

Key Features

Structured JSON output compatible with AI agents and LLMs
Intelligent keyword extraction using NLTK stopwords for better relevance
Context-aware search with enhanced snippets and relevance scoring
Domain-specific filtering for targeted searches

Installation

Via npx (Easiest - recommended for MCP clients)

npx @bharatr21/mcp-nvidia

Or add to your Claude Desktop config:

{
  "mcpServers": {
    "nvidia": {
      "command": "npx",
      "args": ["-y", "@bharatr21/mcp-nvidia"]
    }
  }
}

Note: This requires Python 3.10+ to be installed on your system. The package will automatically use the Python backend.

Via pip

pip install mcp-nvidia

From source

git clone https://github.com/bharatr21/mcp-nvidia.git
cd mcp-nvidia
pip install -e .

Usage

Running the Server

The MCP server can be run directly from the command line:

mcp-nvidia

Configuration

The server can be configured using environment variables:

MCP_NVIDIA_DOMAINS: Comma-separated list of custom NVIDIA domains to search (overrides defaults)
- Security: Only nvidia.com domains and subdomains are allowed. Invalid domains are automatically filtered out.
- Example: "https://developer.nvidia.com/,https://docs.nvidia.com/"
MCP_NVIDIA_LOG_LEVEL: Logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL)

Example:

export MCP_NVIDIA_DOMAINS="https://developer.nvidia.com/,https://docs.nvidia.com/"
export MCP_NVIDIA_LOG_LEVEL="DEBUG"
mcp-nvidia

Configuring with Claude Desktop

Add the following to your Claude Desktop configuration file:

MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%/Claude/claude_desktop_config.json

{
  "mcpServers": {
    "nvidia": {
      "command": "mcp-nvidia"
    }
  }
}

With custom configuration (environment variables):

{
  "mcpServers": {
    "nvidia": {
      "command": "mcp-nvidia",
      "env": {
        "MCP_NVIDIA_LOG_LEVEL": "DEBUG",
        "MCP_NVIDIA_DOMAINS": "https://developer.nvidia.com/,https://docs.nvidia.com/"
      }
    }
  }
}

If you installed from source, you may need to use the full path to the Python executable:

{
  "mcpServers": {
    "nvidia": {
      "command": "/path/to/python",
      "args": ["-m", "mcp_nvidia"]
    }
  }
}

Available Tools

search_nvidia

Search across NVIDIA domains for specific information. Results include citations with URLs for easy reference.

Parameters:

query (required): The search query to find information across NVIDIA domains
domains (optional): List of specific NVIDIA domains to search. If not provided, searches all default domains
max_results_per_domain (optional): Maximum number of results to return per domain (default: 3)
min_relevance_score (optional): Minimum relevance score threshold (0-100) to filter results (default: 33)

Example queries:

"CUDA programming best practices"
"RTX 4090 specifications"
"TensorRT optimization techniques"
"Latest AI announcements"
"Omniverse development tutorials"

Features:

Enhanced search using ddgs package for reliable DuckDuckGo integration with domain filtering
Structured JSON output: Returns data in a structured format with both machine-readable and human-readable fields
Context-aware snippets: Automatically fetches surrounding text from source URLs and highlights the relevant snippet with **bold** formatting
Relevance scoring (0-100 scale): Each result includes a relevance score based on query term matches in title, snippet, and URL
- Results are sorted by relevance score (highest first)
- Results below the threshold are automatically filtered out
- Score displayed as "Score: X/100" in formatted text
Security controls: Input validation and limits (customizable via code)
- Query/topic length: 500 characters max
- Results per domain: 10 max
- Domain whitelist: nvidia.com and nvidia.github.io only
- Rate limiting: 200ms between search API calls (~5 searches/sec)
- Concurrent searches: 5 max simultaneous searches
Concurrent search across multiple domains for fast results
Formatted results with titles, URLs, enhanced snippets with context, and source domains
Dedicated citations section with numbered references for easy copying

Output Format: Results are returned as structured JSON with the following schema:

{
  "success": true,
  "results": [
    {
      "title": "Page Title",
      "url": "https://example.nvidia.com/page",
      "snippet": "Enhanced snippet with **highlighted** keywords",
      "domain": "example.nvidia.com",
      "relevance_score": 85,
      "formatted_text": "Markdown-formatted result for display"
    }
  ],
  "metadata": {
    "domains_searched": 16,
    "search_time_ms": 1234
  }
}

discover_nvidia_content

Discover specific types of NVIDIA educational and learning content such as videos, courses, tutorials, webinars, or blog posts.

Parameters:

content_type (required): Type of content to find - one of:
- video: Video tutorials and demonstrations
- course: Training courses and certifications (NVIDIA DLI)
- tutorial: Step-by-step guides and how-tos
- webinar: Webinars and live sessions
- blog: Blog posts and articles
topic (required): The topic or technology to find content about (e.g., "CUDA", "Omniverse", "AI")
max_results (optional): Maximum number of content items to return (default: 5)

Example queries:

Find video tutorials: discover_nvidia_content(content_type="video", topic="CUDA programming")
Find training courses: discover_nvidia_content(content_type="course", topic="Deep Learning")
Find webinars: discover_nvidia_content(content_type="webinar", topic="AI in Healthcare")

Features:

Content-specific search strategies optimized for each type
Relevance scoring on 0-100 scale to highlight best matches
- Score displayed as "Score: X/100" for transparency
Direct links to videos, courses, tutorials, and other resources
Resource links section for easy access to all discovered content

Development

Setting up a development environment

# Clone the repository
git clone https://github.com/bharatr21/mcp-nvidia.git
cd mcp-nvidia

# Create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# Install in development mode with dev dependencies
pip install -e ".[dev]"

Running tests

# Install test dependencies
pip install -e ".[test]"

# Run all tests
pytest tests/ -v

# Run tests excluding slow tests (rate limiting)
pytest tests/ -v -m "not slow"

Architecture

The server uses the Model Context Protocol (MCP) to expose search functionality to LLMs.

Search Flow

sequenceDiagram
    participant LLM as LLM/AI Agent
    participant MCP as MCP Server
    participant Validator as Input Validator
    participant RateLimit as Rate Limiter
    participant DDGS as DuckDuckGo Search
    participant Fetcher as URL Fetcher

    LLM->>MCP: search_nvidia(query, domains)
    MCP->>Validator: Validate inputs
    Validator-->>MCP: ✓ Valid (or ✗ Error)

    loop For each domain (max 5 concurrent)
        MCP->>RateLimit: Check rate limit
        RateLimit-->>MCP: ✓ Proceed (wait 200ms)
        MCP->>DDGS: Search "site:domain query"
        DDGS-->>MCP: Search results

        loop For each result
            MCP->>Fetcher: Fetch URL context
            Fetcher-->>MCP: Enhanced snippet
        end
    end

    MCP->>MCP: Calculate relevance scores
    MCP->>MCP: Sort & format JSON
    MCP-->>LLM: Structured JSON response

System Architecture

flowchart TD
    A[MCP Client<br/>Claude/LLMs] -->|JSON-RPC| B[MCP Server]
    B --> C{Tool Router}

    C -->|search_nvidia| D[Search Handler]
    C -->|discover_nvidia_content| E[Discovery Handler]

    D --> F[Security Layer]
    F --> G[Input Validator]
    F --> H[Rate Limiter]
    F --> I[Concurrency Control]

    G --> J[Domain Searcher]
    J --> K[DuckDuckGo API]

    J --> L[URL Context Fetcher]
    L --> M[BeautifulSoup Parser]

    J --> N[Relevance Scorer]
    N --> O[Response Builder]
    O --> P[Structured JSON]

    P -->|CallToolResult| A

    style F fill:#ffcccc
    style P fill:#ccffcc
    style K fill:#cce5ff

Key Components

Input Validation: Validates query length, domain whitelist, and parameter limits
Rate Limiting: Enforces 200ms minimum interval between DuckDuckGo API calls
Concurrent Search: Searches up to 5 domains simultaneously with semaphore control
Context Enhancement: Fetches actual page content and highlights relevant snippets
Relevance Scoring: Calculates 0-100 scores based on keyword matches
JSON Output: Returns structured data compatible with both AI agents and humans

Extending Domain Coverage

The list of searchable domains is configured in src/mcp_nvidia/server.py in the DEFAULT_DOMAINS constant. To add more NVIDIA domains:

Edit src/mcp_nvidia/server.py
Add new domain URLs to the DEFAULT_DOMAINS list
Reinstall the package

License

MIT License - see LICENSE file for details

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues, questions, or contributions, please visit the GitHub repository.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.0

Jan 13, 2026

0.4.0

Nov 17, 2025

0.3.0

Nov 13, 2025

0.2.5

Nov 10, 2025

0.2.0

Nov 8, 2025

This version

0.1.0

Nov 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_nvidia-0.1.0.tar.gz (19.8 kB view details)

Uploaded Nov 5, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcp_nvidia-0.1.0-py3-none-any.whl (19.5 kB view details)

Uploaded Nov 5, 2025 Python 3

File details

Details for the file mcp_nvidia-0.1.0.tar.gz.

File metadata

Download URL: mcp_nvidia-0.1.0.tar.gz
Upload date: Nov 5, 2025
Size: 19.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for mcp_nvidia-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`f6024fe6200e64e636bb34de4b80d111ad5fded10d2848307b17da4628b9bd38`
MD5	`b691711d18cbdeb7be3bbdbed89e2bb3`
BLAKE2b-256	`828b7dd497c80f0436f432c10149b0ba65f9c1748e0b8150c685262d259af57a`

See more details on using hashes here.

File details

Details for the file mcp_nvidia-0.1.0-py3-none-any.whl.

File metadata

Download URL: mcp_nvidia-0.1.0-py3-none-any.whl
Upload date: Nov 5, 2025
Size: 19.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.18

File hashes

Hashes for mcp_nvidia-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6c67cdb0a1d0cc54390175140c64bfb06291dcd1dfdb70267643c8c0e4d33afe`
MD5	`58e36a6062509b13970d05cf4b0204cf`
BLAKE2b-256	`0b13a16e0910778fcf1ed50a644f861280927676c1a95ab9a0ca71820b26eb93`

See more details on using hashes here.

mcp-nvidia 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

mcp-nvidia

Overview

Supported Domains (16 default, customizable)

Key Features

Installation

Via npx (Easiest - recommended for MCP clients)

Via pip

From source

Usage

Running the Server

Configuration

Configuring with Claude Desktop

Available Tools

search_nvidia

discover_nvidia_content

Development

Setting up a development environment

Running tests

Architecture

Search Flow

System Architecture

Key Components

Extending Domain Coverage

License

Contributing

Support

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes