MCP server to search across NVIDIA blogs and releases to empower LLMs to better answer NVIDIA specific queries
Project description
mcp-nvidia
MCP server to search across NVIDIA blogs and releases to empower LLMs to better answer NVIDIA-specific queries.
Overview
This Model Context Protocol (MCP) server enables Large Language Models (LLMs) to search across multiple NVIDIA domains to find relevant information about NVIDIA technologies, products, and services.
Supported Domains (16 default, customizable)
The server searches across the following NVIDIA domains by default. You can customize this list using the MCP_NVIDIA_DOMAINS environment variable or the domains parameter in search queries:
- blogs.nvidia.com - NVIDIA blog posts and articles
- build.nvidia.com - NVIDIA AI Foundation models and services
- catalog.ngc.nvidia.com - NGC catalog of GPU-accelerated software
- developer.download.nvidia.com - Developer downloads and resources
- developer.nvidia.com - Developer resources, SDKs, and technical documentation
- docs.api.nvidia.com - API documentation
- docs.nvidia.com - Comprehensive technical documentation
- docs.omniverse.nvidia.com - Omniverse documentation
- forums.developer.nvidia.com - Developer forums
- forums.nvidia.com - Community forums
- gameworksdocs.nvidia.com - GameWorks documentation
- ngc.nvidia.com - NVIDIA GPU Cloud
- nvidia.github.io - NVIDIA GitHub Pages documentation
- nvidianews.nvidia.com - Official NVIDIA news and press releases
- research.nvidia.com - NVIDIA research publications
- resources.nvidia.com - NVIDIA resources and whitepapers
Note: For security, only nvidia.com domains and nvidia.github.io are allowed.
Key Features
- Structured JSON output compatible with AI agents and LLMs
- Intelligent keyword extraction using NLTK stopwords for better relevance
- Context-aware search with enhanced snippets and relevance scoring
- Domain-specific filtering for targeted searches
Installation
Via npx (Easiest - recommended for MCP clients)
npx @bharatr21/mcp-nvidia
Or add to your Claude Desktop config:
{
"mcpServers": {
"nvidia": {
"command": "npx",
"args": ["-y", "@bharatr21/mcp-nvidia"]
}
}
}
Note: This requires Python 3.10+ to be installed on your system. The package will automatically use the Python backend.
Via pip
pip install mcp-nvidia
From source
git clone https://github.com/bharatr21/mcp-nvidia.git
cd mcp-nvidia
pip install -e .
Usage
Running the Server
The MCP server can be run directly from the command line:
mcp-nvidia
Configuration
The server can be configured using environment variables:
MCP_NVIDIA_DOMAINS: Comma-separated list of custom NVIDIA domains to search (overrides defaults)- Security: Only nvidia.com domains and subdomains are allowed. Invalid domains are automatically filtered out.
- Example:
"https://developer.nvidia.com/,https://docs.nvidia.com/"
MCP_NVIDIA_LOG_LEVEL: Logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL)
Example:
export MCP_NVIDIA_DOMAINS="https://developer.nvidia.com/,https://docs.nvidia.com/"
export MCP_NVIDIA_LOG_LEVEL="DEBUG"
mcp-nvidia
Configuring with Claude Desktop
Add the following to your Claude Desktop configuration file:
MacOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%/Claude/claude_desktop_config.json
{
"mcpServers": {
"nvidia": {
"command": "mcp-nvidia"
}
}
}
With custom configuration (environment variables):
{
"mcpServers": {
"nvidia": {
"command": "mcp-nvidia",
"env": {
"MCP_NVIDIA_LOG_LEVEL": "DEBUG",
"MCP_NVIDIA_DOMAINS": "https://developer.nvidia.com/,https://docs.nvidia.com/"
}
}
}
}
If you installed from source, you may need to use the full path to the Python executable:
{
"mcpServers": {
"nvidia": {
"command": "/path/to/python",
"args": ["-m", "mcp_nvidia"]
}
}
}
Available Tools
search_nvidia
Search across NVIDIA domains for specific information. Results include citations with URLs for easy reference.
Parameters:
query(required): The search query to find information across NVIDIA domainsdomains(optional): List of specific NVIDIA domains to search. If not provided, searches all default domainsmax_results_per_domain(optional): Maximum number of results to return per domain (default: 3)min_relevance_score(optional): Minimum relevance score threshold (0-100) to filter results (default: 33)
Example queries:
- "CUDA programming best practices"
- "RTX 4090 specifications"
- "TensorRT optimization techniques"
- "Latest AI announcements"
- "Omniverse development tutorials"
Features:
- Enhanced search using ddgs package for reliable DuckDuckGo integration with domain filtering
- Structured JSON output: Returns data in a structured format with both machine-readable and human-readable fields
- Context-aware snippets: Automatically fetches surrounding text from source URLs and highlights the relevant snippet with
**bold**formatting - Relevance scoring (0-100 scale): Each result includes a relevance score based on query term matches in title, snippet, and URL
- Results are sorted by relevance score (highest first)
- Results below the threshold are automatically filtered out
- Score displayed as "Score: X/100" in formatted text
- Security controls: Input validation and limits (customizable via code)
- Query/topic length: 500 characters max
- Results per domain: 10 max
- Domain whitelist: nvidia.com and nvidia.github.io only
- Rate limiting: 200ms between search API calls (~5 searches/sec)
- Concurrent searches: 5 max simultaneous searches
- Concurrent search across multiple domains for fast results
- Formatted results with titles, URLs, enhanced snippets with context, and source domains
- Dedicated citations section with numbered references for easy copying
Output Format: Results are returned as structured JSON with the following schema:
{
"success": true,
"results": [
{
"title": "Page Title",
"url": "https://example.nvidia.com/page",
"snippet": "Enhanced snippet with **highlighted** keywords",
"domain": "example.nvidia.com",
"relevance_score": 85,
"formatted_text": "Markdown-formatted result for display"
}
],
"metadata": {
"domains_searched": 16,
"search_time_ms": 1234
}
}
discover_nvidia_content
Discover specific types of NVIDIA educational and learning content such as videos, courses, tutorials, webinars, or blog posts.
Parameters:
content_type(required): Type of content to find - one of:video: Video tutorials and demonstrationscourse: Training courses and certifications (NVIDIA DLI)tutorial: Step-by-step guides and how-toswebinar: Webinars and live sessionsblog: Blog posts and articles
topic(required): The topic or technology to find content about (e.g., "CUDA", "Omniverse", "AI")max_results(optional): Maximum number of content items to return (default: 5)
Example queries:
- Find video tutorials:
discover_nvidia_content(content_type="video", topic="CUDA programming") - Find training courses:
discover_nvidia_content(content_type="course", topic="Deep Learning") - Find webinars:
discover_nvidia_content(content_type="webinar", topic="AI in Healthcare")
Features:
- Content-specific search strategies optimized for each type
- Relevance scoring on 0-100 scale to highlight best matches
- Score displayed as "Score: X/100" for transparency
- Direct links to videos, courses, tutorials, and other resources
- Resource links section for easy access to all discovered content
Development
Setting up a development environment
# Clone the repository
git clone https://github.com/bharatr21/mcp-nvidia.git
cd mcp-nvidia
# Create a virtual environment
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Install in development mode with dev dependencies
pip install -e ".[dev]"
Running tests
# Install test dependencies
pip install -e ".[test]"
# Run all tests
pytest tests/ -v
# Run tests excluding slow tests (rate limiting)
pytest tests/ -v -m "not slow"
Architecture
The server uses the Model Context Protocol (MCP) to expose search functionality to LLMs.
Search Flow
sequenceDiagram
participant LLM as LLM/AI Agent
participant MCP as MCP Server
participant Validator as Input Validator
participant RateLimit as Rate Limiter
participant DDGS as DuckDuckGo Search
participant Fetcher as URL Fetcher
LLM->>MCP: search_nvidia(query, domains)
MCP->>Validator: Validate inputs
Validator-->>MCP: ✓ Valid (or ✗ Error)
loop For each domain (max 5 concurrent)
MCP->>RateLimit: Check rate limit
RateLimit-->>MCP: ✓ Proceed (wait 200ms)
MCP->>DDGS: Search "site:domain query"
DDGS-->>MCP: Search results
loop For each result
MCP->>Fetcher: Fetch URL context
Fetcher-->>MCP: Enhanced snippet
end
end
MCP->>MCP: Calculate relevance scores
MCP->>MCP: Sort & format JSON
MCP-->>LLM: Structured JSON response
System Architecture
flowchart TD
A[MCP Client<br/>Claude/LLMs] -->|JSON-RPC| B[MCP Server]
B --> C{Tool Router}
C -->|search_nvidia| D[Search Handler]
C -->|discover_nvidia_content| E[Discovery Handler]
D --> F[Security Layer]
F --> G[Input Validator]
F --> H[Rate Limiter]
F --> I[Concurrency Control]
G --> J[Domain Searcher]
J --> K[DuckDuckGo API]
J --> L[URL Context Fetcher]
L --> M[BeautifulSoup Parser]
J --> N[Relevance Scorer]
N --> O[Response Builder]
O --> P[Structured JSON]
P -->|CallToolResult| A
style F fill:#ffcccc
style P fill:#ccffcc
style K fill:#cce5ff
Key Components
- Input Validation: Validates query length, domain whitelist, and parameter limits
- Rate Limiting: Enforces 200ms minimum interval between DuckDuckGo API calls
- Concurrent Search: Searches up to 5 domains simultaneously with semaphore control
- Context Enhancement: Fetches actual page content and highlights relevant snippets
- Relevance Scoring: Calculates 0-100 scores based on keyword matches
- JSON Output: Returns structured data compatible with both AI agents and humans
Extending Domain Coverage
The list of searchable domains is configured in src/mcp_nvidia/server.py in the DEFAULT_DOMAINS constant. To add more NVIDIA domains:
- Edit
src/mcp_nvidia/server.py - Add new domain URLs to the
DEFAULT_DOMAINSlist - Reinstall the package
License
MIT License - see LICENSE file for details
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Support
For issues, questions, or contributions, please visit the GitHub repository.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mcp_nvidia-0.1.0.tar.gz.
File metadata
- Download URL: mcp_nvidia-0.1.0.tar.gz
- Upload date:
- Size: 19.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f6024fe6200e64e636bb34de4b80d111ad5fded10d2848307b17da4628b9bd38
|
|
| MD5 |
b691711d18cbdeb7be3bbdbed89e2bb3
|
|
| BLAKE2b-256 |
828b7dd497c80f0436f432c10149b0ba65f9c1748e0b8150c685262d259af57a
|
File details
Details for the file mcp_nvidia-0.1.0-py3-none-any.whl.
File metadata
- Download URL: mcp_nvidia-0.1.0-py3-none-any.whl
- Upload date:
- Size: 19.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.18
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6c67cdb0a1d0cc54390175140c64bfb06291dcd1dfdb70267643c8c0e4d33afe
|
|
| MD5 |
58e36a6062509b13970d05cf4b0204cf
|
|
| BLAKE2b-256 |
0b13a16e0910778fcf1ed50a644f861280927676c1a95ab9a0ca71820b26eb93
|