Skip to main content

MCP Server for interacting with Statistics Canada Web Data Services API

Project description

Statistics Canada MCP Server

📊 Statistics Canada API MCP Server

Python 3.10+ License: MIT MCP GitHub

📝 Description

This project implements a Model Context Protocol (MCP) server that provides tools for interacting with Statistics Canada (StatCan) data APIs. It allows LLMs or other MCP clients to access and retrieve Canadian statistical data in a structured way.

-- Please note LLM's may fabricate data, current MCP integration is limited to basic data discovery, and data exploration using a SQLite database, always verify original tables and keep track of changes you let LLM's (download, update, delete, checking your pc memory) --

The server is built using the FastMCP library and interacts with the StatCan Web Data Service via httpx.

📑 Table of Contents

💬 Claude Chat Examples

Dataset Query Example Demo Data Source
Canada's Greenhouse Gas Emissions (2018-2022) "Hey Claude! Can you please create a simple visualization for greenhouse emissions for Canada as a whole over the last 4 years?" View Demo StatCan Table
Canada's International Trade in Services "Hey Claude, can you create a quick analysis for international trade in services for the last 6 months. Create a visualization with key figures please!" View Demo StatCan Table
Ontario Building Construction Price Index "Hey Claude! Can you please generate a visualization for Ontario's Building Price index from Q4 2023 to Q4 2024. Thanks!" View Demo StatCan Table

Effective Querying Tips

To get the most accurate results from Claude when using this Statistics Canada MCP server:

  • Be Specific: Use precise, well-formed requests with exact details about the data you need
  • Provide Context: Clearly specify tables, vectors, time periods, and geographical areas
  • Avoid Typos: Double-check spelling of statistical terms and place names
  • Structured Questions: Break complex queries into clear, logical steps
  • Verify Results: Always cross-check important data against official Statistics Canada sources

⚠️ Warning: LLMs like Claude may occasionally create mock visualizations or fabricate data when unable to retrieve actual information. They might also generate responses with data not available in Statistics Canada to satisfy queries. Always verify results against official sources.

✨ Features

This server exposes StatCan API functionalities as MCP tools, including:

API Functionality

Cube Operations:

  • Listing all available data cubes/tables (full and lite versions)
  • Searching cubes by title
  • Retrieving detailed cube metadata
  • Getting data for the latest N periods based on ProductId and Coordinate
  • Getting series info based on ProductId and Coordinate
  • Getting changed series data based on ProductId and Coordinate
  • Listing cubes changed on a specific date
  • Providing download links for full cubes (CSV/SDMX) (Discouraged)

Vector Operations:

  • Retrieving series metadata by Vector ID
  • Getting data for the latest N periods by Vector ID
  • Getting data for multiple vectors by reference period range
  • Getting bulk data for multiple vectors by release date range
  • Getting changed series data by Vector ID
  • Listing series changed on a specific date

Database Functionality

The server automatically creates a SQLite database (temp_statcan_data.db) for:

  • Creating tables from API data
  • Inserting data into tables
  • Querying the database with SQL
  • Viewing table schemas and listing available tables

This allows for persistent storage of retrieved data and more complex data manipulation through SQL.

(Refer to the specific tool functions within src/api/ for detailed parameters and return types.)

🏗️ Project Structure

  • src/: Contains the main source code for the MCP server.
  • api/: Defines the MCP tools wrapping the StatCan API calls (cube_tools.py, vector_tools.py, metadata_tools.py).
  • db/: Handles database interactions, including connection, schema, and queries.
  • models/: Contains Pydantic models for API request/response validation and database representation.
  • util/: Utility functions (e.g., coordinate padding).
  • config.py: Configuration loading (e.g., database credentials, API base URL).
  • server.py: Main FastMCP server definition and tool registration.
  • __init__.py: Package initialization for src.
  • pyproject.toml: Project dependency and build configuration.
  • .env: (Assumed) Used for storing sensitive configuration like database credentials, loaded by src/config.py.

📥 Installation Guide for Beginners

If you're new to Python or programming in general, follow these simple steps to get started:

  1. Install Python (version 3.10 or higher):
  • Download from python.org
  • Make sure to check "Add Python to PATH" during installation
  1. Install uv (a fast Python package installer):
# Open your Terminal (Mac/Linux) or Command Prompt (Windows) and run:
curl -fsSL https://astral.sh/uv/install.sh | bash
# Or on Windows:
# curl.exe -fsSL https://astral.sh/uv/install.ps1 -o install.ps1; powershell -ExecutionPolicy Bypass -File install.ps1
  1. Install fastmcp:
uv pip install fastmcp httpx pydantic
  1. Download this project:
git clone https://github.com/Aryan-Jhaveri/mcp-statcan.git
cd mcp-statcan

Tip: If you encounter any "module not found" errors, install the missing package with:

uv pip install package_name

🔧 Setting Up Claude Desktop Configuration

To integrate with Claude Desktop:

  1. Manually edit the generated config in your claude_desktop_config.json:

Navigate to: Claude Desktop App → Settings (⌘ + ,) → Developer → Edit Config

{
"mcpServers": {
"StatCanAPI_DB_Server": {
"command": "uv",
"args": [
  "run",
  "--with", "fastmcp",
  "--with", "httpx", 
  "sh",
  "-c",
  "cd /path/to/mcp-statcan && python -m src.server"
]
}
}
}

Replace /path/to/mcp-statcan with the absolute path to your project directory. The manual edit is necessary to ensure the server runs with the correct working directory context for proper module resolution.

⚠️ Known Issues and Limitations

  • SSL Verification: Currently disabled for development. Should be enabled for production use.
  • Claude Behavior: May occasionally get stuck in loops or inefficiently make multiple REST calls when a bulk operation would be more efficient.
  • Data Validation: Always cross-check your data with official Statistics Canada sources.
  • Security Concerns: Query validation is basic; avoid using with untrusted input.
  • Performance: Some endpoints may timeout with large data requests.
  • API Rate Limits: The StatCan API may impose rate limits that affect usage during high-demand periods.

🚀 Usage Examples

API Operations

# Search for data tables about employment
tables = await search_cubes_by_title("employment")

# Get recent data points for a specific vector ID
data = await get_data_from_vectors_and_latest_n_periods(VectorLatestNInput(vectorId=12345, latestN=5))

# Get data for a specific range of periods
range_data = await get_data_from_vector_by_reference_period_range(
VectorPeriodRangeInput(vectorId=12345, startDate="2020-01-01", endDate="2020-12-31")
)

Database Operations

# Store API results in SQLite database
create_table_from_data(TableDataInput(table_name="employment_data", data=data))

# Query the database
result = query_database(QueryInput(sql_query="SELECT * FROM employment_data LIMIT 10"))

# List available tables
tables = list_tables()

# Get schema for a table
schema = get_table_schema(TableSchemaInput(table_name="employment_data"))

Made with ❤️❤️❤️ for Statistics Canada

GitHubReport BugStatistics Canada

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

statcan_mcp_server-0.1.0.tar.gz (134.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

statcan_mcp_server-0.1.0-py3-none-any.whl (29.5 kB view details)

Uploaded Python 3

File details

Details for the file statcan_mcp_server-0.1.0.tar.gz.

File metadata

  • Download URL: statcan_mcp_server-0.1.0.tar.gz
  • Upload date:
  • Size: 134.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for statcan_mcp_server-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8e3a5f29cac54e5af86d842ac161c5328e3940678fa6ee2b2a9746aa63f424cf
MD5 9fe9bf5242196aaafe8e03cac1016397
BLAKE2b-256 28ef63a781f2ac47df554ec2100f703de8e7d58e4845d5bdf29d06b054c52a23

See more details on using hashes here.

Provenance

The following attestation bundles were made for statcan_mcp_server-0.1.0.tar.gz:

Publisher: publish-mcp-registry.yml on Aryan-Jhaveri/mcp-statcan

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file statcan_mcp_server-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for statcan_mcp_server-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d953a4eeae4a7dd22b68adfddb541286bdd2f81a2fdc82fee4c241d5f846d05c
MD5 41410dfbf65880b4af1d6cf5e45d0f10
BLAKE2b-256 acf156e1a8827f27cd00e2a8c5849dc6633db9cf23617d2727e4afae825bd67c

See more details on using hashes here.

Provenance

The following attestation bundles were made for statcan_mcp_server-0.1.0-py3-none-any.whl:

Publisher: publish-mcp-registry.yml on Aryan-Jhaveri/mcp-statcan

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page