MCP server for accessing data.gov.hk open data

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

chankwongyintony

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Project description

mcp-open-data-hk

This is an MCP (Model Context Protocol) server that provides access to data from data.gov.hk, the official open data portal of the Hong Kong government.

Installation

You can install the package directly from PyPI:

pip install mcp-open-data-hk

After installation, configure your MCP-compatible client (like Cursor, Claude Code, or Claude Desktop) by adding the following to your settings.json:

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["-m", "mcp_open_data_hk"]
    }
  }
}

Features

The server provides the following tools to interact with the data.gov.hk API:

list_datasets - Get a list of dataset IDs
get_dataset_details - Get detailed information about a specific dataset
list_categories - Get a list of data categories
get_category_details - Get detailed information about a specific category
search_datasets - Search for datasets by query term with advanced options
search_datasets_with_facets - Search datasets and return faceted results
get_datasets_by_format - Get datasets by file format
get_supported_formats - Get list of supported file formats
For local development:
```
python src/server.py
```
Using FastMCP CLI:
```
fastmcp run src/server.py:mcp
```

Tools

list_datasets

Get a list of dataset IDs from data.gov.hk

Parameters:

limit (optional): Maximum number of datasets to return (default: 1000)
offset (optional): Offset of the first dataset to return
language (optional): Language code (en, tc, sc) - defaults to "en"

get_dataset_details

Get detailed information about a specific dataset

Parameters:

dataset_id: The ID or name of the dataset to retrieve
language (optional): Language code (en, tc, sc) - defaults to "en"
include_tracking (optional): Add tracking information to dataset and resources - defaults to False

list_categories

Get a list of data categories (groups)

Parameters:

order_by (optional): Field to sort by ('name' or 'packages') - deprecated, use sort instead
sort (optional): Sorting of results ('name asc', 'package_count desc', etc.) - defaults to "title asc"
limit (optional): Maximum number of categories to return
offset (optional): Offset for pagination
all_fields (optional): Return full group dictionaries instead of just names - defaults to False
language (optional): Language code (en, tc, sc) - defaults to "en"

get_category_details

Get detailed information about a specific category (group)

Parameters:

category_id: The ID or name of the category to retrieve
include_datasets (optional): Include a truncated list of the category's datasets - defaults to False
include_dataset_count (optional): Include the full package count - defaults to True
include_extras (optional): Include the category's extra fields - defaults to True
include_users (optional): Include the category's users - defaults to True
include_groups (optional): Include the category's sub groups - defaults to True
include_tags (optional): Include the category's tags - defaults to True
include_followers (optional): Include the category's number of followers - defaults to True
language (optional): Language code (en, tc, sc) - defaults to "en"

search_datasets

Search for datasets by query term using the package_search API.

This function searches across dataset titles, descriptions, and other metadata to find datasets matching the query term. It supports advanced Solr search parameters.

Parameters:

query (optional): The solr query string (e.g., "transport", "weather", ":" for all) - defaults to ":"
limit (optional): Maximum number of datasets to return (default: 10, max: 1000)
offset (optional): Offset for pagination - defaults to 0
language (optional): Language code (en, tc, sc) - defaults to "en"

Returns: A dictionary containing:

count: Total number of matching datasets
results: List of matching datasets (up to limit)
search_facets: Faceted information about the results
has_more: Boolean indicating if there are more results available

search_datasets_with_facets

Search for datasets and return faceted results for better data exploration.

This function is useful for exploring what types of data are available by showing counts of datasets grouped by tags, organizations, or other facets.

Parameters:

query (optional): The solr query string - defaults to ":"
language (optional): Language code (en, tc, sc) - defaults to "en"

Returns: A dictionary containing:

count: Total number of matching datasets
search_facets: Faceted information about the results
sample_results: First 3 matching datasets

get_datasets_by_format

Get datasets that have resources in a specific file format.

Parameters:

file_format: The file format to filter by (e.g., "CSV", "JSON", "GeoJSON")
limit (optional): Maximum number of datasets to return - defaults to 10
language (optional): Language code (en, tc, sc) - defaults to "en"

Returns: A dictionary containing:

count: Total number of matching datasets
results: List of matching datasets

get_supported_formats

Get a list of file formats supported by data.gov.hk

Returns: A list of supported file formats

Local Testing

Run test scripts:

python tests/test_client.py
python tests/debug_search.py
python tests/comprehensive_test.py

Run server directly:

python src/server.py

Run with FastMCP CLI:

fastmcp run src/server.py:mcp

Run unit tests:

pytest tests/

Adding to AI Assistants

For Cursor:

Using FastMCP CLI (recommended):

fastmcp install cursor src/server.py --server-name "mcp-open-data-hk"

Manual Configuration via UI:

Open Cursor Settings
Go to MCP Servers
Click "Add Server"
Enter:
- Name: mcp-open-data-hk
- Command: python
- Arguments: ["-m", "mcp_open_data_hk"]
- Working Directory: /full/path/to/mcp-open-data-hk

Manual Configuration via settings.json: Edit your Cursor settings.json file and add:

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["-m", "mcp_open_data_hk"],
      "cwd": "/full/path/to/mcp-open-data-hk"
    }
  }
}

For Claude Code:

Using FastMCP CLI (recommended):

fastmcp install claude-code src/server.py --server-name "mcp-open-data-hk"

Manual Configuration via UI:

Open Claude Code Settings
Go to MCP Servers
Click "Add Server"
Enter:
- Name: mcp-open-data-hk
- Command: python
- Arguments: ["-m", "mcp_open_data_hk"]
- Working Directory: /full/path/to/mcp-open-data-hk

Manual Configuration via settings.json: Edit your Claude Code settings.json file and add:

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["-m", "mcp_open_data_hk"],
      "cwd": "/full/path/to/mcp-open-data-hk"
    }
  }
}

For Claude Desktop:

Using FastMCP CLI (recommended):

fastmcp install claude-desktop src/server.py --server-name "mcp-open-data-hk"

Understanding Path Configuration

When installed as a package, the server can be referenced by its module name rather than file path. This is more convenient for users as they don't need to specify full file paths.

Installed Package:

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["-m", "mcp_open_data_hk"]
    }
  }
}

Local Development (file path approach):

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["/full/path/to/mcp-open-data-hk/src/server.py"],
      "cwd": "/full/path/to/mcp-open-data-hk"
    }
  }
}

The package installation approach is recommended for end users, while the file path approach is useful for local development and testing.

Alternative Approach: Packaging for Distribution

If you want to make your server work like the published examples, you can package it for distribution:

Package as a Python package and publish it to PyPI

Users would then install and run it like:

pip install mcp-open-data-hk
mcp-open-data-hk

However, for local development and testing, you can run it directly as shown in the Installation section.

Using Environment Variables

Create a .env file in the project root:

DEBUG=true
TIMEOUT=30

Then install with:

# For Cursor
fastmcp install cursor src/server.py --server-name "mcp-open-data-hk" --env-file .env

# For Claude Code
fastmcp install claude-code src/server.py --server-name "mcp-open-data-hk" --env-file .env

Or add to settings.json:

{
  "mcpServers": {
    "mcp-open-data-hk": {
      "command": "python",
      "args": ["/full/path/to/data_gov_hk_mcp/src/server.py"],
      "cwd": "/full/path/to/data_gov_hk_mcp",
      "env": {
        "DEBUG": "true",
        "TIMEOUT": "30"
      }
    }
  }
}

Example Queries

Once installed, try these queries with your AI assistant:

"List some datasets from the Hong Kong government data portal"
"Find datasets related to transportation in Hong Kong"
"What categories of data are available on data.gov.hk?"
"Get details about the flight information dataset"
"Search for datasets about weather in Hong Kong"
"What file formats are supported by data.gov.hk?"
"Find CSV datasets about population"
"Show me the most common tags in transport datasets"

The AI will automatically use the appropriate tools from your MCP server to fetch the requested information.

Troubleshooting

Common Issues

Module not found errors: Make sure you've installed the dependencies with pip install -r requirements.txt
Path issues: Ensure the paths in your IDE configuration are correct absolute paths
Permission errors: On Unix systems, make sure the scripts have execute permissions:
```
chmod +x src/server.py
```
FastMCP not found: Install it with:
```
pip install fastmcp
```

Testing the Connection

If you're having issues, you can test the connection manually:

Run the server in one terminal:
```
python src/server.py
```
In another terminal, run the test client:
```
python tests/test_client.py
```

If this works, the issue is likely in the IDE configuration.

Extending the Server

You can extend the server by adding more tools in src/server.py. Follow the existing patterns:

Add a new function decorated with @mcp.tool
Provide a clear docstring explaining the function and parameters
Implement the functionality
Test with the client

The server automatically exposes all functions decorated with @mcp.tool to MCP clients.

GitHub Workflows

This project includes GitHub Actions workflows for CI/CD:

CI Workflow: Runs tests across multiple Python versions (3.10-3.12) on every push/PR to main branch
Publish Workflow: Automatically builds and publishes to TestPyPI on every push to main, and to PyPI on version tags (v*..)
Code Quality Workflow: Checks code formatting and linting on every push/PR
Release Workflow: Automatically creates GitHub releases when tags are pushed

Setup for Publishing (Trusted Publishing)

This project uses PyPI's Trusted Publishing which is more secure than using API tokens. To set it up:

Go to https://pypi.org/manage/account/publishing/ and add a new pending publisher with:
- Project name: mcp-open-data-hk
- Owner: Your GitHub username or organization
- Repository name: mcp-open-data-hk
- Workflow name: publish.yml
- Environment name: pypi
Go to https://test.pypi.org/manage/account/publishing/ and add a new pending publisher with the same information but use testpypi as the environment name.
In your GitHub repository, go to "Settings" > "Environments" and create two environments:
- pypi - Set "Required reviewers" to your username for security
- testpypi - No additional configuration needed

With Trusted Publishing, no API tokens need to be created or stored as secrets.

GitHub Environments

For the Trusted Publishing to work correctly, you need to create two environments in your GitHub repository settings:

pypi - This environment requires manual approval for security when publishing to PyPI
testpypi - This environment doesn't require manual approval and will automatically publish to TestPyPI

To create these environments:

Go to your repository's "Settings" tab
Click on "Environments" in the left sidebar
Click "New environment"
Create the pypi environment and enable "Required reviewers" with your username
Create the testpypi environment with no additional settings

Releasing New Versions

To release a new version:

Update the version number in pyproject.toml
Commit the changes

Create and push a new tag:

git tag -a v1.0.0 -m "Release version 1.0.0"
git push origin v1.0.0

Or use the provided release script:

./release.sh 1.0.0

This will automatically trigger the publish workflow to build and publish the package to TestPyPI and PyPI (for tagged releases), and create a GitHub release.

Contributing

Contributions are welcome! Please read our Contributing Guide and Code of Conduct for details on how to contribute to this project.

Project Structure

mcp-open-data-hk/
├── src/
│   └── server.py          # Main MCP server implementation
├── tests/
│   ├── test_client.py     # Client test script
│   ├── debug_search.py    # Search functionality test
│   ├── comprehensive_test.py # Comprehensive functionality test
│   └── test_data_gov_hk.py # Unit tests
├── requirements.txt       # Python dependencies
├── pyproject.toml         # Project configuration
├── README.md             # This file
├── run_examples.sh       # Example commands script
├── install.sh            # Installation helper script
├── release.sh            # Release helper script
└── .gitignore            # Git ignore file

License

This project is licensed under the MIT License.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

chankwongyintony

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Release history Release notifications | RSS feed

0.2.3

Aug 14, 2025

0.2.2

Aug 14, 2025

This version

0.1.3

Aug 13, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_open_data_hk-0.1.3.tar.gz (9.0 kB view details)

Uploaded Aug 13, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcp_open_data_hk-0.1.3-py3-none-any.whl (6.9 kB view details)

Uploaded Aug 13, 2025 Python 3

File details

Details for the file mcp_open_data_hk-0.1.3.tar.gz.

File metadata

Download URL: mcp_open_data_hk-0.1.3.tar.gz
Upload date: Aug 13, 2025
Size: 9.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for mcp_open_data_hk-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`d0ce34d527dbc9c9710879c6cff079c746663f3689cf8bfd0b65c17e7c45b00a`
MD5	`4ded7a1287addc47c53c693448ed0c3e`
BLAKE2b-256	`9c95134dfd4760bcc60e8b7dcb245c67df7cca24854b7e67ba5484e74f0dcd53`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_open_data_hk-0.1.3.tar.gz:

Publisher: publish.yml on mcp-open-data-hk/mcp-open-data-hk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcp_open_data_hk-0.1.3.tar.gz
- Subject digest: d0ce34d527dbc9c9710879c6cff079c746663f3689cf8bfd0b65c17e7c45b00a
- Sigstore transparency entry: 390645646
- Sigstore integration time: Aug 13, 2025
Source repository:
- Permalink: mcp-open-data-hk/mcp-open-data-hk@dbb17052b2e7d2140b318f3d58190b940e5648d1
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/mcp-open-data-hk
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@dbb17052b2e7d2140b318f3d58190b940e5648d1
- Trigger Event: push

File details

Details for the file mcp_open_data_hk-0.1.3-py3-none-any.whl.

File metadata

Download URL: mcp_open_data_hk-0.1.3-py3-none-any.whl
Upload date: Aug 13, 2025
Size: 6.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for mcp_open_data_hk-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9e1e7237416c602567c618e3bbc2bb3e0640b5ef073ea371d3f7a5f830e04f44`
MD5	`bcb6a4273d441c4d78fb1fc4de3d4dfd`
BLAKE2b-256	`61e72d575b6d27838d187bcbe0b89c8770ed34a47f2083de3e476d61bba618de`

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_open_data_hk-0.1.3-py3-none-any.whl:

Publisher: publish.yml on mcp-open-data-hk/mcp-open-data-hk

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: mcp_open_data_hk-0.1.3-py3-none-any.whl
- Subject digest: 9e1e7237416c602567c618e3bbc2bb3e0640b5ef073ea371d3f7a5f830e04f44
- Sigstore transparency entry: 390645657
- Sigstore integration time: Aug 13, 2025
Source repository:
- Permalink: mcp-open-data-hk/mcp-open-data-hk@dbb17052b2e7d2140b318f3d58190b940e5648d1
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/mcp-open-data-hk
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@dbb17052b2e7d2140b318f3d58190b940e5648d1
- Trigger Event: push

mcp-open-data-hk 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

mcp-open-data-hk

Installation

Features

Tools

list_datasets

get_dataset_details

list_categories

get_category_details

search_datasets

search_datasets_with_facets

get_datasets_by_format

get_supported_formats

Local Testing

Run test scripts:

Run server directly:

Run with FastMCP CLI:

Run unit tests:

Adding to AI Assistants

For Cursor:

For Claude Code:

For Claude Desktop:

Understanding Path Configuration

Installed Package:

Local Development (file path approach):

Alternative Approach: Packaging for Distribution

Using Environment Variables

Example Queries

Troubleshooting

Common Issues

Testing the Connection

Extending the Server

GitHub Workflows

Setup for Publishing (Trusted Publishing)

GitHub Environments

Releasing New Versions

Contributing

Project Structure

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance