Skip to main content

Model Context Protocol server for Caltrain schedules

Project description

๐Ÿš‚ Caltrain MCP Server (Because You Love Waiting for Trains)

PyPI CI & Semantic release

Caltrain MCP Demo

A Model Context Protocol (MCP) server that promises to tell you exactly when the next Caltrain will arrive... and then be 10 minutes late anyway. Uses real GTFS data, so at least the disappointment is official!

Features (Or: "Why We Built This Thing")

  • ๐Ÿš† "Real-time" train schedules - Get the next departures between any two stations (actual arrival times may vary by +/- infinity)
  • ๐Ÿ“ Station lookup - Because apparently 31 stations is too many to memorize ๐Ÿคทโ€โ™€๏ธ
  • ๐Ÿ• Time-specific queries - Plan your commute with surgical precision, then watch it all fall apart
  • โœจ Smart search - Type 'sf' instead of the full name because we're all lazy here
  • ๐Ÿ“Š GTFS-based - We use the same data Caltrain does, so when things go wrong, we can blame them together

Setup (The Fun Part ๐Ÿ™„)

  1. Install dependencies (aka "More stuff to break"):

    # Install uv if you haven't already (because pip is apparently too mainstream now)
    curl -LsSf https://astral.sh/uv/install.sh | sh
    
    # Install dependencies using uv (fingers crossed it actually works)
    uv sync
    
  2. Get that sweet, sweet GTFS data: The server expects Caltrain GTFS data in the src/caltrain_mcp/data/caltrain-ca-us/ directory. Because apparently we can't just ask the trains nicely where they are.

    uv run python scripts/fetch_gtfs.py
    

    This magical script downloads files that contain:

    • stops.txt - All the places trains pretend to stop
    • trips.txt - Theoretical journeys through space and time
    • stop_times.txt - When trains are supposed to arrive (spoiler: they don't)
    • calendar.txt - Weekday vs weekend schedules (because trains also need work-life balance)

Usage (Good Luck!)

As an MCP Server (The Real Deal)

This server is designed to be used with MCP clients like Claude Desktop, not run directly by humans (because that would be too easy). Here's how to actually use it:

With Claude Desktop

Add this to your Claude Desktop MCP configuration file:

{
  "mcpServers": {
    "caltrain": {
      "command": "uvx",
      "args": ["caltrain-mcp"]
    }
  }
}

This will automatically install and run the latest version from PyPI.

Then restart Claude Desktop and you'll have access to Caltrain schedules directly in your conversations!

With Other MCP Clients

Any MCP-compatible client can use this server by starting it with:

uvx caltrain-mcp

The server communicates via stdin/stdout using the MCP protocol. It doesn't do anything exciting when run directly - it just sits there waiting for proper MCP messages.

Testing the Server (For Development)

You can test if this thing actually works by importing it directly:

from caltrain_mcp.server import next_trains, list_stations

# Test next trains functionality (prepare for disappointment)
result = await next_trains('San Jose Diridon', 'San Francisco')
print(result)  # Spoiler: there are no trains

# Test stations list (all 31 of them, because apparently that's manageable)
stations = await list_stations()
print(stations)

Available Tools (Your New Best Friends)

next_trains(origin, destination, when_iso=None)

Ask politely when the next train will show up. The server will consult its crystal ball (GTFS data) and give you times that are technically accurate.

Parameters:

  • origin (str): Where you are now (probably regretting your life choices)
  • destination (str): Where you want to be (probably anywhere but here)
  • when_iso (str, optional): When you want to travel (as if time has any meaning in public transit)

Examples:

# Next trains from current time (aka "right now would be nice")
next_trains('San Jose Diridon', 'San Francisco')

# Trains at a specific time (for the optimists who think schedules matter)
next_trains('Palo Alto', 'sf', '2025-05-23T06:00:00')

# Using abbreviations (because typing is hard)
next_trains('diridon', 'sf')

list_stations()

Get a list of all 31 Caltrain stations, because memorizing them is apparently too much to ask.

Returns: A formatted list that will make you realize just how many places this train supposedly goes.

Station Name Recognition (We're Not Mind Readers, But We Try)

The server supports various ways to be lazy about typing station names:

  • Full names: "San Jose Diridon Station" (for the perfectionists)
  • Short names: "San Francisco" (for the slightly less perfectionist)
  • Abbreviations: "sf" โ†’ "San Francisco" (for the truly lazy)
  • Partial matching: "diridon" matches "San Jose Diridon Station" (for when you can't be bothered)

Available Stations (All 31 Glorious Stops)

The server covers every single Caltrain station because we're completionists:

San Francisco to San Jose (The Main Event):

  • San Francisco, 22nd Street, Bayshore, South San Francisco, San Bruno, Millbrae, Broadway, Burlingame, San Mateo, Hayward Park, Hillsdale, Belmont, San Carlos, Redwood City, Menlo Park, Palo Alto, Stanford, California Avenue, San Antonio, Mountain View, Sunnyvale, Lawrence, Santa Clara, College Park, San Jose Diridon

San Jose to Gilroy (The "Why Does This Exist?" Extension):

  • Tamien, Capitol, Blossom Hill, Morgan Hill, San Martin, Gilroy

Sample Output (Prepare to Be Amazed)

๐Ÿš† Next Caltrain departures from San Jose Diridon Station to San Francisco Caltrain Station on Thursday, May 22, 2025:
โ€ข Train 153: 17:58:00 โ†’ 19:16:00 (to San Francisco)
โ€ข Train 527: 18:22:00 โ†’ 19:22:00 (to San Francisco)
โ€ข Train 155: 18:28:00 โ†’ 19:46:00 (to San Francisco)
โ€ข Train 429: 18:43:00 โ†’ 19:53:00 (to San Francisco)
โ€ข Train 157: 18:58:00 โ†’ 20:16:00 (to San Francisco)

Actual arrival times may vary. Side effects may include existential dread and a deep appreciation for remote work.

Technical Details (For the Nerds)

  • GTFS Processing: We automatically handle the relationship between stations and their platforms (because apparently trains are complicated)
  • Service Calendar: Respects weekday/weekend schedules (trains also need their beauty rest)
  • Data Types: Handles the chaos that is mixed integer/string formats in GTFS files
  • Time Parsing: Supports 24+ hour format for those mythical late-night services
  • Error Handling: Gracefully fails when you type "Narnia" as a station name

Project Structure (The Organized Chaos)

caltrain-mcp/
โ”œโ”€โ”€ .github/workflows/         # GitHub Actions (the CI/CD overlords)
โ”‚   โ”œโ”€โ”€ ci.yml                 # Main CI pipeline (linting, testing, the works)
โ”‚   โ””โ”€โ”€ update-gtfs.yml        # Automated GTFS data updates
โ”œโ”€โ”€ src/caltrain_mcp/          # Main package (because modern Python demands structure)
โ”‚   โ”œโ”€โ”€ data/caltrain-ca-us/   # GTFS data storage (where CSV files go to retire)
โ”‚   โ”œโ”€โ”€ __init__.py            # Package initialization (the ceremony of Python)
โ”‚   โ”œโ”€โ”€ __main__.py            # Entry point for python -m caltrain_mcp
โ”‚   โ”œโ”€โ”€ server.py              # MCP server implementation (where the magic happens)
โ”‚   โ””โ”€โ”€ gtfs.py                # GTFS data processing (aka "CSV wrestling")
โ”œโ”€โ”€ scripts/                   # Utility scripts (the supporting cast)
โ”‚   โ”œโ”€โ”€ __init__.py            # Makes scripts a proper Python package
โ”‚   โ”œโ”€โ”€ fetch_gtfs.py          # Downloads the latest disappointment data
โ”‚   โ””โ”€โ”€ lint.py                # Run all CI checks locally (before embarrassment)
โ”œโ”€โ”€ tests/                     # Test suite (because trust but verify)
โ”‚   โ”œโ”€โ”€ conftest.py            # Shared test fixtures (the common ground)
โ”‚   โ”œโ”€โ”€ test_gtfs.py           # GTFS functionality tests (8 tests of data wrangling)
โ”‚   โ”œโ”€โ”€ test_server.py         # Server functionality tests (4 tests of MCP protocol)
โ”‚   โ””โ”€โ”€ test_fetch_gtfs.py     # Data fetching tests (7 tests of download chaos)
โ”œโ”€โ”€ .pre-commit-config.yaml    # Pre-commit hooks configuration
โ”œโ”€โ”€ pyproject.toml             # Modern Python config (because setup.py is so 2020)
โ””โ”€โ”€ README.md                  # This literary masterpiece

Development & Testing (For When Things Inevitably Break)

Code Quality & CI/CD

This project uses modern Python tooling to keep the code clean and maintainable:

  • Ruff: Lightning-fast linting and formatting (because life's too short for slow tools)
  • MyPy: Type checking (because guessing types is for amateurs)
  • Pytest: Testing framework with coverage reporting

Release Process (Automated Awesomeness)

This project uses automated versioning and publishing:

  • Semantic Versioning: Version numbers are automatically determined from commit messages using Conventional Commits
  • Automatic Tagging: When you push to main, semantic-release creates version tags automatically
  • PyPI Publishing: Tagged releases are automatically built and published to PyPI via GitHub Actions
  • Trusted Publishing: Uses OIDC authentication with PyPI (no API tokens needed!)

Making a Release

Just commit using conventional commit format and push to main:

# For bug fixes (patch version bump: 1.0.0 โ†’ 1.0.1)
git commit -m "fix: correct station name lookup bug"

# For new features (minor version bump: 1.0.0 โ†’ 1.1.0)
git commit -m "feat: add support for weekend schedules"

# For breaking changes (major version bump: 1.0.0 โ†’ 2.0.0)
git commit -m "feat!: redesign API structure"
# or
git commit -m "feat: major API changes

BREAKING CHANGE: This changes the function signatures"

The semantic-release workflow will:

  1. Analyze your commit messages
  2. Determine the appropriate version bump
  3. Create a git tag (e.g., v1.2.3)
  4. Generate a changelog
  5. Trigger the release workflow to publish to PyPI

Local Testing

Test the build process locally before pushing:

# Build packages locally
uv run python -m build --sdist --wheel

# Validate packages
uv run twine check dist/*

# Test upload to Test PyPI (optional)
uv run twine upload --repository testpypi dist/*

GitHub Actions CI

Every PR and push to main triggers automatic checks:

  • โœ… Linting: Ruff checks for code quality issues
  • โœ… Formatting: Ensures consistent code style
  • โœ… Type Checking: MyPy validates type annotations
  • โœ… Tests: Full test suite with coverage reporting
  • โœ… Coverage: Test coverage reporting in CI logs

The CI will politely reject your PR if any checks fail, because standards matter.

MCP Integration (For the AI Overlords)

This server implements the Model Context Protocol (MCP), which means it's designed to work seamlessly with AI assistants and other MCP clients. Once configured:

  • Claude Desktop: Ask Claude about train schedules directly in conversation
  • Other MCP Clients: Any MCP-compatible tool can access Caltrain data
  • Real-time Integration: Your AI can check schedules, suggest routes, and help plan trips
  • Natural Language: No need to remember station names or command syntax

The server exposes two main tools:

  • next_trains - Get upcoming departures between stations
  • list_stations - Browse all available Caltrain stations

So your AI assistant can now disappoint you about train schedules just like a real human would! The future is truly here.

License (The Legal Stuff)

This project uses official Caltrain GTFS data. If something goes wrong, blame them, not us. We're just the messenger.


Built with โค๏ธ and a concerning amount of caffeine in the Bay Area, where public transit is both a necessity and a source of eternal suffering.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

caltrain_mcp-0.8.6.tar.gz (108.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

caltrain_mcp-0.8.6-py3-none-any.whl (111.3 kB view details)

Uploaded Python 3

File details

Details for the file caltrain_mcp-0.8.6.tar.gz.

File metadata

  • Download URL: caltrain_mcp-0.8.6.tar.gz
  • Upload date:
  • Size: 108.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for caltrain_mcp-0.8.6.tar.gz
Algorithm Hash digest
SHA256 e476bd909f2bb89daeebfaa078f1745bcdcab006ea26c99246c40ec349fa7f0f
MD5 74cfad8126c2183bf06592fa67ce5be7
BLAKE2b-256 b8fd794ed24b68bff46623ff644d352b0e8d95112fe775f0c73a47560f33794e

See more details on using hashes here.

Provenance

The following attestation bundles were made for caltrain_mcp-0.8.6.tar.gz:

Publisher: ci.yml on davidyen1124/caltrain-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file caltrain_mcp-0.8.6-py3-none-any.whl.

File metadata

  • Download URL: caltrain_mcp-0.8.6-py3-none-any.whl
  • Upload date:
  • Size: 111.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for caltrain_mcp-0.8.6-py3-none-any.whl
Algorithm Hash digest
SHA256 f6a1958cb748633b37e67da67c6a2d49691b9159e0fb75b6f0be305997747a07
MD5 82a0939044eb41b885e4e89b51399247
BLAKE2b-256 9908350e5931466d473e8988ab87c383686575f0dba5c2b960ab0477b6f59f22

See more details on using hashes here.

Provenance

The following attestation bundles were made for caltrain_mcp-0.8.6-py3-none-any.whl:

Publisher: ci.yml on davidyen1124/caltrain-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page