Skip to main content

MCP server for interacting with an Apache Iceberg catalog and data lake

Project description

MCP Iceberg Catalog

smithery badge

A MCP (Model Context Protocol) server implementation for interacting with Apache Iceberg. This server provides a SQL interface for querying and managing Iceberg tables through Claude desktop.

Claude Desktop as your Iceberg Data Lake Catalog

image

How to Install in Claude Desktop

Installing via Smithery

To install MCP Iceberg Catalog for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @ahodroj/mcp-iceberg-service --client claude
  1. Prerequisites

    • Python 3.10 or higher
    • UV package installer (recommended) or pip
    • Access to an Iceberg REST catalog and S3-compatible storage
  2. How to install in Claude Desktop Add the following configuration to claude_desktop_config.json:

{
  "mcpServers": {
    "iceberg": {
      "command": "uv",
      "args": [
        "--directory",
        "PATH_TO_/mcp-iceberg-service",
        "run",
        "mcp-server-iceberg"
      ],
      "env": {
        "ICEBERG_CATALOG_URI" : "http://localhost:8181",
        "ICEBERG_WAREHOUSE" : "YOUR ICEBERG WAREHOUSE NAME",
        "S3_ENDPOINT" : "OPTIONAL IF USING S3",
        "AWS_ACCESS_KEY_ID" : "YOUR S3 ACCESS KEY",
        "AWS_SECRET_ACCESS_KEY" : "YOUR S3 SECRET KEY"
      }
    }
  }
}

Design

Architecture

The MCP server is built on three main components:

  1. MCP Protocol Handler

    • Implements the Model Context Protocol for communication with Claude
    • Handles request/response cycles through stdio
    • Manages server lifecycle and initialization
  2. Query Processor

    • Parses SQL queries using sqlparse
    • Supports operations:
      • LIST TABLES
      • DESCRIBE TABLE
      • SELECT
      • INSERT
  3. Iceberg Integration

    • Uses pyiceberg for table operations
    • Integrates with PyArrow for efficient data handling
    • Manages catalog connections and table operations

PyIceberg Integration

The server utilizes PyIceberg in several ways:

  1. Catalog Management

    • Connects to REST catalogs
    • Manages table metadata
    • Handles namespace operations
  2. Data Operations

    • Converts between PyIceberg and PyArrow types
    • Handles data insertion through PyArrow tables
    • Manages table schemas and field types
  3. Query Execution

    • Translates SQL to PyIceberg operations
    • Handles data scanning and filtering
    • Manages result set conversion

Further Implementation Needed

  1. Query Operations

    • Implement UPDATE operations
    • Add DELETE support
    • Support for CREATE TABLE with schema definition
    • Add ALTER TABLE operations
    • Implement table partitioning support
  2. Data Types

    • Support for complex types (arrays, maps, structs)
    • Add timestamp with timezone handling
    • Support for decimal types
    • Add nested field support
  3. Performance Improvements

    • Implement batch inserts
    • Add query optimization
    • Support for parallel scans
    • Add caching layer for frequently accessed data
  4. Security Features

    • Add authentication mechanisms
    • Implement role-based access control
    • Add row-level security
    • Support for encrypted connections
  5. Monitoring and Management

    • Add metrics collection
    • Implement query logging
    • Add performance monitoring
    • Support for table maintenance operations
  6. Error Handling

    • Improve error messages
    • Add retry mechanisms for transient failures
    • Implement transaction support
    • Add data validation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_mcp_server_iceberg-0.1.0.tar.gz (395.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

iflow_mcp_mcp_server_iceberg-0.1.0-py3-none-any.whl (7.2 kB view details)

Uploaded Python 3

File details

Details for the file iflow_mcp_mcp_server_iceberg-0.1.0.tar.gz.

File metadata

File hashes

Hashes for iflow_mcp_mcp_server_iceberg-0.1.0.tar.gz
Algorithm Hash digest
SHA256 0d1a0f601a45f62e3ac969a3e96afcd1b489421021da23df17aa939bdb825a43
MD5 b4d5b2ac8204a67eebc7982a36bd67da
BLAKE2b-256 b885486c1d452af95a059f3672e52a39800f3e53ff609de5cb7d483cf07282a3

See more details on using hashes here.

File details

Details for the file iflow_mcp_mcp_server_iceberg-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for iflow_mcp_mcp_server_iceberg-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 98f2961cca1584fc3e78e25b930d41b1a8d3f68ef0ae7bbb2e6d3167557c0f2a
MD5 ff288300e6fc2fddadeb640e1230ff4b
BLAKE2b-256 aecb9010f4bfd9709de84f7c94b4057e3081398fb73f2ff114fbf130999bd167

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page