Skip to main content

Python SDK for Gulp

Project description

Gulp Python SDK

Async Python SDK for the Gulp document analysis and collaboration platform.

Features

  • Fully Async — Built on httpx and asyncio for high-performance, non-blocking I/O
  • Complete API Coverage — All major REST endpoints (operations, documents, ingestion, queries, users, collaboration)
  • WebSocket Support — Real-time ingestion progress, query results, and collaborative updates
  • Type-Safe — Full type hints, Pydantic models, and static typing support
  • Error Handling — Comprehensive exception hierarchy with HTTP status codes and response data
  • Pagination — Async iterators for large result sets
  • Retry Logic — Automatic exponential backoff on transient failures

Installation

pip install gulp-sdk

Quick Start

If you prefer a dedicated guide, see docs/quickstart.md.

import asyncio
from gulp_sdk import GulpClient

async def main():
    # Connect to Gulp server
    async with GulpClient("http://localhost:8080") as client:
        # Login
        session = await client.auth.login("user@example.com", "password")
        print(f"Logged in: {session.user_id}")

        # Create operation
        op = await client.operations.create(
            name="My Investigation",
            description="Analyze event logs"
        )
        print(f"Created operation: {op.id}")

        # Get current user
        user = await client.users.get_current()
        print(f"Current user: {user.display_name}")

asyncio.run(main())

API Reference

Documentation site

This repository includes official docs for local or remote hosting:

  • Local preview: mkdocs serve
  • Build HTML: mkdocs build

Run docs locally

pip install -r requirements-docs.txt
mkdocs serve

Documentation pages

  • Quick start: docs/quickstart.md
  • API reference: docs/api_reference.md
  • Examples: docs/examples/ (scripts)

Authentication

# Login with credentials
session = await client.auth.login("user", "password")

# Token is automatically stored
# Logout
await client.auth.logout()

Operations

# Create operation
op = await client.operations.create("Name", "Description")

# Get operation
op = await client.operations.get(op.id)

Documents

# Get document
doc = await client.documents.get(operation_id, document_id)

# Query documents (with async iteration)
async for doc in client.documents.list(operation_id):
    print(doc.content[:100])

Ingestion

# Ingest file
job = await client.ingest.file(operation_id, plugin="json", file_path="/path/to/file.json")

# Ingest raw data
job = await client.ingest.raw(operation_id, plugin="json", data={"key": "value"})

# Monitor with WebSocket
async for progress in client.ingest.stream(operation_id, req_id=job.req_id):
    print(f"Progress: {progress.percent}%")

Collaboration

# Add note
note = await client.collab.create_note(operation_id, document_id, "Important finding")

# Create link between documents
link = await client.collab.create_link(doc_id_from, doc_id_to, "related_to")

WebSocket Real-Time Updates

Auto-Managed Mode

async with GulpClient("http://localhost:8080", ws_auto_connect=True) as client:
    # WebSocket automatically connected
    
    # Subscribe to document updates
    await client.websocket.subscribe(operation_id)
    
    # Receive messages
    async for message in client.websocket:
        print(f"Update: {message.type}{message.data}")

Manual Mode

async with GulpClient("http://localhost:8080") as client:
    async with client.websocket() as ws:
        # Authenticate and subscribe
        await ws.subscribe(operation_id, req_id="ingest-123")
        
        # Receive real-time updates
        async for message in ws:
            if message.type == "WSDATA_INGEST_RAW_PROGRESS":
                print(f"Ingestion: {message.data.percent}%")
            elif message.type == "WSDATA_QUERY_DONE":
                print(f"Query complete: {len(message.data.documents)} results")

Request status: websocket vs polling

For async operations, realtime websocket monitoring is recommended; polling is a fallback.

  • WebSocket pattern: use wait_for_request_stats(client, req_id, timeout, ws_callback=...).
  • Polling: client.plugins.request_get(req_id) in a loop.

See docs/api_reference.md for details and examples.

For advanced websocket note/QUERY_DONE tracking, consult tests/integration/test_stress.py.

Error Handling

from gulp_sdk import (
    GulpClient,
    AuthenticationError,
    PermissionError,
    NotFoundError,
    ValidationError,
    GulpSDKError,
)

async with GulpClient("http://localhost:8080") as client:
    try:
        await client.auth.login("user", "pass")
    except AuthenticationError as e:
        print(f"Login failed: {e.message}")
    except GulpSDKError as e:
        print(f"SDK error: {e.message} (status: {e.status_code})")

Configuration

Environment variables are used by tests and example scripts (fixtures) to parameterize connection settings. The SDK core methods in src/gulp_sdk accept explicit arguments and do not read env vars directly.

GULP_BASE_URL=http://localhost:8080        # Server URL (default: localhost:8080)
GULP_TEST_USER=admin                       # Default test user for integration tests
GULP_TEST_PASSWORD=admin                   # Default test password for integration tests
GULP_TEST_TOKEN=                           # Optional token for test auth
GULP_REQUEST_TIMEOUT=30                    # HTTP timeout in seconds (default: 30)
GULP_WS_TIMEOUT=300                        # WebSocket timeout (default: 300)
GULP_LOG_LEVEL=INFO                        # Logging level (default: INFO)

Or programmatically:

client = GulpClient(
    base_url="http://localhost:8080",
    timeout=30.0,
    ws_auto_connect=True,
)

Examples

See docs/examples/ for complete working examples:

For additional workflows, consult the integration tests in tests/integration/ and the main gulp docs for plugin-specific ingestion and query patterns.

Testing

Run tests:

# Unit tests (no dependencies)
pytest -v -s -x tests/unit

# minimal integration tests, requires live Gulp server on localhost:8080
pytest -v -s -x tests/integration  

to run full integration test suite, look at gulp gulp testing documentation

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gulp_sdk-1.0.6.tar.gz (76.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gulp_sdk-1.0.6-py3-none-any.whl (51.5 kB view details)

Uploaded Python 3

File details

Details for the file gulp_sdk-1.0.6.tar.gz.

File metadata

  • Download URL: gulp_sdk-1.0.6.tar.gz
  • Upload date:
  • Size: 76.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for gulp_sdk-1.0.6.tar.gz
Algorithm Hash digest
SHA256 fd51d115a63323518e4ac4d37790d0aec99e4939c40237cae30da2ee89f89aad
MD5 6866533720c2ea8eacb758a867ae206f
BLAKE2b-256 3d1d764a940cee2d711c6337646f0fbe0723e6465e7e100f66f0479848cbfe76

See more details on using hashes here.

File details

Details for the file gulp_sdk-1.0.6-py3-none-any.whl.

File metadata

  • Download URL: gulp_sdk-1.0.6-py3-none-any.whl
  • Upload date:
  • Size: 51.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for gulp_sdk-1.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 831226cf3b7e9c168cf48e4f73319119edabc49014292fb33c08c382b2ddfb2f
MD5 130cb73688afe120032caa9fdf4c34a6
BLAKE2b-256 19e41ec7b21202a4a0e758624d05a8bd76e242db0dcd9fec3ab34b8d3f73a385

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page