Gulp Python SDK

Async Python SDK for the Gulp document analysis and collaboration platform.

Features

  • Fully Async — Built on httpx and asyncio for high-performance, non-blocking I/O
  • Complete API Coverage — All major REST endpoints (operations, documents, ingestion, queries, users, collaboration)
  • WebSocket Support — Real-time ingestion progress, query results, and collaborative updates
  • Type-Safe — Full type hints, Pydantic models, and static typing support
  • Error Handling — Comprehensive exception hierarchy with HTTP status codes and response data
  • Pagination — Async iterators for large result sets
  • Retry Logic — Automatic exponential backoff on transient failures

Installation

pip install gulp-sdk

Quick Start

If you prefer a dedicated guide, see docs/quickstart.md.

import asyncio
from gulp_sdk import GulpClient

async def main():
    # Connect to Gulp server
    async with GulpClient("http://localhost:8080") as client:
        # Login
        session = await client.auth.login("user@example.com", "password")
        print(f"Logged in: {session.user_id}")

        # Create operation
        op = await client.operations.create(
            name="My Investigation",
            description="Analyze event logs"
        )
        print(f"Created operation: {op.id}")

        # Get current user
        user = await client.users.get_current()
        print(f"Current user: {user.display_name}")

asyncio.run(main())

API Reference

Documentation site

This repository includes official docs for local or remote hosting:

  • Local preview: mkdocs serve
  • Build HTML: mkdocs build

Run docs locally

pip install -r requirements-docs.txt
mkdocs serve

Documentation pages

  • Quick start: docs/quickstart.md
  • API reference: docs/api_reference.md
  • Examples: docs/examples/ (scripts)

Authentication

# Login with credentials
session = await client.auth.login("user", "password")

# The session token is stored on the client automatically

# Logout
await client.auth.logout()

Operations

# Create operation
op = await client.operations.create("Name", "Description")

# Get operation
op = await client.operations.get(op.id)

Documents

# Get document
doc = await client.documents.get(operation_id, document_id)

# Query documents (with async iteration)
async for doc in client.documents.list(operation_id):
    print(doc.content[:100])
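
Because the listing is consumed with `async for`, a large result set can be capped without loading everything first. The helper below is a minimal sketch and not part of the SDK: `take` and the stand-in generator `fake_documents` are hypothetical names used only for illustration, and any async iterator can be passed in its place.

```python
import asyncio
from typing import AsyncIterator, List, TypeVar

T = TypeVar("T")


async def take(items: AsyncIterator[T], limit: int) -> List[T]:
    """Collect at most `limit` items from an async iterator, then stop."""
    collected: List[T] = []
    if limit <= 0:
        return collected
    async for item in items:
        collected.append(item)
        if len(collected) >= limit:
            break  # stop pulling further items once the cap is reached
    return collected


async def fake_documents(count: int) -> AsyncIterator[dict]:
    """Hypothetical stand-in for client.documents.list(operation_id)."""
    for i in range(count):
        await asyncio.sleep(0)  # yield control, as a network call would
        yield {"id": i, "content": f"document {i}"}


if __name__ == "__main__":
    first = asyncio.run(take(fake_documents(1000), limit=3))
    print([doc["id"] for doc in first])
```

With a real client, the call would presumably look like `await take(client.documents.list(operation_id), limit=50)`, assuming the SDK iterator fetches pages lazily.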

Ingestion

# Ingest file
job = await client.ingest.file(operation_id, plugin="json", file_path="/path/to/file.json")

# Ingest raw data
job = await client.ingest.raw(operation_id, plugin="json", data={"key": "value"})

# Monitor with WebSocket
async for progress in client.ingest.stream(operation_id, req_id=job.req_id):
    print(f"Progress: {progress.percent}%")

Collaboration

# Add note
note = await client.collab.create_note(operation_id, document_id, "Important finding")

# Create link between documents
link = await client.collab.create_link(doc_id_from, doc_id_to, "related_to")

WebSocket Real-Time Updates

Auto-Managed Mode

async with GulpClient("http://localhost:8080", ws_auto_connect=True) as client:
    # WebSocket automatically connected
    
    # Subscribe to document updates
    await client.websocket.subscribe(operation_id)
    
    # Receive messages
    async for message in client.websocket:
        print(f"Update: {message.type} {message.data}")

Manual Mode

async with GulpClient("http://localhost:8080") as client:
    async with client.websocket() as ws:
        # Authenticate and subscribe
        await ws.subscribe(operation_id, req_id="ingest-123")
        
        # Receive real-time updates
        async for message in ws:
            if message.type == "WSDATA_INGEST_RAW_PROGRESS":
                print(f"Ingestion: {message.data.percent}%")
            elif message.type == "WSDATA_QUERY_DONE":
                print(f"Query complete: {len(message.data.documents)} results")
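
As the number of message types grows, the `if`/`elif` chain above can be replaced by a small dispatch table. The following is a sketch under stated assumptions: `Message` is a hypothetical stand-in for whatever the websocket yields (only `type` and `data` attributes are assumed, as in the snippet above), and `Dispatcher` is an illustrative helper, not an SDK class.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List


@dataclass
class Message:
    """Hypothetical stand-in for the objects yielded by the websocket."""
    type: str
    data: Any


Handler = Callable[[Any], None]


class Dispatcher:
    """Route messages to handlers registered per message type."""

    def __init__(self) -> None:
        self._handlers: Dict[str, List[Handler]] = {}

    def on(self, message_type: str, handler: Handler) -> None:
        """Register `handler` for messages whose type equals `message_type`."""
        self._handlers.setdefault(message_type, []).append(handler)

    def dispatch(self, message: Message) -> bool:
        """Call every handler for message.type; return False if none exist."""
        handlers = self._handlers.get(message.type, [])
        for handler in handlers:
            handler(message.data)
        return bool(handlers)


if __name__ == "__main__":
    dispatcher = Dispatcher()
    dispatcher.on("WSDATA_INGEST_RAW_PROGRESS",
                  lambda d: print(f"Ingestion: {d['percent']}%"))
    dispatcher.on("WSDATA_QUERY_DONE",
                  lambda d: print(f"Query complete: {len(d['documents'])} results"))
    # Demo payloads are plain dicts; the real message.data shape may differ.
    dispatcher.dispatch(Message("WSDATA_INGEST_RAW_PROGRESS", {"percent": 50}))
    dispatcher.dispatch(Message("WSDATA_QUERY_DONE", {"documents": [1, 2]}))
```

Inside the receive loop, each incoming message would then be passed to `dispatcher.dispatch(message)`; the boolean return value makes it easy to log unhandled types.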

Request status: websocket vs polling

For asynchronous operations, real-time WebSocket monitoring is recommended; polling is a fallback.

  • WebSocket pattern: use wait_for_request_stats(client, req_id, timeout, ws_callback=...).
  • Polling: client.plugins.request_get(req_id) in a loop.

See docs/api_reference.md for details and examples.
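
The polling fallback can be sketched as a generic loop with an interval and a deadline. This is an illustration only: `poll_until`, `fetch`, and `is_done` are hypothetical names, and the `{"status": ...}` payloads in the demo are invented because the response shape of `client.plugins.request_get` is not described here.

```python
import asyncio
import time
from typing import Any, Awaitable, Callable


async def poll_until(
    fetch: Callable[[], Awaitable[Any]],
    is_done: Callable[[Any], bool],
    interval: float = 1.0,
    timeout: float = 30.0,
) -> Any:
    """Call `fetch` every `interval` seconds until `is_done(result)` is true.

    Raises TimeoutError if the next attempt would fall past the deadline.
    """
    deadline = time.monotonic() + timeout
    while True:
        result = await fetch()
        if is_done(result):
            return result
        if time.monotonic() + interval > deadline:
            raise TimeoutError(f"request not finished after {timeout} seconds")
        await asyncio.sleep(interval)


async def _demo() -> None:
    # Hypothetical status payloads; the real response shape may differ.
    states = iter([{"status": "ongoing"}, {"status": "ongoing"}, {"status": "done"}])

    async def fake_fetch() -> dict:
        return next(states)

    final = await poll_until(fake_fetch, lambda s: s["status"] == "done", interval=0.01)
    print(final)


if __name__ == "__main__":
    asyncio.run(_demo())
```

With the SDK, `fetch` would presumably be `lambda: client.plugins.request_get(req_id)`, with `is_done` written against whatever fields that call actually returns.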

For advanced websocket note/QUERY_DONE tracking, consult tests/integration/test_stress.py.

Error Handling

from gulp_sdk import (
    GulpClient,
    AuthenticationError,
    PermissionError,
    NotFoundError,
    ValidationError,
    GulpSDKError,
)

async with GulpClient("http://localhost:8080") as client:
    try:
        await client.auth.login("user", "pass")
    except AuthenticationError as e:
        print(f"Login failed: {e.message}")
    except GulpSDKError as e:
        print(f"SDK error: {e.message} (status: {e.status_code})")

Configuration

Environment variables are used by tests and example scripts (fixtures) to parameterize connection settings. The SDK core methods in src/gulp_sdk accept explicit arguments and do not read env vars directly.

GULP_BASE_URL=http://localhost:8080        # Server URL (default: localhost:8080)
GULP_TEST_USER=admin                       # Default test user for integration tests
GULP_TEST_PASSWORD=admin                   # Default test password for integration tests
GULP_TEST_TOKEN=                           # Optional token for test auth
GULP_REQUEST_TIMEOUT=30                    # HTTP timeout in seconds (default: 30)
GULP_WS_TIMEOUT=300                        # WebSocket timeout (default: 300)
GULP_LOG_LEVEL=INFO                        # Logging level (default: INFO)

Or programmatically:

client = GulpClient(
    base_url="http://localhost:8080",
    timeout=30.0,
    ws_auto_connect=True,
)
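
Since the SDK itself does not read environment variables, a script that wants to honor them has to map them onto constructor arguments. A minimal sketch follows, assuming only the `base_url` and `timeout` parameters shown above; `client_kwargs_from_env` is a hypothetical helper, not an SDK function.

```python
import os
from typing import Any, Dict, Mapping, Optional


def client_kwargs_from_env(env: Optional[Mapping[str, str]] = None) -> Dict[str, Any]:
    """Translate GULP_* variables into keyword arguments for GulpClient.

    Missing or empty variables fall back to the defaults listed above.
    """
    if env is None:
        env = os.environ
    return {
        "base_url": env.get("GULP_BASE_URL") or "http://localhost:8080",
        "timeout": float(env.get("GULP_REQUEST_TIMEOUT") or 30),
    }


if __name__ == "__main__":
    print(client_kwargs_from_env({"GULP_BASE_URL": "http://gulp.example:8080"}))
```

The result can be unpacked directly, as in `GulpClient(**client_kwargs_from_env())`. Accepting an explicit mapping keeps the helper easy to test without touching the real environment.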

Examples

See docs/examples/ for complete working examples.

For additional workflows, consult the integration tests in tests/integration/ and the main gulp docs for plugin-specific ingestion and query patterns.

Testing

Run tests:

# Unit tests (no dependencies)
pytest -v -s -x tests/unit

# Minimal integration tests (require a live Gulp server on localhost:8080)
pytest -v -s -x tests/integration

To run the full integration test suite, see the Gulp testing documentation.
