Skip to main content

Official Kadoa SDK for Python - Web data extraction and automation

Project description

Kadoa SDK for Python

Official Python SDK for the Kadoa API, providing easy integration with Kadoa's web data extraction platform.

Installation

We recommend using a virtual environment to avoid dependency conflicts (optional). Use your preferred tool (venv, virtualenv, conda, poetry, uv).

python3 -m venv .venv
source .venv/bin/activate
pip install kadoa-sdk

Getting Started

Obtaining an API Key

  1. Register at kadoa.com
  2. Navigate to your account page
  3. Copy your API key

Quick Start

import logging
from kadoa_sdk import initialize_sdk, run_extraction, KadoaSdkConfig, ExtractionOptions

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("kadoa_sdk.examples")

# Initialize the SDK
sdk = initialize_sdk(KadoaSdkConfig(
    api_key="your-api-key"
))

# Run an extraction
result = run_extraction(sdk, ExtractionOptions(
    urls=["https://example.com"],
    name="My Extraction Workflow"
))

if result:
    logger.info("Workflow created with ID: %s", result.workflow_id)

Configuration

Basic Configuration

sdk = initialize_sdk(KadoaSdkConfig(
    api_key="your-api-key",
    base_url="https://api.kadoa.com",  # optional
    timeout=30                         # optional, in seconds
))

Using Environment Variables

KADOA_API_KEY=your-api-key
KADOA_API_URL=https://api.kadoa.com
KADOA_TIMEOUT=30
import os
from dotenv import load_dotenv
from kadoa_sdk import initialize_sdk, KadoaSdkConfig

load_dotenv()

sdk = initialize_sdk(KadoaSdkConfig(
    api_key=os.environ["KADOA_API_KEY"],
    base_url=os.environ.get("KADOA_API_URL", "https://api.kadoa.com"),
    timeout=int(os.environ.get("KADOA_TIMEOUT", "30"))
))

Event Handling

import logging
from kadoa_sdk import initialize_sdk, KadoaSdkConfig

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("kadoa_sdk.examples")

sdk = initialize_sdk(KadoaSdkConfig(api_key="your-api-key"))

# Listen to events with a lambda and log output
sdk.on_event(lambda e: logger.info("event: %s", e.to_dict()))

# Event types:
# - entity:detected
# - extraction:started
# - extraction:status_changed
# - extraction:data_available
# - extraction:completed

API Reference

initialize_sdk(config: KadoaSdkConfig)

  • api_key (required): Your Kadoa API key
  • base_url (optional): API base URL
  • timeout (optional): Request timeout in seconds

Returns an sdk instance with configured API client.

run_extraction(sdk, options: ExtractionOptions)

  • urls: List of URLs to extract from
  • name: Workflow name
  • Additional options available in API documentation

dispose(sdk: KadoaSdk)

Releases resources and removes all event listeners.

Examples

See examples directory for more usage examples.

Requirements

  • Python 3.8+

License

MIT

Support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kadoa_sdk-0.5.0.tar.gz (171.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kadoa_sdk-0.5.0-py3-none-any.whl (664.6 kB view details)

Uploaded Python 3

File details

Details for the file kadoa_sdk-0.5.0.tar.gz.

File metadata

  • Download URL: kadoa_sdk-0.5.0.tar.gz
  • Upload date:
  • Size: 171.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for kadoa_sdk-0.5.0.tar.gz
Algorithm Hash digest
SHA256 993c16ebfa3fd351c27720b6ecdcf5e695f73e886d201a5b686bfe6ee1c33950
MD5 1f01faa5614472e8906a218d97ff1481
BLAKE2b-256 60fc5d8d690159cd09b387cec1c86f074d7bc95dd6a030c37f6ae191fdbd1a10

See more details on using hashes here.

File details

Details for the file kadoa_sdk-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: kadoa_sdk-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 664.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for kadoa_sdk-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e1753f58b9150d509ed0aa7bb099cf3d23bcfb2fd635067ef502c32426fdfc40
MD5 f12164129d3572c6e8a9f7a27baa192e
BLAKE2b-256 506fd87dd6151773961c4bef49dcf2acc6f029663eae9abca3de3f8a12c32256

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page