Core SDK for Airbyte connectors with declarative YAML execution

Project description

Airbyte SDK

Type-safe connector execution framework with blessed connectors and full IDE autocomplete.

Overview

The Airbyte SDK executes connector operations through the Airbyte platform. The platform manages third-party API credentials, handles rate limiting, and provides a unified execution layer.

Installation

uv pip install airbyte-agent-sdk

Credentials

You need Airbyte platform credentials to use the SDK:

AIRBYTE_CLIENT_ID / AIRBYTE_CLIENT_SECRET — from your Airbyte Cloud organization settings
connector_id — the source UUID for an existing connector (from the Airbyte dashboard or create())

Quick Start: Connect to an Existing Source

from airbyte_agent_sdk.connectors.stripe import StripeConnector
from airbyte_agent_sdk.types import AirbyteAuthConfig

connector = StripeConnector(auth_config=AirbyteAuthConfig(
    airbyte_client_id="your_client_id",
    airbyte_client_secret="your_client_secret",
    connector_id="your_source_uuid",
))

# List customers — returns typed Pydantic envelope
customers = await connector.customers.list(limit=10)
for c in customers.data:
    print(f"{c.id}: {c.email}")
print(f"More pages? {customers.meta.has_more}")

# Get a single customer
customer = await connector.customers.get(id="cus_123")
print(customer["email"])

# Health check
check = await connector.check()
print(check.status)  # "healthy"

Getting Started: Create a New Source

Use StripeConnector.create() to provision a source on Airbyte Cloud:

from airbyte_agent_sdk.connectors.stripe import StripeConnector
from airbyte_agent_sdk.connectors.stripe.models import StripeAuthConfig
from airbyte_agent_sdk.types import AirbyteAuthConfig

connector = await StripeConnector.create(
    airbyte_config=AirbyteAuthConfig(
        airbyte_client_id="your_client_id",
        airbyte_client_secret="your_client_secret",
        workspace_name="my-workspace",
    ),
    auth_config=StripeAuthConfig(api_key="sk_test_..."),
)

# Save the connector_id for future use
print(f"Source created: {connector.connector_id}")

# Now use it
customers = await connector.customers.list(limit=5)

Generic Factory: connect()

Use connect() for any connector by name. It auto-reads AIRBYTE_CLIENT_ID and AIRBYTE_CLIENT_SECRET from the environment:

from airbyte_agent_sdk import connect, list_connectors

# Discover available connectors (bundled connector slugs)
print(list_connectors())  # ['airtable', 'github', 'stripe', ...]

# Connect (reads AIRBYTE_CLIENT_ID/SECRET from env automatically)
stripe = connect("stripe", connector_id="your_source_uuid")

# Execute operations — shorthand form
result = await stripe.execute("customers", "list", params={"limit": 10})
print(result.success)        # True
print(result.data[0]["id"])  # "cus_..."
print(result.meta)           # {"has_more": False}

# ExecutionConfig form also works:
# from airbyte_agent_sdk import ExecutionConfig
# result = await stripe.execute(
#     ExecutionConfig(entity="customers", action="list", params={"limit": 10})
# )

await stripe.close()

Workspace: Full Workspace Access

Use Workspace for workspace-level operations: natural language queries, listing connectors, creating and deleting connectors.

from airbyte_agent_sdk import Workspace

async with Workspace(workspace_name="my-workspace") as ws:
    # Ask a question across all connectors
    result = await ws.ask("list my recent Stripe customers")
    print(result.answer)

    # List live connector instances from the API
    # (contrast with list_connectors() which returns bundled connector slugs)
    for c in await ws.list_connectors():
        print(f"{c.name} ({c.id})")

    # Get a connector by name (must be exactly one instance in workspace)
    stripe = await ws.get_connector(name="stripe")
    try:
        result = await stripe.execute("customers", "list", params={"limit": 5})
    finally:
        await stripe.close()

Quick ask() — No Setup Needed

from airbyte_agent_sdk import ask

# Reads AIRBYTE_CLIENT_ID/SECRET from env
result = await ask("who are my top customers?", workspace_name="my-workspace")
print(result.answer)

Sync version for scripts and notebooks:

from airbyte_agent_sdk import ask_sync

# Sync version — works in scripts and notebooks
result = ask_sync("who are my top customers?", workspace_name="my-workspace")
print(result.answer)

Available Connectors

Connectors are included in the airbyte-agent-sdk package:

uv pip install airbyte-agent-sdk

Currently supported typed connectors: Stripe (more coming soon!)

51 connectors available via connect() — run list_connectors() to see all.

Advanced: Local Mode (Direct API Calls)

For development and testing, you can bypass the platform and call APIs directly with raw credentials:

from airbyte_agent_sdk.connectors.stripe import StripeConnector
from airbyte_agent_sdk.connectors.stripe.models import StripeAuthConfig

# Local mode — direct HTTP calls to Stripe API
connector = StripeConnector(auth_config=StripeAuthConfig(api_key="sk_test_..."))
customers = await connector.customers.list(limit=5)

Connector YAML Format

Connectors are defined in connector.yaml files:

connector:
  name: stripe
  version: 1.0.0
  base_url: https://api.stripe.com

auth:
  type: api_key
  config:
    header: Authorization
    prefix: Bearer

entities:
  - name: Customer
    actions: [get, create, update, delete, list]
    endpoints:
      get:
        method: GET
        path: /v1/customers/{id}
      create:
        method: POST
        path: /v1/customers
        body_fields: [email, name, description]
      list:
        method: GET
        path: /v1/customers
        query_params: [limit, starting_after]
    schema:
      type: object
      properties:
        id: {type: string}
        email: {type: string}
        name: {type: string}

Supported Actions

get - Fetch a single entity by ID
create - Create a new entity
update - Update an existing entity
delete - Delete an entity
list - List all entities (paginated)
search - Search with flexible criteria
download - Download binary content
authorize - Verify permissions

Working with Downloads

The download action returns an AsyncIterator[bytes] for streaming file content. You can handle this in two ways:

Option 1: Using the save_download Helper (Recommended)

The SDK provides a convenient helper to save downloads to disk:

from airbyte_agent_sdk import save_download
from airbyte_agent_sdk.connectors.zendesk_support import ZendeskSupportConnector

# Create connector instance
zendesk = ZendeskSupportConnector(
    auth_config={"api_token": "your_token"},
    subdomain="your_subdomain"
)

# Download an article attachment
download_iterator = await zendesk.download_article_attachment(
    article_id="123",
    attachment_id="456"
)

# Save to file
file_path = await save_download(download_iterator, "./downloads/attachment.pdf")
print(f"Downloaded to {file_path}")

# Overwrite existing files
file_path = await save_download(
    download_iterator,
    "./downloads/attachment.pdf",
    overwrite=True
)

Features:

✅ Creates parent directories automatically
✅ Returns absolute path to saved file
✅ Handles large files efficiently (streams chunks)
✅ Cleans up partial files on error
✅ Optional overwrite protection
✅ Expands ~ for home directory

Option 2: Manual Handling

You can also manually consume the iterator for custom processing:

# Download and process chunks manually
download_iterator = await zendesk.download_article_attachment(
    article_id="123",
    attachment_id="456"
)

# Save manually
with open("./downloads/attachment.pdf", "wb") as f:
    async for chunk in download_iterator:
        f.write(chunk)

Authentication Types

api_key - API key authentication (most common)
bearer_token - Bearer token in Authorization header
basic - HTTP Basic authentication

Secret Management

The SDK provides secure handling of sensitive credentials like API keys, tokens, and passwords through the SecretStr type and environment variable resolution.

Environment Variable References

Use the ${ENV_VAR_NAME} syntax to reference environment variables in your secrets. This is the recommended approach for security:

from airbyte_agent_sdk.executor import LocalExecutor

# Use LocalExecutor directly for local API access
executor = LocalExecutor(
    config_path="path/to/connector.yaml",
    auth_config={"api_key": "${STRIPE_API_KEY}"},
)

CLI Secret Management

The SDK's CLI commands handle secrets differently for security reasons:

`cassette record` Command (Explicit Secrets)

The record command requires explicit secret mapping via --secrets to prevent accidentally recording all environment variables in cassettes:

# Set your API key
export MY_API_KEY="sk_test_123..."

# Reference it explicitly
uv run airbyte-agent-sdk cassette record ./stripe/ \
  --entity customers \
  --action list \
  --secrets '{"token": "${MY_API_KEY}"}'

Features:

✅ Supports ${ENV_VAR_NAME} syntax for environment variables
✅ Supports literal values: '{"token": "literal_value"}'
✅ Supports multiple variables: '{"token": "${PREFIX}_${SUFFIX}"}'
✅ Validates that referenced environment variables exist
✅ Prevents accidentally recording secrets in cassette files

Examples:

# Single environment variable
--secrets '{"token": "${STRIPE_API_KEY}"}'

# Multiple environment variables in one value
--secrets '{"token": "${API_PREFIX}_${API_SUFFIX}"}'

# Mix of environment variables and literal values
--secrets '{"token": "${API_KEY}", "client_id": "my_client_123"}'

# Literal value (not recommended for production)
--secrets '{"token": "sk_test_hardcoded"}'

`test run` Command (Automatic Environment Loading)

The run command automatically loads all environment variables as potential secrets for backward compatibility:

# Set your secrets as environment variables
export STRIPE_API_KEY="sk_test_..."
export GITHUB_TOKEN="ghp_..."

# No --secrets needed; test specs reference specific vars
uv run airbyte-agent-sdk test run ./stripe/

Why the difference?

record: Explicit mapping prevents accidentally capturing all env vars in cassette files
run: Automatic loading maintains backward compatibility and convenience for running tests

SecretStr Type

All secrets are wrapped in Pydantic's SecretStr type for automatic obfuscation in logs and error messages:

from airbyte_agent_sdk.secrets import SecretStr

api_key = SecretStr("sk_test_123")
print(api_key)  # Output: **********
print(repr(api_key))  # Output: SecretStr('**********')
api_key.get_secret_value()  # Returns: 'sk_test_123'

Security benefits:

Secrets are automatically hidden in logs
Error messages don't leak secret values
String representations are obfuscated
IDE debuggers show ********** instead of actual values

Best Practices

Never hardcode secrets - Always use environment variables
Use .env files locally - Load with python-dotenv or similar
Use ${ENV_VAR} syntax in cassettes - Keeps secrets out of version control
Validate env vars exist - The SDK will error if referenced variables are missing
Use different secrets for dev/prod - Never use production credentials in tests

Example .env file:

# Development secrets (never commit to git!)
STRIPE_API_KEY=sk_test_...
GITHUB_TOKEN=ghp_...

Architecture

The SDK has a layered architecture:

Core Components

Config Loader - Parses OpenAPI 3.1 and legacy YAML formats
HTTP Client - Makes authenticated HTTP requests
Executor - Interprets YAML and executes operations
Types - Pydantic models for validation

Typed Connector System

Protocol - Defines interface for all typed connectors
Type Stubs - TypedDict definitions for full type safety
Wrapper Classes - Convenient, typed methods for each connector

┌─────────────────────────────────────────┐
│     Typed Connectors                    │
│  StripeConnector, GitHubConnector, ...  │
└──────────────────┬──────────────────────┘
                   │
┌──────────────────▼──────────────────────┐
│    connect() / HostedExecutor            │
│   (YAML-driven, works with any API)      │
└──────────────────┬──────────────────────┘
                   │
┌──────────────────▼──────────────────────┐
│          HTTP Client + Auth             │
│      (Bearer, API Key, Basic)           │
└─────────────────────────────────────────┘

All connector logic is driven by OpenAPI 3.1 specifications!

Testing Connectors

The SDK includes a powerful testing framework based on "cassettes" - YAML-based test specifications that capture real API interactions. This allows you to:

Test connectors without making live API calls
Validate request/response behavior
Ensure backward compatibility
Run tests quickly in CI/CD

What Are Cassettes?

Cassettes are YAML files that capture:

The exact HTTP request (method, path, params, headers, body)
The actual API response (status code, headers, body)
Input parameters for the operation
Secret references (using environment variables)

They're stored in tests/cassettes/ and used for fast, reliable mock testing.

Generating Cassettes

Use the cassette record command to capture real API interactions:

# Set your API key
export STRIPE_API_KEY="sk_test_..."

# Generate a cassette for listing customers
uv run airbyte-agent-sdk cassette record integrations/stripe/ \
  --entity customers \
  --action list \
  --params '{"limit": 10}' \
  --secrets '{"STRIPE_API_KEY": "${STRIPE_API_KEY}"}' \
  --output integrations/stripe/tests/cassettes

# Generate a cassette for retrieving a customer
uv run airbyte-agent-sdk cassette record integrations/stripe/ \
  --entity customers \
  --action get \
  --params '{"id": "cus_xxx"}' \
  --secrets '{"STRIPE_API_KEY": "${STRIPE_API_KEY}"}' \
  --output integrations/stripe/tests/cassettes

What happens:

The command executes the real operation against the API
HTTP requests/responses are logged
A YAML cassette file is automatically generated
Sensitive data is automatically redacted

Command options:

--entity - Entity name (e.g., "customers")
--action - Operation action (e.g., "list", "get", "create")
--params - JSON string of operation parameters
--secrets - JSON mapping of secret names to environment variable references
--output - Directory to save cassette files (default: "tests/cassettes")

Validating Cassettes

Validate cassette files to ensure they're well-formed:

uv run airbyte-agent-sdk test validate integrations/stripe/tests/cassettes/customers_list.yaml

Output:

Validating test spec: integrations/stripe/tests/cassettes/customers_list.yaml...
✓ Test specification is valid

Running Tests

Run tests using your cassettes in mock mode (no API calls):

# Set environment variables for secret resolution
export STRIPE_API_KEY="sk_test_..."

# Run all tests in a directory
uv run airbyte-agent-sdk test run integrations/stripe/ \
  --test-dir integrations/stripe/tests/cassettes \
  --verbose

Output:

Running tests for integrations/stripe/connector.yaml...
Test directory: integrations/stripe/tests/cassettes
Mode: mock

  ✓ customers_list (0.4ms)
  ✓ customers_get (0.2ms)

============================================================
Test Report: integrations/stripe/connector.yaml
Mode: mock
============================================================

Summary:
------------------------------------------------------------
  Total:        2
  Passed:       2 ✓
  Failed:       0 ✗
  Errors:       0 ⚠
  Success Rate: 100.0%
  Duration:     19.8ms
============================================================
✅ ALL TESTS PASSED
============================================================

Test modes:

mock (default) - Uses cassettes, no real API calls
More modes coming soon (live, record)

Output formats:

console (default) - Human-readable output
json - Machine-readable JSON report
html - HTML report file

# Generate JSON report
uv run airbyte-agent-sdk test run integrations/stripe/ \
  --test-dir integrations/stripe/tests/cassettes \
  --format json \
  --output results.json

# Generate HTML report
uv run airbyte-agent-sdk test run integrations/stripe/ \
  --test-dir integrations/stripe/tests/cassettes \
  --format html \
  --output report.html

Cassette File Format

Cassettes are YAML files with this structure:

test_name: "customers_list"
description: "Captured from real API call"
entity: "customers"
action: "list"

# Secret references (resolved from environment variables)
secrets:
  STRIPE_API_KEY: "${STRIPE_API_KEY}"

# Input parameters for the operation
inputs:
  params:
    limit: 10

# Expected HTTP request (captured from real API)
captured_request:
  method: "GET"
  path: "/v1/customers"
  query_params:
    limit: "10"
  headers: {}
  body: null

# Captured HTTP response (from real API)
captured_response:
  status_code: 200
  headers: {}
  body:
    object: "list"
    data: [...]

Key features:

Authentication headers are NOT included - they're auto-injected from connector.yaml
Secrets use environment variable references like ${STRIPE_API_KEY}
Response body contains actual data from your test API account

Anonymous Telemetry

The Airbyte SDK includes optional anonymous telemetry to help us understand how connectors are being used in the wild. This helps us prioritize features and improvements.

What's Tracked

What's Tracked (Basic Mode - Default):

Connector name and version
API operations used (entity/action, e.g., "customers/list")
Success/failure rates
Performance metrics (timing)
Error types (not error messages or parameters)
Execution context (MCP server, direct SDK, etc.)
System info (Python version, OS)
Public IP address (for usage analytics and regional insights)
Anonymous user ID (stored in config to correlate sessions)

What's NOT Tracked:

Your API keys or credentials
Customer data or PII
Actual API responses
Error messages or parameters

Opt-out

Telemetry is enabled by default. To disable it:

export AIRBYTE_TELEMETRY_MODE=disabled

Configuration

The SDK stores configuration in ~/.airbyte/connector-sdk/config.yaml:

# Generated user ID for anonymous telemetry
user_id: "550e8400-e29b-41d4-a716-446655440000"

# Set to true for internal Airbyte users
is_internal_user: false

The anonymous user ID allows us to understand usage patterns across multiple sessions from the same user. The ID is a random UUID and contains no personally identifiable information. You can delete the config file at any time to generate a new ID.

For Airbyte employees: Run the setup script to mark yourself as an internal user:

./scripts/setup_internal_user.sh

Or set the environment variable (takes precedence over config file):

export AIRBYTE_INTERNAL_USER=true

For more details, see our Privacy Policy.

Development

# Install dev dependencies
uv pip install -e ".[dev]"

# Run tests
pytest

License

MIT

Project details

Release history Release notifications | RSS feed

0.1.89

Apr 27, 2026

0.1.88

Apr 27, 2026

0.1.87

Apr 26, 2026

0.1.86

Apr 26, 2026

0.1.85

Apr 26, 2026

0.1.84

Apr 26, 2026

0.1.83

Apr 26, 2026

0.1.82

Apr 26, 2026

0.1.81

Apr 26, 2026

0.1.80

Apr 26, 2026

0.1.79

Apr 25, 2026

0.1.78

Apr 25, 2026

0.1.77

Apr 25, 2026

0.1.76

Apr 25, 2026

0.1.75

Apr 25, 2026

0.1.74

Apr 25, 2026

0.1.73

Apr 25, 2026

0.1.72

Apr 25, 2026

0.1.71

Apr 25, 2026

0.1.70

Apr 25, 2026

0.1.69

Apr 25, 2026

0.1.68

Apr 25, 2026

0.1.67

Apr 24, 2026

0.1.66

Apr 24, 2026

0.1.65

Apr 24, 2026

0.1.64

Apr 24, 2026

0.1.63

Apr 24, 2026

0.1.62

Apr 24, 2026

0.1.61

Apr 24, 2026

0.1.60

Apr 24, 2026

0.1.59

Apr 24, 2026

0.1.58

Apr 23, 2026

0.1.57

Apr 23, 2026

0.1.56

Apr 23, 2026

0.1.55

Apr 22, 2026

0.1.54

Apr 22, 2026

0.1.53

Apr 22, 2026

0.1.52

Apr 22, 2026

0.1.51

Apr 22, 2026

0.1.50

Apr 22, 2026

0.1.49

Apr 22, 2026

0.1.48

Apr 22, 2026

0.1.47

Apr 21, 2026

0.1.46

Apr 21, 2026

0.1.45

Apr 21, 2026

0.1.44

Apr 21, 2026

0.1.43

Apr 21, 2026

0.1.42

Apr 21, 2026

0.1.41

Apr 21, 2026

0.1.40

Apr 21, 2026

0.1.39

Apr 21, 2026

0.1.38

Apr 21, 2026

0.1.37

Apr 21, 2026

0.1.36

Apr 21, 2026

0.1.35

Apr 21, 2026

0.1.34

Apr 21, 2026

0.1.33

Apr 21, 2026

0.1.32

Apr 20, 2026

0.1.31

Apr 20, 2026

0.1.30

Apr 20, 2026

0.1.29

Apr 19, 2026

0.1.28

Apr 19, 2026

0.1.27

Apr 19, 2026

0.1.26

Apr 19, 2026

0.1.25

Apr 19, 2026

0.1.24

Apr 19, 2026

0.1.23

Apr 19, 2026

0.1.22

Apr 19, 2026

0.1.21

Apr 19, 2026

0.1.20

Apr 19, 2026

0.1.19

Apr 19, 2026

0.1.18

Apr 19, 2026

0.1.17

Apr 19, 2026

0.1.16

Apr 18, 2026

0.1.15

Apr 17, 2026

0.1.14

Apr 17, 2026

0.1.13

Apr 17, 2026

0.1.12

Apr 17, 2026

0.1.11

Apr 17, 2026

0.1.10

Apr 17, 2026

This version

0.1.9

Apr 17, 2026

0.1.8

Apr 17, 2026

0.1.7

Apr 17, 2026

0.1.6

Apr 16, 2026

0.1.5

Apr 16, 2026

0.1.4

Apr 16, 2026

0.1.3

Apr 16, 2026

0.1.2

Apr 16, 2026

0.1.1

Apr 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

airbyte_agent_sdk-0.1.9-py3-none-any.whl (2.7 MB view details)

Uploaded Apr 17, 2026 Python 3

File details

Details for the file airbyte_agent_sdk-0.1.9-py3-none-any.whl.

File metadata

Download URL: airbyte_agent_sdk-0.1.9-py3-none-any.whl
Upload date: Apr 17, 2026
Size: 2.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for airbyte_agent_sdk-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`74c977e2c2094f01efa6dd6b18a20829d3b465b64def454c746c371892101c92`
MD5	`5bc226639e77a752409366eded48dfea`
BLAKE2b-256	`9dc8e9331a38762a5c7ac4ee14d07d137e605b5877eeff77ee127418fe0ec3ce`

See more details on using hashes here.

airbyte-agent-sdk 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Airbyte SDK

Overview

Installation

Credentials

Quick Start: Connect to an Existing Source

Getting Started: Create a New Source

Generic Factory: connect()

Workspace: Full Workspace Access

Quick ask() — No Setup Needed

Available Connectors

Advanced: Local Mode (Direct API Calls)

Connector YAML Format

Supported Actions

Working with Downloads

Option 1: Using the save_download Helper (Recommended)

Option 2: Manual Handling

Authentication Types

Secret Management

Environment Variable References

CLI Secret Management

cassette record Command (Explicit Secrets)

test run Command (Automatic Environment Loading)

SecretStr Type

Best Practices

Architecture

Core Components

Typed Connector System

Testing Connectors

What Are Cassettes?

Generating Cassettes

Validating Cassettes

Running Tests

Cassette File Format

Anonymous Telemetry

What's Tracked

Opt-out

Configuration

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

`cassette record` Command (Explicit Secrets)

`test run` Command (Automatic Environment Loading)