Skip to main content

ETL module for RD Station API database-optimized DataFrame processing

Project description

RD Station API Helper

A Python library for interacting with the RD Station API, providing ORM models, authentication, segmentation, contact, and event retrieval, as well as batch and parallel data fetching utilities.

PyPI version Last Commit Issues License

Features

  • RD Station API v2 support: Query segmentations, contacts, leads, and conversion events
  • Batch & Parallel Fetching: Utilities for efficient data extraction with configurable workers
  • Robust Error Handling: Comprehensive error handling and retry logic with coordinated barriers
  • Webhook Data Processing: Fetch and process webhook events from SQL databases
  • PostgreSQL Integration: Built-in PostgreSQL utilities for data storage and retrieval
  • ORM Models: SQLAlchemy models for RD Station entities (Segmentation, Contact, Lead, etc.)
  • Logging & Config Utilities: Easy configuration and logging
  • Type Hints: Full type hint support for better IDE experience

Installation

pip install rdstation-api-helper

Quick Start

1. Set up credentials

Create a secrets/rdstation_secret.json file with your RD Station API credentials:

{
  "RDSTATION_CLIENT_ID": "YOUR_CLIENT_ID",
  "RDSTATION_CLIENT_SECRET": "YOUR_CLIENT_SECRET",
  "RDSTATION_REFRESH_TOKEN": "YOUR_REFRESH_TOKEN"
}

2. Basic usage

from rdstation_api_helper import RDStationAPI

# Initialize API client (loads credentials from environment or .env)
client = RDStationAPI()

# Fetch all segmentations
segmentations = client.get_segmentations()

# Fetch contacts for each segmentation
contacts = client.get_segmentation_contacts("segmentations_id")

# Fetch contact data for a specific UUID
status_code, contact_data = client.get_contact_data("contact_uuid")

# Fetch conversion events for a contact
status_code, events = client.get_contact_events("some-contact_uuid")

# Fetch webhook events from database
from rdstation_api_helper.utils import PostgresDB, PgConfig

# Initialize database connection
db = PostgresDB()

# Fetch webhook events within date range
webhook_events = client.get_webhook_events(
    start_date="2025-08-01",
    end_date="2025-08-28", 
    engine=db.engine,
    table_name="rd_webhook_v1",
    api_version="v1"
)

ORM Models

The package provides SQLAlchemy ORM models for RD Station entities, which can be used for database integration.

  • Segmentation
  • SegmentationContact
  • Contact
  • ContactFunnelStatus
  • ConversionEvents
  • Lead

Database Integration

The library includes PostgreSQL utilities for easy database integration:

from rdstation_api_helper.utils import PostgresDB, PgConfig

# Using environment variables (PGHOST, PGPORT, PGDATABASE, PGUSER, PGPASSWORD)
db = PostgresDB()

# Or with custom configuration
config = PgConfig(
    host="localhost",
    port="5432", 
    dbname="mydb",
    user="myuser",
    password="mypass"
)
db = PostgresDB(config=config)

# Save data to database with upsert support
db.save_to_sql(data, Contact, upsert_values=True)

Examples

Check the examples/ directory for comprehensive usage examples:

  • basic_usage.py - Simple report extraction

Parallel & Batch Fetching

The library provides a parallel_decorator utility to easily parallelize API calls for batch data fetching. This is used in the following methods of RDStationAPI:

  • get_contact_data_parallel(uuids: list[str])
  • get_contact_events_parallel(uuids: list[str])
  • get_contact_funnel_status_parallel(uuids: list[str])

These methods accept a list of UUIDs and fetch the corresponding data in parallel, handling rate limits and transient errors automatically. The decorator coordinates retries for 429/5xx/network errors and ensures each result is tagged with its UUID.

Usage Example

from rdstation_api_helper import RDStationAPI

client = RDStationAPI()
uuids = ["uuid1", "uuid2", "uuid3"]

# Fetch contact data in parallel
_, contact_results = client.get_contact_data_parallel(uuids)

# Fetch contact events in parallel
_, events_results = client.get_contact_events_parallel(uuids)

# Fetch funnel status in parallel
_, funnel_results = client.get_contact_funnel_status_parallel(uuids)

print(contact_results)
print(events_results)
print(funnel_results)

Features:

  • Automatic parallelization with configurable worker count
  • Handles 429/5xx/network errors with coordinated retries
  • Appends the UUID to each result for traceability

See the rdstation_api_helper/utils.py source for details.

Requirements

  • Python 3.11-3.14
  • pandas >= 2.0.0
  • python-dotenv >= 1.0.0
  • requests >= 2.32.4
  • sqlalchemy >= 2.0.0
  • psycopg2-binary >= 2.9.0
  • tqdm >= 4.65.0

License

This project is licensed under the GPL License. See LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rdstation_api_helper-1.1.6.tar.gz (25.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rdstation_api_helper-1.1.6-py3-none-any.whl (26.0 kB view details)

Uploaded Python 3

File details

Details for the file rdstation_api_helper-1.1.6.tar.gz.

File metadata

  • Download URL: rdstation_api_helper-1.1.6.tar.gz
  • Upload date:
  • Size: 25.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.12.3 Linux/5.15.167.4-microsoft-standard-WSL2

File hashes

Hashes for rdstation_api_helper-1.1.6.tar.gz
Algorithm Hash digest
SHA256 8a1e0b660498bda16c93939d20e8aa5b2e48d887f1281a21cc014f29aac5e6ae
MD5 01e3efe16fa2b828b7316578fe9c35b3
BLAKE2b-256 0fc5751b380c53f6ba66d73396ecf3cd8e6df4551ebcdc8029dd9a71fecdb23a

See more details on using hashes here.

File details

Details for the file rdstation_api_helper-1.1.6-py3-none-any.whl.

File metadata

  • Download URL: rdstation_api_helper-1.1.6-py3-none-any.whl
  • Upload date:
  • Size: 26.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.12.3 Linux/5.15.167.4-microsoft-standard-WSL2

File hashes

Hashes for rdstation_api_helper-1.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 5da486bc3b488d36d90ae242a59903f717dbceca98d0ddba4ac0ebf9292a1413
MD5 cf64bad40b9829818b0752460429238b
BLAKE2b-256 94119850a1557cb246b549f2349f9ec7b6e165dd864da514e3064f1a5c665333

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page