ETL module for RD Station API database-optimized DataFrame processing

RD Station API Helper

A Python library for interacting with the RD Station API. It provides ORM models, authentication, retrieval of segmentations, contacts, and events, and utilities for batch and parallel data fetching.

Features

  • RD Station API v2 support: Query segmentations, contacts, leads, and conversion events
  • Batch & Parallel Fetching: Utilities for efficient data extraction with configurable workers
  • Robust Error Handling: Comprehensive error handling and retry logic with coordinated barriers
  • Webhook Data Processing: Fetch and process webhook events from SQL databases
  • PostgreSQL Integration: Built-in PostgreSQL utilities for data storage and retrieval
  • ORM Models: SQLAlchemy models for RD Station entities (Segmentation, Contact, Lead, etc.)
  • Logging & Config Utilities: Easy configuration and logging
  • Type Hints: Full type hint support for better IDE experience

Installation

pip install rdstation-api-helper

Quick Start

1. Set up credentials

Create a secrets/rdstation_secret.json file with your RD Station API credentials:

{
  "RDSTATION_CLIENT_ID": "YOUR_CLIENT_ID",
  "RDSTATION_CLIENT_SECRET": "YOUR_CLIENT_SECRET",
  "RDSTATION_REFRESH_TOKEN": "YOUR_REFRESH_TOKEN"
}
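
If the client is expected to read these values from environment variables (as the comment in the next step suggests), one way to bridge the JSON file and the environment is sketched below. This is only an illustration: load_secrets_into_env is a hypothetical helper written for this example, not part of rdstation-api-helper, and the library may already load the file on its own.

```python
from __future__ import annotations

import json
import os
from pathlib import Path


def load_secrets_into_env(path: str | Path, overwrite: bool = False) -> list[str]:
    """Copy the key/value pairs of a JSON secrets file into os.environ.

    Hypothetical helper for illustration; not part of rdstation-api-helper.
    Returns the names of the variables that were actually set.
    """
    secrets = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(secrets, dict):
        raise ValueError("secrets file must contain a JSON object")

    loaded: list[str] = []
    for key, value in secrets.items():
        # Keep values that are already exported unless overwrite is requested.
        if not overwrite and key in os.environ:
            continue
        os.environ[key] = str(value)
        loaded.append(key)
    return loaded
```

Calling load_secrets_into_env("secrets/rdstation_secret.json") before creating the client would export the three RDSTATION_* keys shown above without replacing values that are already set.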

2. Basic usage

from rdstation_api_helper import RDStationAPI

# Initialize API client (loads credentials from environment or .env)
client = RDStationAPI()

# Fetch all segmentations
segmentations = client.get_segmentations()

# Fetch the contacts of a single segmentation (pass its ID)
contacts = client.get_segmentation_contacts("segmentation_id")

# Fetch contact data for a specific UUID
status_code, contact_data = client.get_contact_data("contact_uuid")

# Fetch conversion events for a contact
status_code, events = client.get_contact_events("contact_uuid")

# Fetch webhook events stored in a PostgreSQL database
from rdstation_api_helper.utils import PostgresDB

# Initialize database connection (uses the PG* environment variables)
db = PostgresDB()

# Fetch webhook events within a date range
webhook_events = client.get_webhook_events(
    start_date="2025-08-01",
    end_date="2025-08-28",
    engine=db.engine,
    table_name="rd_webhook_v1",
    api_version="v1"
)

ORM Models

The package provides SQLAlchemy ORM models for RD Station entities, which can be used for database integration.

  • Segmentation
  • SegmentationContact
  • Contact
  • ContactFunnelStatus
  • ConversionEvents
  • Lead

Database Integration

The library includes PostgreSQL utilities for easy database integration:

from rdstation_api_helper.utils import PostgresDB, PgConfig

# Using environment variables (PGHOST, PGPORT, PGDATABASE, PGUSER, PGPASSWORD)
db = PostgresDB()

# Or with custom configuration
config = PgConfig(
    host="localhost",
    port="5432", 
    dbname="mydb",
    user="myuser",
    password="mypass"
)
db = PostgresDB(config=config)

# Save data to the database with upsert support.
# `data` holds the records to persist; `Contact` is one of the ORM models listed above.
db.save_to_sql(data, Contact, upsert_values=True)
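
For readers unfamiliar with the term, the sketch below shows what an upsert does: a row is inserted if its key is new and updated if the key already exists. It is an assumption that upsert_values=True behaves this way. The example uses Python's built-in sqlite3 module rather than PostgreSQL so that it runs without a server, and the contact table and upsert_contacts function are hypothetical names invented for the illustration.

```python
import sqlite3

# In-memory SQLite database standing in for PostgreSQL (illustration only).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE contact (uuid TEXT PRIMARY KEY, email TEXT)")

# Insert a row, or update the email when the uuid already exists.
UPSERT = """
INSERT INTO contact (uuid, email) VALUES (:uuid, :email)
ON CONFLICT (uuid) DO UPDATE SET email = excluded.email
"""


def upsert_contacts(connection: sqlite3.Connection, rows: list[dict]) -> None:
    """Hypothetical helper: upsert a batch of rows in one transaction."""
    with connection:  # commits on success, rolls back on error
        connection.executemany(UPSERT, rows)


upsert_contacts(conn, [{"uuid": "u1", "email": "old@example.com"}])
upsert_contacts(
    conn,
    [
        {"uuid": "u1", "email": "new@example.com"},  # existing key: updated
        {"uuid": "u2", "email": "b@example.com"},    # new key: inserted
    ],
)
rows = conn.execute("SELECT uuid, email FROM contact ORDER BY uuid").fetchall()
```

After the second call the table holds two rows, and the row for u1 carries the new email address. PostgreSQL accepts the same ON CONFLICT ... DO UPDATE syntax.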

Examples

Check the examples/ directory for comprehensive usage examples:

  • basic_usage.py - Simple report extraction

Parallel & Batch Fetching

The library provides a parallel_decorator utility to easily parallelize API calls for batch data fetching. This is used in the following methods of RDStationAPI:

  • get_contact_data_parallel(uuids: list[str])
  • get_contact_events_parallel(uuids: list[str])
  • get_contact_funnel_status_parallel(uuids: list[str])

These methods accept a list of UUIDs and fetch the corresponding data in parallel, handling rate limits and transient errors automatically. The decorator coordinates retries for 429/5xx/network errors and ensures each result is tagged with its UUID.
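
The general pattern described here (fan out over a list of UUIDs, retry transient failures, tag each result) can be sketched with the standard library alone. The code below is not the library's parallel_decorator: fetch_all, its parameters, and the set of retryable status codes are assumptions made for illustration. It retries independently in each worker and does not reproduce the coordinated retries mentioned above.

```python
from __future__ import annotations

import time
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable

# Assumed set of status codes worth retrying (rate limit and server errors).
RETRYABLE_STATUS = {429, 500, 502, 503, 504}


def fetch_all(
    fetch_one: Callable[[str], tuple[int, dict[str, Any]]],
    uuids: list[str],
    max_workers: int = 4,
    max_retries: int = 3,
    backoff_seconds: float = 0.5,
) -> list[dict[str, Any]]:
    """Call fetch_one(uuid) for every UUID in a thread pool (hypothetical helper)."""

    def worker(uuid: str) -> dict[str, Any]:
        status: int = 0
        payload: dict[str, Any] = {}
        for attempt in range(max_retries + 1):
            try:
                status, payload = fetch_one(uuid)
            except OSError:
                # OSError stands in for network failures; status 0 marks them.
                status, payload = 0, {}
            if status != 0 and status not in RETRYABLE_STATUS:
                break
            if attempt < max_retries:
                # Exponential backoff before the next attempt.
                time.sleep(backoff_seconds * (2 ** attempt))
        # Tag the result with its UUID so it can be traced back to the request.
        return {**payload, "uuid": uuid, "status_code": status}

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves the order of the input UUIDs.
        return list(pool.map(worker, uuids))
```

Any callable that takes a UUID and returns a (status_code, payload) pair can be passed as fetch_one. A failure that outlasts all retries is returned with its last status code instead of raising, so one bad UUID does not abort the batch.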

Usage Example

from rdstation_api_helper import RDStationAPI

client = RDStationAPI()
uuids = ["uuid1", "uuid2", "uuid3"]

# Fetch contact data in parallel
_, contact_results = client.get_contact_data_parallel(uuids)

# Fetch contact events in parallel
_, events_results = client.get_contact_events_parallel(uuids)

# Fetch funnel status in parallel
_, funnel_results = client.get_contact_funnel_status_parallel(uuids)

print(contact_results)
print(events_results)
print(funnel_results)

Features:

  • Automatic parallelization with configurable worker count
  • Handles 429/5xx/network errors with coordinated retries
  • Appends the UUID to each result for traceability

See the rdstation_api_helper/utils.py source for details.

Requirements

  • Python 3.10-3.12
  • pandas >= 2.0.0
  • python-dotenv >= 1.0.0
  • requests >= 2.32.4
  • sqlalchemy >= 2.0.0
  • psycopg2-binary >= 2.9.0
  • tqdm >= 4.65.0

License

This project is licensed under the GPL. See the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
