ETL module for the RD Station API with database-optimized DataFrame processing

RD Station API Helper

A Python library for interacting with the RD Station API. It provides authentication, SQLAlchemy ORM models, retrieval of segmentations, contacts, and events, and utilities for batch and parallel data fetching.

Features

  • RD Station API v2 support: Query segmentations, contacts, leads, and conversion events
  • Batch & Parallel Fetching: Utilities for efficient data extraction with configurable workers
  • Robust Error Handling: Comprehensive error handling and retry logic with coordinated barriers
  • Webhook Data Processing: Fetch and process webhook events from SQL databases
  • PostgreSQL Integration: Built-in PostgreSQL utilities for data storage and retrieval
  • ORM Models: SQLAlchemy models for RD Station entities (Segmentation, Contact, Lead, etc.)
  • Logging & Config Utilities: Easy configuration and logging
  • Type Hints: Full type hint support for better IDE experience

Installation

pip install rdstation-api-helper

Quick Start

1. Set up credentials

Create a secrets/rdstation_secret.json file with your RD Station API credentials:

{
  "RDSTATION_CLIENT_ID": "YOUR_CLIENT_ID",
  "RDSTATION_CLIENT_SECRET": "YOUR_CLIENT_SECRET",
  "RDSTATION_REFRESH_TOKEN": "YOUR_REFRESH_TOKEN"
}
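
The README does not say whether the client reads this JSON file directly, and the comment under Basic usage mentions the environment or a .env file instead. If the values need to be in the environment, a minimal sketch for exporting them could look like the following. The helper name load_secrets_into_env is hypothetical and is not part of the library:

```python
import json
import os
from pathlib import Path


def load_secrets_into_env(path: str) -> dict[str, str]:
    """Read a flat JSON credentials file and export each key as an environment variable.

    Hypothetical helper for illustration; not part of rdstation-api-helper.
    """
    secrets = json.loads(Path(path).read_text(encoding="utf-8"))
    for key, value in secrets.items():
        # setdefault leaves values that are already set in the environment untouched
        os.environ.setdefault(key, str(value))
    return secrets


# Example call (assumes the file from step 1 exists):
# load_secrets_into_env("secrets/rdstation_secret.json")
```

Using setdefault means a value set by the deployment environment wins over the file, which is usually the safer default for secrets.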

2. Basic usage

from rdstation_api_helper import RDStationAPI

# Initialize API client (loads credentials from environment or .env)
client = RDStationAPI()

# Fetch all segmentations
segmentations = client.get_segmentations()

# Fetch contacts for a single segmentation
contacts = client.get_segmentation_contacts("segmentation_id")

# Fetch contact data for a specific UUID
status_code, contact_data = client.get_contact_data("contact_uuid")

# Fetch conversion events for a contact
status_code, events = client.get_contact_events("contact_uuid")

# Fetch webhook events from database
from rdstation_api_helper.utils import PostgresDB

# Initialize database connection
db = PostgresDB()

# Fetch webhook events within date range
webhook_events = client.get_webhook_events(
    start_date="2025-08-01",
    end_date="2025-08-28",
    engine=db.engine,
    table_name="rd_webhook_v1",
    api_version="v1"
)

ORM Models

The package provides SQLAlchemy ORM models for RD Station entities, which can be used for database integration.

  • Segmentation
  • SegmentationContact
  • Contact
  • ContactFunnelStatus
  • ConversionEvents
  • Lead

Database Integration

The library includes PostgreSQL utilities for easy database integration:

from rdstation_api_helper.utils import PostgresDB, PgConfig

# Using environment variables (PGHOST, PGPORT, PGDATABASE, PGUSER, PGPASSWORD)
db = PostgresDB()

# Or with custom configuration
config = PgConfig(
    host="localhost",
    port="5432",
    dbname="mydb",
    user="myuser",
    password="mypass"
)
db = PostgresDB(config=config)

# Save data to the database with upsert support
# (data: records fetched earlier; Contact: one of the ORM models listed above)
db.save_to_sql(data, Contact, upsert_values=True)
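
To make the upsert behavior concrete, here is a small sketch of what an upsert does at the SQL level. It uses the standard-library sqlite3 module only so that it runs without a PostgreSQL server; SQLite's ON CONFLICT clause follows the same pattern as PostgreSQL's. The table and column names are made up for illustration, and this is not how save_to_sql is implemented:

```python
import sqlite3

# In-memory database with a hypothetical contact table keyed by uuid
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE contact (uuid TEXT PRIMARY KEY, email TEXT)")

# Insert a row, or update the email if a row with the same uuid already exists
UPSERT = """
INSERT INTO contact (uuid, email) VALUES (?, ?)
ON CONFLICT (uuid) DO UPDATE SET email = excluded.email
"""

conn.executemany(UPSERT, [("uuid1", "old@example.com"), ("uuid2", "b@example.com")])
conn.executemany(UPSERT, [("uuid1", "new@example.com")])  # same key: updates in place

result = conn.execute("SELECT uuid, email FROM contact ORDER BY uuid").fetchall()
print(result)  # [('uuid1', 'new@example.com'), ('uuid2', 'b@example.com')]
```

Re-running an extraction with upserts enabled therefore refreshes existing rows rather than failing on duplicate keys.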

Examples

Check the examples/ directory for comprehensive usage examples:

  • basic_usage.py - Simple report extraction

Parallel & Batch Fetching

The library provides a parallel_decorator utility to easily parallelize API calls for batch data fetching. This is used in the following methods of RDStationAPI:

  • get_contact_data_parallel(uuids: list[str])
  • get_contact_events_parallel(uuids: list[str])
  • get_contact_funnel_status_parallel(uuids: list[str])

These methods accept a list of UUIDs and fetch the corresponding data in parallel, handling rate limits and transient errors automatically. The decorator coordinates retries for 429/5xx/network errors and ensures each result is tagged with its UUID.

Usage Example

from rdstation_api_helper import RDStationAPI

client = RDStationAPI()
uuids = ["uuid1", "uuid2", "uuid3"]

# Fetch contact data in parallel
_, contact_results = client.get_contact_data_parallel(uuids)

# Fetch contact events in parallel
_, events_results = client.get_contact_events_parallel(uuids)

# Fetch funnel status in parallel
_, funnel_results = client.get_contact_funnel_status_parallel(uuids)

print(contact_results)
print(events_results)
print(funnel_results)

Features:

  • Automatic parallelization with configurable worker count
  • Handles 429/5xx/network errors with coordinated retries
  • Appends the UUID to each result for traceability

See the rdstation_api_helper/utils.py source for details.

Requirements

  • Python 3.10-3.12
  • pandas >= 2.0.0
  • python-dotenv >= 1.0.0
  • requests >= 2.32.4
  • sqlalchemy >= 2.0.0
  • psycopg2-binary >= 2.9.0
  • tqdm >= 4.65.0

License

This project is licensed under the GPL. See the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
