Skip to main content

A simple, lightweight set of connectors and functions to various APIs and DBMSs, controlled by a central broker.

Project description

ppp_connectors

A clean, modular set of Python connectors and utilities for working with both APIs and DBMS backends, unified by a centralized Broker abstraction and a consistent interface. Designed for easy testing, code reuse, and plug-and-play extensibility.

📚 Table of Contents


📦 Installation

pip install ppp-connectors

Copy the .env.example to .env for local development:

cp dev_env/.env.example dev_env/.env

Environment variables are loaded automatically via the combine_env_configs() helper.


🔌 API Connectors

All API connectors inherit from the shared Broker base class and:

  • Accept API credentials via env vars or constructor args
  • Send requests via get, post, etc.
  • Include support for logging, retry/backoff, and VCR integration

Example (URLScan)

from ppp_connectors.api_connectors.urlscan import URLScanConnector

scanner = URLScanConnector(load_env_vars=True)
result = scanner.scan(
    url="https://example.com",
    visibility="public",
    tags=["example", "demo"],
    custom={"foo": "bar"},  # Arbitrary API field passed via **kwargs
    headers={"X-Custom-Header": "my-value"}  # httpx headers
)
print(result.json())

Customizing API Requests with **kwargs

All connector methods accept arbitrary keyword arguments using **kwargs. These arguments are passed directly to the underlying httpx request methods, enabling support for any feature available in httpx — including custom headers, query parameters, timeouts, authentication, and more. Additionally, for APIs that accept arbitrary fields in their request body (like URLScan), these can also be passed as part of **kwargs and will be merged into the outgoing request. This enables full control over how API requests are constructed without needing to modify connector internals.

Example (URLScan with custom headers and params)

result = scanner.scan(
    url="https://example.com",
    visibility="unlisted",
    headers={"X-Custom-Header": "my-value"},
    params={"pretty": "true"}
)

This pattern allows flexibility without needing to subclass or modify the connector.

Proxy Awareness

API connectors inherit from the Broker class and support flexible proxy configuration for outgoing HTTP requests. You can set proxies in multiple ways:

  • a single proxy parameter (applies to all requests),
  • a per-scheme mounts parameter (e.g., separate proxies for http and https as a dictionary),
  • or environment variables (from .env or OS environment, specifically HTTP_PROXY and HTTPS_PROXY).

Proxy precedence: mounts > proxy > environment source (.env via load_env_vars=True, else OS environment if trust_env=True) > none.

  • If you provide explicit mounts, these override all other proxy settings.
  • If you set proxy, it overrides environment proxies but is overridden by mounts.
  • If neither is set, and load_env_vars=True, proxy settings are loaded from .env via combine_env_configs().
    • If both .env and OS environment have the same variable, OS environment takes precedence.
  • If no explicit proxy or mounts are set but trust_env=True, HTTPX will use OS environment proxy settings (including NO_PROXY).

Examples:

Using a single proxy:

from ppp_connectors.api_connectors.urlscan import URLScanConnector
conn = URLScanConnector(proxy="http://myproxy:8080")

Using per-scheme mounts:

conn = URLScanConnector(mounts={"https://": "http://myproxy:8080", "http://": "http://myproxy2:8888"})

Loading proxy from .env:

# .env file contains: HTTP_PROXY="http://myproxy:8080"
conn = URLScanConnector(load_env_vars=True)
# Uses HTTP_PROXY from .env even if not in OS environment.

Note: Any changes to proxy settings require re-instantiating the connector for changes to take effect.

SSL Verification and Per-Request Options

You can now pass any httpx.Client keyword arguments (such as verify=False, http2=True) when instantiating a connector. These options will be applied to all requests made by that connector.

Additionally, per-request keyword arguments can be passed to methods like .get(), .post(), etc., and will be forwarded to httpx.Client.request for that single call.

Setting verify=False disables SSL verification and can be useful for testing against servers with self-signed certificates, but should not be used in production unless you understand the security implications.

Examples:

Disable SSL verification at the connector level:

conn = URLScanConnector(verify=False)
response = conn.get("https://self-signed.badssl.com/")
print(response.status_code)

Disable SSL verification for a single request:

conn = URLScanConnector()
response = conn.get("https://self-signed.badssl.com/", verify=False)
print(response.status_code)

Enable HTTP/2:

conn = URLScanConnector(http2=True)
response = conn.get("https://nghttp2.org/httpbin/get")
print(response.http_version)

🗃️ DBMS Connectors

Each database connector follows a class-based pattern and supports reusable sessions, query helpers, and in some cases bulk_insert.

MongoDB

from ppp_connectors.dbms_connectors.mongo import MongoConnector

conn = MongoConnector("mongodb://localhost:27017", username="root", password="example")
conn.bulk_insert("mydb", "mycol", [{"foo": "bar"}])

Elasticsearch

# The query method returns a generator; use list() or iterate to access results
from ppp_connectors.dbms_connectors.elasticsearch import ElasticsearchConnector

conn = ElasticsearchConnector(["http://localhost:9200"])
results = list(conn.query("my-index", {"query": {"match_all": {}}}))
for doc in results:
    print(doc)

ODBC (e.g., Postgres, Teradata)

For automatic connection handling, use ODBCConnector as a context manager

from ppp_connectors.dbms_connectors.odbc import ODBCConnector

with ODBCConnector("DSN=PostgresLocal;UID=postgres;PWD=postgres") as db:
   rows = conn.query("SELECT * FROM my_table")
   print(list(rows))

If you'd like to keep manual control, you can still use the .close() method

from ppp_connectors.dbms_connectors.odbc import ODBCConnector

conn = ODBCConnector("DSN=PostgresLocal;UID=postgres;PWD=postgres")
rows = conn.query("SELECT * FROM my_table")
print(list(rows))
conn.close()

Splunk

from ppp_connectors.dbms_connectors.splunk import SplunkConnector

conn = SplunkConnector("localhost", 8089, "admin", "admin123", scheme="https", verify=False)
results = conn.query("search index=_internal | head 5")

🧪 Testing

✅ Unit tests

  • Located in tests/<connector_name>/test_unit_<connector>.py
  • Use mocking (MagicMock, patch) to avoid hitting external APIs

🔁 Integration tests

  • Use VCR.py to record HTTP interactions
  • Cassettes stored in: tests/<connector_name>/cassettes/
  • Automatically redact secrets (API keys, tokens, etc.)
  • Marked with @pytest.mark.integration
pytest -m integration

🧼 Suppress warnings

Add this to pytest.ini:

[pytest]
markers =
    integration: marks integration tests

🧑‍💻 Contributing / Adding a Connector

To add a new connector:

  1. Module: Place your module in:

    • ppp_connectors/api_connectors/ for API-based integrations
    • ppp_connectors/dbms_connectors/ for database-style connectors
  2. Base class:

    • Use the Broker class for APIs
    • Use the appropriate DBMS connector template for DBMSs
  3. Auth: Pull secrets using combine_env_configs() to support .env, environment variables, and CI/CD injection.

  4. Testing:

    • Add unit tests in: tests/<name>/test_unit_<connector>.py
    • Add integration tests in: tests/<name>/test_integration_<connector>.py
    • Save cassettes in: tests/<name>/cassettes/
  5. Docs:

    • Add an example usage to this README.md
    • Document all methods with docstrings
    • Ensure your connector supports logging if enable_logging=True is passed
  6. Export:

    • Optionally expose your connector via __init__.py for easier importing

🛠️ Dev Environment

git clone https://github.com/FineYoungCannibals/ppp_connectors.git
cd ppp_connectors

cp .env.example .env

python -m venv .venv
source .venv/bin/activate

poetry install  # if using poetry, or `pip install -e .[dev]`

pytest           # run all tests
black .          # format code
flake8 .         # linting

🔐 Secrets and Redaction

Sensitive values like API keys are redacted using the AUTH_PARAM_REDACT list in conftest.py. This ensures .yaml cassettes don’t leak credentials.

Redacted fields include:

  • Query/body fields like api_key, key, token
  • Header fields like Authorization, X-API-Key
  • URI query parameters

✅ Summary

  • Centralized request broker for all APIs
  • Robust DBMS connectors
  • Easy-to-write unit and integration tests with automatic redaction
  • Environment-agnostic configuration system
  • VCR-powered CI-friendly test suite

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ppp_connectors-1.0.9.tar.gz (154.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ppp_connectors-1.0.9-py3-none-any.whl (168.5 kB view details)

Uploaded Python 3

File details

Details for the file ppp_connectors-1.0.9.tar.gz.

File metadata

  • Download URL: ppp_connectors-1.0.9.tar.gz
  • Upload date:
  • Size: 154.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.13.6 Linux/6.11.0-1018-azure

File hashes

Hashes for ppp_connectors-1.0.9.tar.gz
Algorithm Hash digest
SHA256 6f1ee1e8da9554ec9c50e8e21c088f774923b83147027ffd8cbdee19b2a8d9a8
MD5 62238407b2ab8da36c5b631e09e350c0
BLAKE2b-256 3b2c4489f223e97333fc5469e9739a1ba76d8af160ff4eb5cdfe85af702b8c16

See more details on using hashes here.

File details

Details for the file ppp_connectors-1.0.9-py3-none-any.whl.

File metadata

  • Download URL: ppp_connectors-1.0.9-py3-none-any.whl
  • Upload date:
  • Size: 168.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.1.4 CPython/3.13.6 Linux/6.11.0-1018-azure

File hashes

Hashes for ppp_connectors-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 b8853eaa611694be47b9e127b0141edb61ac937ef3466cbd1cc3a48605ddd827
MD5 9e341cb4506e01ad3dfdef5a8d6f9164
BLAKE2b-256 7c221d002632572bcc2cc23bb7f76703a5c9771f721b696dfcb6cdb3ccd5f547

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page