Lib to connect python/django modules to redshift

These details have not been verified by PyPI

Project description

Weni Data Lake SDK

The Weni Data Lake SDK is a Python library that provides an interface to interact with Weni's data lake services. It supports operations for sending data, managing message templates, and handling traces.

Installation

pip install weni-datalake-sdk
In case you are using poetry, you can add the package to your project with the following command:
poetry add weni-datalake-sdk

Environment Variables

To insert data into the data lake, you need to set the following environment variables:

DATALAKE_SERVER_ADDRESS=your_server_address

To get data from the data lake, you need to set the following environment variables:

REDSHIFT_QUERY_BASE_URL=your_redshift_url
REDSHIFT_SECRET=your_secret
REDSHIFT_ROLE_ARN=your_role_arn
MESSAGE_TEMPLATES_METRIC_NAME=your_metric_name (if you want to get message templates)
TRACES_METRIC_NAME=your_trace_metric_name (if you want to get traces)
EVENTS_METRIC_NAME=your_event_metric_name (if you want to get events)

Although you will need some AWS credentials to get data from the data lake, you can use the following environment variables:

AWS_ACCESS_KEY_ID=your_access_key_id
AWS_SECRET_ACCESS_KEY=your_secret_access_key
AWS_DEFAULT_REGION=your_region

This is important that we will use assumed role to get data from the data lake.

Usage Examples

1. Sending Data

from weni_datalake_sdk.clients.client import send_data
from weni_datalake_sdk.paths.your_path import YourPath

# Prepare your data
data = {
    "field1": "value1",
    "field2": "value2"
}

# Send data using a path class
send_data(YourPath, data)

# Or using an instantiated path
path = YourPath()
send_data(path, data)

2. Send Event Data

from weni_datalake_sdk.clients.client import send_event_data
from weni_datalake_sdk.paths.events_path import EventPath

# Prepare your data
data = {
    "event_name": "event_name",
    "key": "key",
    "value": "value",
    "value_type": "value_type",
    "date": "2021-01-01",
    "project": "project_uuid",
    "contact_urn": "contact_urn",
    "metadata": {
        "field1": "value1",
        "field2": "value2"
    }
}

2. Get Message Templates

from weni_datalake_sdk.clients.redshift.message_templates import get_message_templates

# Get templates with specific parameters
result = get_message_templates(
    contact_urn="contact123",
    template_uuid="template_uuid"
)

3. Get Traces

from weni_datalake_sdk.clients.redshift.traces import get_traces

# Get traces with query parameters
result = get_traces(
    query_params={
        "message_uuid": "123e4567-e89b-12d3-a456-426614174000"
    }
)

4. Get Events

from weni_datalake_sdk.clients.redshift.events import get_events    

# Get events with query parameters
result = get_events(
    query_params={
        "date_start": "2021-01-01", # date_start is required
        "date_end": "2021-01-01", # date_end is required
        "project": "project_uuid", # project is optional
        "event_type": "event_type", # event_type is optional
        "contact_urn": "contact_urn", # contact_urn is optional
        "event_name": "event_name", # event_name is optional
        "key": "key", # key is optional
        "value": "value", # value is optional
        "value_type": "value_type" # value_type is optional
    }
)

5. Get Events Count

from weni_datalake_sdk.clients.redshift.events import get_events_count

# Get events count with required and optional parameters
result = get_events_count(
    project="your_project_uuid", # project is required
    date_start="2025-06-03T00:00:00Z", # date_start is required
    date_end="2025-07-30T23:59:59Z", # date_end is required
    event_type="event_type", # event_type is optional
    event_name="event_name", # event_name is optional
    key="topics",  # key is optional
    value="value", # value is optional
    value_type="value_type", # value_type is optional
    contact_urn="contact_urn", # contact_urn is optional
)
print(result)

6. Get Events Count By Group

from weni_datalake_sdk.clients.redshift.events import get_events_count_by_group

# Get events count grouped by a metadata key
result = get_events_count_by_group(
    project="your_project_uuid", # project is required
    date_start="2025-06-03T00:00:00Z", # date_start is required
    date_end="2025-07-30T23:59:59Z", # date_end is required
    metadata_key="topic_uuid", # metadata_key is required
    event_type="event_type", # event_type is optional
    event_name="event_name", # event_name is optional
    key="topics",  # key is optional
    value="value", # value is optional
    value_type="value_type", # value_type is optional
    contact_urn="contact_urn", # contact_urn is optional
    group_by="subtopic_uuid",  # group_by is optional
    metadata_value="uuid" # metadata_value is optional
)
print(result)

If you don't pass group_by value, the result will be aggregated by value.

Error Handling

The SDK includes proper error handling. Always wrap your calls in try-except blocks:

try:
    result = get_message_templates(template_id="template123")
except Exception as e:
    print(f"Error: {e}")

Best Practices

Environment Variables: Always ensure all required environment variables are set before using the SDK.
Path Validation: Use proper path classes instead of raw strings.
Error Handling: Implement proper error handling in your code.
Data Types: Ensure you're passing the correct data types for each parameter.
Security: Never hardcode sensitive information like tokens or credentials.

Common Issues and Solutions

Connection Issues
- Ensure DATALAKE_SERVER_ADDRESS is correct and accessible
- Check your network connectivity
Authentication Errors
- Verify your AWS credentials are properly configured
- Check if REDSHIFT_SECRET and REDSHIFT_ROLE_ARN are correct
Missing Environment Variables
- Double-check all required environment variables are set
- Use a .env file for local development

Contributing

For contributing to this SDK, please follow these steps:

Fork the repository
Create a feature branch
Commit your changes
Push to the branch
Create a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.8.0

Apr 29, 2026

0.7.0

Mar 24, 2026

0.6.2

Mar 12, 2026

0.6.1

Oct 29, 2025

0.6.0

Oct 7, 2025

0.5.0

Jul 31, 2025

0.4.0

Jul 28, 2025

0.4.0a1 pre-release

Jul 28, 2025

0.4.0a0 pre-release

Jul 25, 2025

This version

0.3.0

Jul 24, 2025

0.3.0a3 pre-release

Jul 23, 2025

0.3.0a2 pre-release

Jul 22, 2025

0.3.0a1 pre-release

Jul 17, 2025

0.3.0a0 pre-release

Jul 16, 2025

0.2.4

Jul 2, 2025

0.2.4a0 pre-release

Jun 25, 2025

0.2.3

Jun 18, 2025

0.2.2

Jun 18, 2025

0.2.2a5 pre-release

Jun 17, 2025

0.2.2a1 pre-release

Jun 13, 2025

0.2.2a0 pre-release

Jun 12, 2025

0.2.1

Jun 4, 2025

0.2.1a2 pre-release

Jun 4, 2025

0.2.1a1 pre-release

Jun 3, 2025

0.2.1a0 pre-release

May 23, 2025

0.2.0

May 2, 2025

0.2.0a0 pre-release

Apr 30, 2025

0.1.1

Apr 8, 2025

0.1.0

Apr 3, 2025

0.0.9

Apr 3, 2025

0.0.8

Mar 28, 2025

0.0.7

Mar 28, 2025

0.0.6

Mar 25, 2025

0.0.5

Mar 25, 2025

0.0.4

Mar 25, 2025

0.0.3

Mar 24, 2025

0.0.2

Mar 24, 2025

0.0.1

Mar 21, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

weni_datalake_sdk-0.3.0.tar.gz (28.7 kB view details)

Uploaded Jul 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

weni_datalake_sdk-0.3.0-py3-none-any.whl (40.7 kB view details)

Uploaded Jul 24, 2025 Python 3

File details

Details for the file weni_datalake_sdk-0.3.0.tar.gz.

File metadata

Download URL: weni_datalake_sdk-0.3.0.tar.gz
Upload date: Jul 24, 2025
Size: 28.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.11.13 Linux/6.11.0-1018-azure

File hashes

Hashes for weni_datalake_sdk-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`0c19e935b648d2c2677ace3080f07bf2b0a91367bc0f9e87cbaf9c745bfcd2d1`
MD5	`daa914f84a830fe17125f726ceab2e0f`
BLAKE2b-256	`78c3a73dfacc65d10485a9156ea2fd78a12a12b24fafec7b7aea63c0e3cc3dc9`

See more details on using hashes here.

File details

Details for the file weni_datalake_sdk-0.3.0-py3-none-any.whl.

File metadata

Download URL: weni_datalake_sdk-0.3.0-py3-none-any.whl
Upload date: Jul 24, 2025
Size: 40.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/1.8.5 CPython/3.11.13 Linux/6.11.0-1018-azure

File hashes

Hashes for weni_datalake_sdk-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1fb089d5ae9f1796a21dc9d0854d581edb7a7033d9e7c5eb5aa115d8ef986211`
MD5	`6e47df90b82418dad788103ba175c72a`
BLAKE2b-256	`c984d0ccdd5442f9ee0ed3b4bd2a2acd7002a47d09b9feb03e84d9a3840771bf`

See more details on using hashes here.

weni-datalake-sdk 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Weni Data Lake SDK

Installation

Environment Variables

Usage Examples

1. Sending Data

2. Send Event Data

2. Get Message Templates

3. Get Traces

4. Get Events

5. Get Events Count

6. Get Events Count By Group

Error Handling

Best Practices

Common Issues and Solutions

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes