Support for VDK Managed Oracle connection

oracle

TODO: what the project is about, what is its purpose

Usage

pip install vdk-oracle

Configuration

(vdk config-help is a useful command for browsing all config options of your vdk installation.)

Name	Description	(example) Value
oracle_user	Username used when connecting to Oracle database	my_user
oracle_password	Password used when connecting to Oracle database	super_secret_shhhh
oracle_use_secrets	Set to True to use secrets to connect to Oracle	True
oracle_connection_string	The Oracle connection string	localhost:1521/free
oracle_host	The host of the Oracle database. Note: This gets overridden if oracle_connection_string is set.	localhost
oracle_port	The port of the Oracle database. Note: This gets overridden if oracle_connection_string is set.	1521
oracle_sid	The SID of the Oracle database. Note: This gets overridden if oracle_connection_string is set.	free
oracle_service_name	The Service name of the Oracle database. Note: This gets overridden if oracle_connection_string is set.	free
oracle_thick_mode	Python-oracledb is said to be in Thick mode when Oracle Client libraries are used. True by default. Set to False to disable Oracle Thick mode. More info: https://python-oracledb.readthedocs.io/en/latest/user_guide/appendix_b.html	True
oracle_ingest_batch_size	vdk-oracle splits ingestion payloads into batches. Change this config to control the batch size. Default is set to 100.	100
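
The override rule noted in the table (oracle_connection_string takes precedence over oracle_host, oracle_port and oracle_service_name) can be sketched as follows. This is only an illustration, not vdk-oracle's actual implementation: resolve_dsn is a hypothetical helper, and the host:port/service_name format is assumed from the example values in the table.

```python
from typing import Mapping, Optional


def resolve_dsn(config: Mapping[str, Optional[str]]) -> str:
    """Hypothetical helper: pick the connection target the way the table describes."""
    connection_string = config.get("oracle_connection_string")
    if connection_string:
        # An explicit connection string overrides host, port and service name.
        return connection_string
    host = config.get("oracle_host")
    port = config.get("oracle_port")
    service_name = config.get("oracle_service_name")
    # Assumed format, mirroring the example value localhost:1521/free.
    return f"{host}:{port}/{service_name}"


parts_only = {
    "oracle_host": "localhost",
    "oracle_port": "1521",
    "oracle_service_name": "free",
}
with_string = dict(parts_only, oracle_connection_string="db.example:1522/reports")

print(resolve_dsn(parts_only))
print(resolve_dsn(with_string))
```

The second call ignores the host, port and service name because a connection string is present.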

Example

CLI Queries

export VDK_ORACLE_USER=my_username
export VDK_ORACLE_PASSWORD=my_password
export VDK_ORACLE_CONNECTION_STRING=localhost:1521/free
vdk oracle-query -q "SELECT * FROM TEST_TABLE"

Note: CLI queries do not support secrets.

Ingestion

import datetime
from decimal import Decimal


def run(job_input):
    # Build the timestamp once and reuse it in every payload below
    timestamp = datetime.datetime.fromtimestamp(1700554373)

    # Ingest a single object (one row)
    payload_with_types = {
        "id": 5,
        "str_data": "string",
        "int_data": 12,
        "float_data": 1.2,
        "bool_data": True,
        "timestamp_data": timestamp,
        # Pass a string so the decimal value is exact (Decimal(0.1) is not)
        "decimal_data": Decimal("0.1"),
    }

    job_input.send_object_for_ingestion(
        payload=payload_with_types, destination_table="test_table"
    )

    # Ingest tabular data: one list per row, values ordered as in col_names
    col_names = [
        "id",
        "str_data",
        "int_data",
        "float_data",
        "bool_data",
        "timestamp_data",
        "decimal_data",
    ]
    row_data = [
        [0, "string", 12, 1.2, True, timestamp, Decimal("1.1")],
        [1, "string", 12, 1.2, True, timestamp, Decimal("1.1")],
        [2, "string", 12, 1.2, True, timestamp, Decimal("1.1")],
    ]
    job_input.send_tabular_data_for_ingestion(
        rows=row_data, column_names=col_names, destination_table="test_table"
    )
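
The Configuration table says that vdk-oracle splits ingestion payloads into batches of oracle_ingest_batch_size rows (100 by default). The sketch below shows what such a split looks like in plain Python. It is an assumption about the general technique, not the plugin's own code, and split_into_batches is a hypothetical name.

```python
from typing import Iterator, List, Sequence


def split_into_batches(rows: Sequence[list], batch_size: int = 100) -> Iterator[List[list]]:
    """Yield consecutive slices of rows, each holding at most batch_size rows."""
    if batch_size < 1:
        raise ValueError("batch_size must be a positive integer")
    for start in range(0, len(rows), batch_size):
        yield list(rows[start:start + batch_size])


# 250 illustrative rows are split into batches of 100, 100 and 50.
rows = [[i, "string"] for i in range(250)]
batch_lengths = [len(batch) for batch in split_into_batches(rows, batch_size=100)]
print(batch_lengths)  # [100, 100, 50]
```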

Ingestion with type inference

Ingestion works with an already created table even if you pass strings in the payload. vdk-oracle infers the correct type based on the existing table.

create table test_table (
    id number,
    str_data varchar2(255),
    int_data number,
    float_data float,
    bool_data number(1),
    timestamp_data timestamp,
    decimal_data decimal(14,8),
    primary key(id)
)

def run(job_input):
    payload = {
        "id": "5",
        "str_data": "string",
        "int_data": "12",
        "float_data": "1.2",
        "bool_data": "False",
        "timestamp_data": "2023-11-21T08:12:53",
        "decimal_data": "0.1",
    }

    job_input.send_object_for_ingestion(payload=payload, destination_table="test_table")
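
To make the idea of type inference concrete, the sketch below casts string values according to a column-type lookup. Everything in it is illustrative: the CONVERTERS mapping, the cast_payload helper and the hard-coded column_types dictionary are hypothetical, and the plugin may determine and convert types differently. Boolean columns are left out on purpose, since how they are converted is not described here.

```python
import datetime
from decimal import Decimal

# Hypothetical mapping from column type names to Python converters.
CONVERTERS = {
    "NUMBER": Decimal,
    "FLOAT": float,
    "VARCHAR2": str,
    "TIMESTAMP": datetime.datetime.fromisoformat,
}


def cast_payload(payload: dict, column_types: dict) -> dict:
    """Convert string values to the Python type that matches each column."""
    result = {}
    for key, value in payload.items():
        convert = CONVERTERS[column_types[key]]
        # Values that are already typed are passed through unchanged.
        result[key] = convert(value) if isinstance(value, str) else value
    return result


# Assumed column types, written by hand to match part of the table above.
column_types = {
    "id": "NUMBER",
    "str_data": "VARCHAR2",
    "float_data": "FLOAT",
    "timestamp_data": "TIMESTAMP",
}
payload = {
    "id": "5",
    "str_data": "string",
    "float_data": "1.2",
    "timestamp_data": "2023-11-21T08:12:53",
}
typed = cast_payload(payload, column_types)
print(typed)
```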

Case Sensitivity

vdk-oracle supports only payload keys that are entirely lower-case or entirely upper-case; mixed-case keys are not supported. Oracle treats unquoted identifiers as case-insensitive by default, which makes type and schema inference for ingested payloads difficult, so we've opted for the simplest solution to avoid confusion on the user side.

Valid Ingestion

def run(job_input):
    payload = {
        "id": "5",
        "str_data": "string",
        "int_data": "12",
        "float_data": "1.2",
        "bool_data": "False",
        "timestamp_data": "2023-11-21T08:12:53",
        "decimal_data": "0.1",
    }

    job_input.send_object_for_ingestion(payload=payload, destination_table="test_table")

def run(job_input):
    payload = {
        "ID": "5",
        "STR_DATA": "string",
        "INT_DATA": "12",
        "FLOAT_DATA": "1.2",
        "BOOL_DATA": "False",
        "TIMESTAMP_DATA": "2023-11-21T08:12:53",
        "DECIMAL_DATA": "0.1",
    }

    job_input.send_object_for_ingestion(payload=payload, destination_table="TEST_TABLE")

Invalid Ingestion

With mixed-case keys, vdk-oracle will infer the schema, but the data will not be inserted correctly.

def run(job_input):
    payload = {
        "Id": "5",
        "Str_Data": "string",
        "Int_Data": "12",
        "Float_Data": "1.2",
        "Bool_Data": "False",
        "Timestamp_Data": "2023-11-21T08:12:53",
        "Decimal_Data": "0.1",
    }
    job_input.send_object_for_ingestion(payload=payload, destination_table="test_table")
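
A data job can guard against mixed-case keys before sending a payload. The helpers below are a minimal sketch and are not part of vdk-oracle; their names are hypothetical, and they assume the rule applies to each key individually (every key must be all lower-case or all upper-case).

```python
def has_supported_key_case(payload: dict) -> bool:
    """Return True if every key is entirely lower-case or entirely upper-case."""
    return all(key == key.lower() or key == key.upper() for key in payload)


def lowercase_keys(payload: dict) -> dict:
    """Return a copy of the payload with all keys lower-cased."""
    return {key.lower(): value for key, value in payload.items()}


mixed = {"Id": "5", "Str_Data": "string"}
print(has_supported_key_case(mixed))                  # False
print(has_supported_key_case(lowercase_keys(mixed)))  # True
```

Normalizing keys in the job, as lowercase_keys does, is one possible way to keep payloads within the supported forms.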

Multiple Oracle Database Connections

Configuring Multiple Oracle Databases

To manage multiple Oracle database connections within a data job, always configure the default database in the [vdk] section of the config.ini file. This section holds the primary connection details, which are used by default. The default Oracle connection is registered under the name oracle and must always be referred to by that name. Do not create a subsection that resolves to that name: the subsection name vdk_oracle is prohibited.

For each additional Oracle database, add a new section following the pattern vdk_<name>, where <name> is a unique identifier for each database connection. These additional sections must also include all necessary Oracle connection details.

Note: When referring to a connection in code, use the lowercase form of <name>. For example, if the section is named vdk_DEV, refer to the database in the data job as dev.

Example `config.ini` with Multiple Oracle Database Connections

[vdk]
oracle_user=user
oracle_password=password
oracle_host=localhost
oracle_port=1521
oracle_sid=FREE
oracle_connection_string=localhost:1521/FREE
oracle_thick_mode=True

[vdk_oracle_reports]
oracle_user=reports_user
oracle_password=reports_password
oracle_host=localhost
oracle_port=1523
oracle_sid=FREE
oracle_connection_string=localhost:1523/FREE
oracle_thick_mode=False
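
The naming rules above (the [vdk] section is the default connection named oracle; a vdk_<name> section is referred to by the lowercase <name>) can be checked with a short script. This sketch uses Python's standard configparser on a trimmed copy of the example; database_names is a hypothetical helper and does not reflect how vdk itself reads the file.

```python
import configparser

CONFIG_TEXT = """
[vdk]
oracle_user=user
oracle_connection_string=localhost:1521/FREE

[vdk_oracle_reports]
oracle_user=reports_user
oracle_connection_string=localhost:1523/FREE
"""


def database_names(config_text: str) -> dict:
    """Map each config section to the name used for it in a data job."""
    parser = configparser.ConfigParser()
    parser.read_string(config_text)
    names = {}
    for section in parser.sections():
        if section == "vdk":
            names[section] = "oracle"  # the default connection
        elif section.startswith("vdk_"):
            # Drop the vdk_ prefix and lowercase the rest.
            names[section] = section[len("vdk_"):].lower()
    return names


print(database_names(CONFIG_TEXT))
# {'vdk': 'oracle', 'vdk_oracle_reports': 'oracle_reports'}
```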

You can specify which database to use in your data job by referencing the section name without the vdk_ prefix (for example, oracle_reports for the section vdk_oracle_reports).

def run(job_input):

    # Query the default Oracle database; the database argument can be omitted here
    default_query = "SELECT * FROM default_table"
    job_input.execute_query(sql=default_query, database="oracle")

    # Query the reports Oracle database; the database argument is required here,
    # because without it the query runs against the default database
    reports_query = "SELECT * FROM reports_table"
    job_input.execute_query(sql=reports_query, database="oracle_reports")

Ingestion into Multiple Oracle Databases

For data ingestion, you can also specify the target database to ensure the data is sent to the correct Oracle instance.

def run(job_input):

    # Ingest data into the default database
    payload_default = {"col1": "value1", "col2": "value2"}
    job_input.send_object_for_ingestion(
        payload=payload_default,
        destination_table="default_table",
        method="oracle",
        target="oracle"
    )

    # Ingest data into the reports database
    payload_reports = {"col1": "value3", "col2": "value4"}
    job_input.send_object_for_ingestion(
        payload=payload_reports,
        destination_table="reports_table",
        method="oracle_reports",
        target="oracle_reports"
    )

Secrets with Multiple Oracle Databases

With a config like the one above, secret overrides for the default [vdk] section work as usual. For example, to override oracle_user=your_user, create a secret oracle_user with value your_user.

To override a config property in a subsection, prefix the secret name with the subsection name without the vdk_ prefix. For example, to override oracle_user=reports_user for vdk_oracle_reports, create a secret oracle_reports_oracle_user with value reports_user.

Environment variables with Multiple Oracle Databases

Environment variables work much the same way as secrets. For the above config:

export VDK_ORACLE_USER=user # overrides oracle_user=user in section [vdk] (default oracle)
export VDK_ORACLE_REPORTS_ORACLE_USER=reports_user # overrides oracle_user=reports_user in section [vdk_oracle_reports]

Note: Environment variable overrides take precedence over secrets. For example, if you have a secret oracle_reports_oracle_user=reports_user and an environment variable VDK_ORACLE_REPORTS_ORACLE_USER=another_reports_user, the value of oracle_user for section vdk_oracle_reports will be another_reports_user.
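
The naming and precedence rules for secrets and environment variables can be summarized in code. The sketch below is an illustration only: secret_name, env_var_name and resolve are hypothetical helpers, and the lookup order (environment variable, then secret, then config.ini value) is inferred from the description above rather than taken from the plugin's source.

```python
from typing import Mapping


def secret_name(section: str, key: str) -> str:
    """Secret name for a key: plain for [vdk], prefixed with <name> for [vdk_<name>]."""
    if section == "vdk":
        return key
    prefix = section[len("vdk_"):]
    return f"{prefix}_{key}".lower()


def env_var_name(section: str, key: str) -> str:
    """Environment variable name: VDK_ plus the upper-cased secret name."""
    return f"VDK_{secret_name(section, key)}".upper()


def resolve(section: str, key: str, config_value: str,
            secrets: Mapping[str, str], environ: Mapping[str, str]) -> str:
    """Environment variable first, then secret, then the config.ini value."""
    env_value = environ.get(env_var_name(section, key))
    if env_value is not None:
        return env_value
    secret_value = secrets.get(secret_name(section, key))
    if secret_value is not None:
        return secret_value
    return config_value


secrets = {"oracle_reports_oracle_user": "reports_user"}
environ = {"VDK_ORACLE_REPORTS_ORACLE_USER": "another_reports_user"}
print(resolve("vdk_oracle_reports", "oracle_user", "reports_user", secrets, environ))
# another_reports_user
```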

Build and testing

pip install -r requirements.txt
pip install -e .
pytest

In the VDK repo, the ../build-plugin.sh script can also be used.

Note about the CI/CD:

.plugin-ci.yaml is needed only for plugins that are part of the Versatile Data Kit Plugin repo.

The CI/CD is separated into two stages: a build stage and a release stage. The build stage is made up of a few jobs, all of which inherit from the same job configuration and differ only in the Python version they use (3.7, 3.8, 3.9 and 3.10). They run according to rules, which are ordered so that changes to a plugin's directory trigger that plugin's CI, but changes to a different plugin do not.
