Skip to main content

The avrio adapter plugin for dbt (data build tool)

Project description

dbt-avrio

The dbt-avrio adapter allows users to interact with avrio's Trino Engine, a distributed SQL query engine, using dbt. This adapter is designed to facilitate the use of dbt for transforming and modeling data within avrio.

Features

  • avrio Compatibility: Compatible with avrio's Trino Query Engine, allowing users to leverage dbt within avrio.
  • JWT Authentication: Utilizes JWT for secure authentication with avrio. The adapter handles the generation of JWT tokens behind the scenes via username and password configured in profiles.yml.
  • avrio eliminates complex data-engineering and de-couples migrations to help businesses experience a quantum leap in the insights.

Description

About DBT

dbt (data build tool) is a powerful data transformation workflow tool that facilitates quick and collaborative deployment of analytics code. It follows software engineering best practices such as modularity, continuous integration/deployment (CI/CD), testing, and documentation. With dbt, individuals familiar with SQL can easily build production-grade data pipelines.

Connecting to Multiple Data Sources

Challenges in Limited Connectivity
Default configurations in dbt make it challenging to connect to databases beyond the primary one associated with the project. Many organizations deal with diverse data sources, such as Oracle, Snowflake, SQL Server, Google BigQuery, Delta Lake, Iceberg, AWS Redshift, Vertica, Azure Synapse, MongoDB, AWS S3 and other data sources. Integrating and transforming data from these varied sources efficiently becomes essential for comprehensive analytics.

Solution: Leveraging Trino and Catalogs in avrio

avrio's Trino is an advanced query engine that excels in federated queries across multiple data sources and also allows the writeback capabilities to variety of data sources. Its ability to connect to various databases and process SQL queries at scale makes it an ideal solution for organizations dealing with diverse data sources.

avrio, built on Trino introduces the concept of catalogs to address the challenge of connecting to multiple data sources seamlessly within dbt. Each catalog corresponds to a specific data source, enabling a unified approach to managing and transforming data across various systems.

Connecting to avrio

Example profiles.yml

Here is a example of a dbt-avrio profile parameters. At a minimum, you need to specify type, method, username, password host, port, schema, catalog and threads.

<profile-name>:
  outputs:
    dev:
      type: avrio
      method: jwt 
      username: [username for jwt auth]
      password: [password for jwt auth]  
      host: [avrio hostname]
      port: [port number]
      schema: [dev_schema]
      catalog: [catalog_name]
      threads: [1 or more]

    prod:
      type: avrio
      method: jwt 
      username: [username for jwt auth]
      password: [password for jwt auth]  
      host: [avrio hostname]
      port: [port number]
      schema: [dev_schema]
      catalog: [catalog_name]
      threads: [1 or more]
  target: dev

Description of Profile Fields

Parameter Type Description
type string Specifies the type of dbt adapter (avrio).
method jwt Authentication method for JWT authentication.
username string Username for JWT authentication. The obtained JWT token is used to initialize a trino.auth.JWTAuthentication object.
password string Password for JWT authentication. The obtained JWT token is used to initialize a trino.auth.JWTAuthentication object.
host string The host parameter specifies the hostname or IP address of the Trino server.
port integer The port parameter specifies the port number on which the Trino server is listening.
schema string Schema or database name for the connection.
catalog string Name of the catalog representing the data source.
threads integer Number of threads for parallel execution of queries. (1 or more

Getting Started

Install dbt-avrio adapter

 pip install  dbt-avrio

Note: The adapter requires dbt-core>=1.5, <1.8. Ensure that your environment meets this requirement.

Initialize dbt project

dbt init
  • Select Adapters : After running the dbt init command, you'll be prompted to select adapters from a list of available adapters. Choose the appropriate adapter for your project.
  • Modify Profiles.yml : The dbt init command will create a project structure in current dir and a .dbt folder inside users of your system. Inside .dbt folder, you'll find a profiles.yml file.

Configure Profiles.yml

  • Open the profiles.yml file in a text editor.
  • Locate the section for your selected adapter and project (e.g., avrio).
  • Add the necessary connection details such as host, port, user, password, etc.
  • Save and close the profiles.yml file.

Check connection is successful

dbt debug

To run all models use below command

dbt run

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dbt-avrio-20.0.0.tar.gz (24.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dbt_avrio-20.0.0-py3-none-any.whl (30.2 kB view details)

Uploaded Python 3

File details

Details for the file dbt-avrio-20.0.0.tar.gz.

File metadata

  • Download URL: dbt-avrio-20.0.0.tar.gz
  • Upload date:
  • Size: 24.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.4

File hashes

Hashes for dbt-avrio-20.0.0.tar.gz
Algorithm Hash digest
SHA256 87f97d072eff03ce82ec7b032baf3b959dd4148debd3f5f3e9170f71a15e8a5f
MD5 867cacede4347496a7a690388b345e41
BLAKE2b-256 9148e2ecfa485ca5b6f9453657fd5c471a66271a5e9cb3bed14aa5065334cd07

See more details on using hashes here.

File details

Details for the file dbt_avrio-20.0.0-py3-none-any.whl.

File metadata

  • Download URL: dbt_avrio-20.0.0-py3-none-any.whl
  • Upload date:
  • Size: 30.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.4

File hashes

Hashes for dbt_avrio-20.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 76e45b8a1ab6e0a00c8c546ba8afcfbae55a3b67dbe42ccf8cca03699abb8a51
MD5 d1598dcd4e7d65f0ccdbd912524626b5
BLAKE2b-256 620a75c4bcfc479096fe7124a1867d5a0e43e26fed373fe236b92bece76ca411

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page