The dbt-spark-livy adapter plugin for Spark in Cloudera DataHub with Livy interface
Project description
dbt-spark-livy
The dbt-spark-livy
adapter allows you to use dbt along with Apache spark-livy and Cloudera Data Platform with Livy server support. This code bases use the dbt-spark project (https://github.com/dbt-labs/dbt-spark), and provides a Livy connectivity support over it.
Getting started
- Install dbt
- Read the introduction and viewpoint
Requirements
Python >= 3.8
dbt-core >= 1.1.0
pyspark
sqlparams
Installing dbt-spark-livy
pip install dbt-spark-livy
Profile Setup
demo_project:
target: dev
outputs:
dev:
type: spark_livy
method: livy
schema: my_db
host: https://spark-livy-gateway.my.org.com/dbt-spark/cdp-proxy-api/livy_for_spark3/
user: my_user
password: my_pass
Caveats
- While using livy , in the Livy UI if you notice sessions change state to dead from starting instead of idle, make sure there is a proper mapping for the user in the IDBroker mapping section
- Actions > Manage Access > IDBroker Mappings . Reference
- Also make sure the workload password is set either through UI or CLI. Reference
Supported features
Please see the original adapter documentation: https://github.com/dbt-labs/dbt-spark and https://docs.getdbt.com/reference/warehouse-profiles/spark-profile
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
dbt-spark-livy-1.1.4.tar.gz
(28.4 kB
view hashes)
Built Distribution
Close
Hashes for dbt_spark_livy-1.1.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 971fa46bffd3be4277a30278d6e4ddd1d74309f9950947f75190c5570bc06056 |
|
MD5 | 6a35b1815333d7c4d9f1b614b0eb8e39 |
|
BLAKE2b-256 | 455d17b40a11c05a3d9cecfcfca5a6b17ad09b9c6a4f34d03f1bf9a10755d7f4 |