Skip to main content

A Harlequin adapter for ADBC drivers.

Project description

harlequin-adbc

This repo provides the Harlequin adapter for ADBC (arrow database connectivity).

Warning

ADBC is a very new database connectivity method compared to ODBC, JDBC that aims at providing a more efficient way to transfer columnar data over the wire.

Due to the recency of its development, there is various level of functionality across drivers which may cause issues.

Installation

harlequin-adbc depends on adbc_driver_manager, harlequin and pyarrow, so installing this package with also install these dependencies.

I also recommend installing the extra adbc drivers for the databases you plan on connecting to otherwise you will have to download the driver yourself from another place and provide the driver_path as a cli argument. The driver path connection method hasn't been thoroughly test.

If you don't install the driver but provide a --drive-type cli argument you will get an ImportError

Using pip

To install the adapter using pip into an activated virtual environment:

pip install harlequin-adbc

with one of the driver extras:

pip install harlequin-adbc[snowflake]

Using poetry

poetry add harlequin-adbc

with one of the driver extras:

poetry add harlequin-adbc[snowflake]

Using pipx

If you do not already have Harlequin installed:

pipx install harlequin-adbc

pipx install harlequin-adbc[snowflake]

If you would like to add ADBC adapter to an existing Harlequin installation:

pipx inject harlequin harlequin-adbc

Make sure to also inject the driver package you want as well. The following options are:

  • adbc-driver-flightsql
  • adbc-driver-postgresql
  • adbc-driver-snowflake
  • adbc-driver-sqlite

Example usage:

pipx inject harlequin adbc-driver-snowflake

Usage and Configuration

You can open Harlequin with the ADBC adapter by selecting it with the -a option and passing in a connection string. The format of the connection string will depend on the driver you are using. You will also need to provide either the --driver-type or the --driver-path.

Driver Type (preferred)

When you use the --driver-type option it will try to dynamically use the driver package of the type selected adbc-driver-{driver type}. That is why it's crucial to also have that package also install in the virtual environment where Harlequin is installed.

  • --driver-type with one of the following options
    • flightsql, postgresql, snowflake, sqlite, duckdb

Driver Path

The other option is to pass the file path location of the adbc driver that you are using. Note this method is not well tested.

  • --driver-path

DB Kwargs String (Optional)

Since the drivers implement so many different options to pass through when you connect to the database this is a way to pass through these options. The format of the string is key=value separated by ;

  • --db-kwargs-str

Example: --db-kwargs-str "username=flight_username;password=flight_password;adbc.flight.sql.client_option.tls_skip_verify=true"

This will parse the string and pass through these values into the db_kwargs value in the dbapi.connect() method. Note the parser relies on the ; and = so if any of of the parameters in the string contain either of these characters it's not going to work. This is a known limitation and something being worked on.

Snowflake Driver

The Snowflake URI should be of one of the following formats:

  • user[:password]@account/database/schema[?param1=value1&paramN=valueN]
  • user[:password]@account/database[?param1=value1&paramN=valueN]
  • user[:password]@host:port/database/schema?account=user_account[&param1=value1&paramN=valueN]
  • host:port/database/schema?account=user_account[&param1=value1&paramN=valueN]

Check the Snowflake ADBC Driver Docs for more details.

Example usage:

harlequin -a adbc "$SNOWFLAKE_URI" --driver-type snowflake

FlightSQL Driver

Example usage:

harlequin -P None -a adbc "grpc+tls://localhost:31337" --driver-type flightsql --db-kwargs-str "username=flight_username;password=flight_password;adbc.flight.sql.client_option.tls_skip_verify=true"

Check the FlightSQL ADBC Driver Docs for more details.

Postgres Driver

The Postgres URI should be in the format of a Postgres DSN:

harlequin -a postgres "postgres://my-user:my-pass@localhost:5432/my-database"

DuckDB and SQLite

While DuckDB and SQLite both have ADBC drivers it's not recommend to use them. Harlequin natively supports both of the databases without having to install any other dependencies.

Know Issues

  • Snowflake adbc driver seems to be the only driver that returns the xdbc_data_type from adbc_get_objects()
  • Snowflake adbc driver has a bug with adbc_get_table_schema() that returns adbc_driver_manager.OperationalError: IO: sql: expected 12 destination arguments in Scan, not 11
  • The PostgreSQL adbc driver is overall buggy and when executing queries you might get the error IO: [libpq] Fetch header failed: no COPY in progress
  • SQLite and DuckDB don't have the same level of depth for adbc_get_objects() compared to other dbs which causes weird issues.
  • DuckDB driver uses different names for the adbc_get_objects() which causes things to break.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

harlequin_adbc-0.1.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

harlequin_adbc-0.1.0-py3-none-any.whl (7.7 kB view details)

Uploaded Python 3

File details

Details for the file harlequin_adbc-0.1.0.tar.gz.

File metadata

  • Download URL: harlequin_adbc-0.1.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.13 Linux/6.2.0-1019-azure

File hashes

Hashes for harlequin_adbc-0.1.0.tar.gz
Algorithm Hash digest
SHA256 7a9c8de72fce767104730fd0f0e4de3583ecc9503e9ca39d36785082b489e21f
MD5 fb556d16f11d03d34fd8539b4b9c991d
BLAKE2b-256 f4dced86369db0c25a6a54176a4b6cf74f9f3c323f4dd60dfe68076aaf5be372

See more details on using hashes here.

File details

Details for the file harlequin_adbc-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: harlequin_adbc-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.6.1 CPython/3.10.13 Linux/6.2.0-1019-azure

File hashes

Hashes for harlequin_adbc-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 b6d42b64b687671bc35713bf59e99bf89f80953e4235962e2e822fbc8f29a2a2
MD5 0aacab4f9a7b51b5ffddfc04bdc705dd
BLAKE2b-256 4b5eda6b9b6a4f3af1812b750c7862c43b0a98053afb05b3471c5e7e7692eec1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page