BigQuery Foreign Data Wrapper for PostgreSQL
Project description
bigquery_fdw is a BigQuery foreign data wrapper for PostgreSQL using Multicorn.
It allows to write queries in PostgreSQL SQL syntax using a foreign table. It supports most of BigQuery’s data types and operators.
Features and limitations
Table partitioning is supported. You can use partitions in your SQL queries.
Queries are parameterized when sent to BigQuery
BigQuery’s standard SQL support (legacy SQL is not supported)
Authentication works with a “Service Account” Json private key
Requirements
PostgreSQL >= 9.5
Python >= 3.4
⚠️ Migrating to version 1.8 from versions 1.7 and below
Starting with version 1.8, the fdw_key option is deprecated and replaced with a default environment variable. See Authentication.
Get started
Using docker
Installation on Debian/Ubuntu
Dependencies required to install bigquery_fdw:
You need to install the following dependencies:
# Install required packages
apt-get update
apt-get install --yes postgresql-server-dev-12 python3-setuptools python3-dev make gcc git
For PostgresSQL 9.X, install postgresql-server-dev-9.X instead of postgresql-server-dev-12.
All PostgreSQL versions from 9.2 to 12 should be supported. Building Multicorn against PostgreSQL 13 is currently not working properly (as of 1/21/2013).
Installation
# Install Multicorn
# gabfl/Multicorn is a fork of Segfault-Inc/Multicorn that adds better support for Python3.
# You may chose to build against the original project instead.
git clone git://github.com/gabfl/Multicorn.git && cd Multicorn
make && make install
# Install bigquery_fdw
pip3 install bigquery-fdw
Major dependencies installed automatically during the installation process:
Authentication
bigquery_fdw relies on Google Cloud API’s default authentication.
Your need to have an environment variable GOOGLE_APPLICATION_CREDENTIALS that has to be accessible by bigquery_fdw. Setting environment variables varies depending on OS but for Ubuntu or Debian, the preferred way is to edit /etc/postgresql/[version]/main/environment and add:
GOOGLE_APPLICATION_CREDENTIALS = '/path/to/key.json'
Restarting PostgreSQL is required for the environment variable to be loaded.
Usage
We recommend testing the BigQuery client connectivity before trying to use the FDW.
With psql:
CREATE EXTENSION multicorn;
CREATE SERVER bigquery_srv FOREIGN DATA WRAPPER multicorn
OPTIONS (
wrapper 'bigquery_fdw.fdw.ConstantForeignDataWrapper'
);
CREATE FOREIGN TABLE my_bigquery_table (
column1 text,
column2 bigint
) SERVER bigquery_srv
OPTIONS (
fdw_dataset 'my_dataset',
fdw_table 'my_table'
);
Options
List of options implemented in CREATE FOREIGN TABLE syntax:
Option |
Default |
Description |
---|---|---|
fdw_dataset |
BigQuery dataset name |
|
fdw_table |
BigQuery table name |
|
fdw_convert_tz |
Convert BigQuery time zone for dates and timestamps to selected time zone. Example: 'US/Eastern'. |
|
fdw_group |
'false' |
|
fdw_casting |
See Casting. |
|
fdw_verbose |
'false' |
Set to 'true' to output debug information in PostrgeSQL’s logs |
fdw_sql_dialect |
'standard' |
BigQuery SQL dialect. Currently only standard is supported. |
More documentation
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for bigquery_fdw-1.8-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8bb24b4116a9e5533f40e194a32867169a0f3e6b1d30b86af7b66a22296b3365 |
|
MD5 | c37c3adf4cc5130b282cb85882bc6d02 |
|
BLAKE2b-256 | bed25873bac728e46b92dd44a04215f6cdff74ce0a3f915c1cde0d22c9385af9 |