Sync an Airtable base to a Postgres schema in real time
Airtable Postgres Sync
The goal of this library is to provide an out-of-the-box solution for replicating an entire Airtable base in a Postgres schema. There are two modes of operation:
- One-off sync: This mode will replicate the Airtable base in the specified Postgres schema and then exit. This is useful for creating snapshots of the base for analysis or for storage as a backup.
- Perpetual sync: This mode will replicate the Airtable base in the specified Postgres schema and then continue to watch for changes in the base. When a change is detected, the change will be applied to the Postgres schema. This is useful for creating a replica of the base that can be used for analysis in real time.
This library will produce a Postgres table and a view for each of the tables in the specified Airtable base. The table takes the Airtable table id as its name and the field ids as its column names. The view has the same name as the Airtable table, and its column names are the same as the Airtable field names. For most analysis use cases it makes sense to use the view, as it is more readable. Applications that must be robust to table or field renames should use the table instead.
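To make the relationship between the id-named table and the readable view concrete, here is a minimal sketch of the kind of `CREATE VIEW` statement that such a mapping implies. It is illustrative only: the function names `quote_ident` and `build_view_sql`, the example ids, and the exact SQL are assumptions, not the library's actual implementation.

```python
def quote_ident(name: str) -> str:
    """Quote a Postgres identifier, doubling any embedded double quotes."""
    return '"' + name.replace('"', '""') + '"'


def build_view_sql(schema: str, table_id: str, table_name: str, fields: dict[str, str]) -> str:
    """Return a CREATE VIEW statement exposing an id-named table under readable names.

    `fields` maps Airtable field ids (table columns) to field names (view columns).
    """
    columns = ",\n    ".join(
        f"{quote_ident(field_id)} AS {quote_ident(field_name)}"
        for field_id, field_name in fields.items()
    )
    return (
        f"CREATE OR REPLACE VIEW {quote_ident(schema)}.{quote_ident(table_name)} AS\n"
        f"SELECT\n    {columns}\n"
        f"FROM {quote_ident(schema)}.{quote_ident(table_id)};"
    )


# Hypothetical schema, ids, and names, for illustration only.
sql = build_view_sql("airtable", "tblA1", "Customers", {"fld1": "Name", "fld2": "Signup Date"})
print(sql)
```

Because the view is only a renaming layer, renaming a field in Airtable would change the view's column name while the underlying id-named column stays the same, which is why the table is the safer target for applications.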
Installation
To install the library, run the following command:
pip install airtable-postgres-sync
Permissions
To use this library, you will need to create a personal access token in Airtable. This token will need to have the following scopes:
- data.records:read
- schema.bases:read
- webhook:manage
You will also need to give the Postgres user that you are using read and write access to the schema you are syncing to.
Usage
To use the library, you will need to create a config file. The config file defines all the parameters that are needed to connect to Airtable and Postgres, as well as how your program will listen for changes. The file must be in YAML format and must contain the following fields:
AIRTABLE_PG_SYNC:
  DB_INFO:
    HOST: # Postgres host
    PORT: # Postgres port
    USER: # Postgres user
    PASSWORD: # Postgres password
    DB_NAME: # Postgres database name
    SCHEMA_NAME: # Postgres schema name
  AIRTABLE_INFO:
    BASE_ID: # Airtable base id to sync
    PAT: # Airtable personal access token
  LISTENER_INFO:
    WEBHOOK_URL: # The url that Airtable will send change notifications to
    PORT: # The port to listen for change notifications on
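Before starting a sync, it can help to check that every field listed above is filled in. The sketch below does this on an already-parsed config dictionary. It assumes the three sections are nested under `AIRTABLE_PG_SYNC`; the helper name `missing_config_keys` and all example values are hypothetical and not part of the library.

```python
# Required fields, taken from the config layout described above.
REQUIRED_KEYS = {
    "DB_INFO": ["HOST", "PORT", "USER", "PASSWORD", "DB_NAME", "SCHEMA_NAME"],
    "AIRTABLE_INFO": ["BASE_ID", "PAT"],
    "LISTENER_INFO": ["WEBHOOK_URL", "PORT"],
}


def missing_config_keys(config: dict) -> list[str]:
    """Return dotted paths of required fields that are absent or empty."""
    root = config.get("AIRTABLE_PG_SYNC") or {}
    missing = []
    for section, keys in REQUIRED_KEYS.items():
        values = root.get(section) or {}
        for key in keys:
            if values.get(key) in (None, ""):
                missing.append(f"AIRTABLE_PG_SYNC.{section}.{key}")
    return missing


# Hypothetical values; the PAT has been left blank on purpose.
example = {
    "AIRTABLE_PG_SYNC": {
        "DB_INFO": {
            "HOST": "localhost", "PORT": 5432, "USER": "sync",
            "PASSWORD": "secret", "DB_NAME": "analytics", "SCHEMA_NAME": "airtable",
        },
        "AIRTABLE_INFO": {"BASE_ID": "appXXXXXXXXXXXXXX", "PAT": None},
        "LISTENER_INFO": {"WEBHOOK_URL": "https://example.com/webhook", "PORT": 8080},
    }
}
print(missing_config_keys(example))
```

In practice the dictionary would come from parsing the YAML file, for example with PyYAML's `yaml.safe_load`, which returns `None` for fields left blank as in the template above.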
The library can be used in two ways:
- As a command line tool
To trigger a one-time sync, run the following command:
airtable-pg-sync one-time-sync --config /path/to/config.yml
To trigger a perpetual sync, run the following command:
airtable-pg-sync perpetual-sync --config /path/to/config.yml
- As a Python library
To trigger a sync from within a Python program, run the following code:
from airtable_pg_sync import Sync
# Set perpetual=False for a one-off sync.
Sync(config_path="/path/to/config.yml", perpetual=True).run()
Testing and Deployment
When testing this library for your use case, the ngrok service is very useful. It exposes a port on your PC to the internet, so that requests sent to a public URL (i.e. the webhook POST requests from Airtable) reach your local listener.
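Before pointing the library at an ngrok URL, you may want to confirm that the tunnel actually forwards POST requests to your machine. The following is a minimal stand-in listener using only the Python standard library. It is not the library's own listener; the names `EchoHandler`, `make_server`, and `received` are hypothetical, and it makes no assumption about the contents of Airtable's notifications.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

received = []  # raw bodies of the POST requests seen so far


class EchoHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Read exactly the number of bytes the client announced.
        length = int(self.headers.get("Content-Length", 0))
        received.append(self.rfile.read(length))
        # Acknowledge with an empty 200 response.
        self.send_response(200)
        self.send_header("Content-Length", "0")
        self.end_headers()

    def log_message(self, format, *args):
        # Silence the default per-request logging to stderr.
        pass


def make_server(port: int) -> HTTPServer:
    """Create (but do not start) a local server for checking the tunnel."""
    return HTTPServer(("127.0.0.1", port), EchoHandler)
```

To try it, call `make_server(8080).serve_forever()` in one terminal, run `ngrok http 8080` in another, and send a POST request to the public URL that ngrok prints. Each request body is appended to `received`.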
For deployment, it is recommended that you run the library on a cloud server such as an AWS EC2 instance. A t2.micro instance should suffice.
Bugs, Feature Requests, and Contributions
If you find a bug or have a feature request, please open an issue on GitHub. Any contributions are welcome and appreciated. If you would like to contribute, please open a pull request on GitHub.
Ideas for contributions:
- Add support for other databases
- Add support for Postgres -> Airtable sync
License
This library is licensed under the MIT License. See the LICENSE file for details.