Skip to main content

Postgres to Elasticsearch/OpenSearch sync

Project description

PostgreSQL to Elasticsearch/OpenSearch sync

Requirements

Postgres setup

Enable logical decoding in your Postgres setting.

  • You also need to set up two parameters in your Postgres config postgresql.conf

    wal_level = logical

    max_replication_slots = 1

Installation

You can install PGSync from PyPI:

$ pip install pgsync

Config

Create a schema for the application named e.g schema.json

Example schema

Example spec

.. code-block::

[
    {
        "database": "[database name]",
        "index": "[Elasticsearch or OpenSearch index]",
        "nodes": {
            "table": "[table A]",
            "schema": "[table A schema]",
            "columns": [
                "column 1 from table A",
                "column 2 from table A",
                ... additional columns
            ],
            "children": [
                {
                    "table": "[table B with relationship to table A]",
                    "schema": "[table B schema]",
                    "columns": [
                      "column 1 from table B",
                      "column 2 from table B",
                      ... additional columns
                    ],
                    "relationship": {
                        "variant": "object",
                        "type": "one_to_many"
                    },
                    ...
                },
                {
                    ... additional children
                }
            ]
        }
    }
]

Environment variables

Setup environment variables required for the application

SCHEMA='/path/to/schema.json'

ELASTICSEARCH_HOST=localhost
ELASTICSEARCH_PORT=9200

PG_HOST=localhost
PG_USER=i-am-root # this must be a postgres superuser or replication user
PG_PORT=5432
PG_PASSWORD=*****

REDIS_HOST=redis
REDIS_PORT=6379
REDIS_DB=0
REDIS_AUTH=*****

Running

Bootstrap the database (one time only)

  • $ bootstrap --config schema.json

Run pgsync as a daemon

  • $ pgsync --config schema.json --daemon

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pgsync-4.0.0.tar.gz (114.5 kB view details)

Uploaded Source

Built Distribution

pgsync-4.0.0-py3-none-any.whl (61.2 kB view details)

Uploaded Python 3

File details

Details for the file pgsync-4.0.0.tar.gz.

File metadata

  • Download URL: pgsync-4.0.0.tar.gz
  • Upload date:
  • Size: 114.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for pgsync-4.0.0.tar.gz
Algorithm Hash digest
SHA256 10cf412435ddbebd8b3974a3d5d5bb5e7da0c5f513a2222f127e06dd7d196e28
MD5 a0525e3d8a1ecd778444066a35ec908a
BLAKE2b-256 9972bc64d349695117fcd0196c956db0e932d8319bf228c109a1039c1c6c8b4f

See more details on using hashes here.

File details

Details for the file pgsync-4.0.0-py3-none-any.whl.

File metadata

  • Download URL: pgsync-4.0.0-py3-none-any.whl
  • Upload date:
  • Size: 61.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.2

File hashes

Hashes for pgsync-4.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e0d923a45e25aa91df7da1bd2082d865c91981e3a725b6722e5c935027ca6ac6
MD5 a287dded2e07ec955700c4e4e73545c7
BLAKE2b-256 ff5d858acd2de0216d4096d6a09a8e7920277c42a39f03ebebfac69f53a5f6c7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page