Skip to main content

Postgres to Elasticsearch/OpenSearch sync

Project description

PostgreSQL to Elasticsearch/OpenSearch sync

Requirements

Postgres setup

Enable logical decoding in your Postgres setting.

  • You also need to set up two parameters in your Postgres config postgresql.conf

    wal_level = logical

    max_replication_slots = 1

Installation

You can install PGSync from PyPI:

$ pip install pgsync

Config

Create a schema for the application named e.g schema.json

Example schema

Example spec

.. code-block::

[
    {
        "database": "[database name]",
        "index": "[Elasticsearch or OpenSearch index]",
        "nodes": {
            "table": "[table A]",
            "schema": "[table A schema]",
            "columns": [
                "column 1 from table A",
                "column 2 from table A",
                ... additional columns
            ],
            "children": [
                {
                    "table": "[table B with relationship to table A]",
                    "schema": "[table B schema]",
                    "columns": [
                      "column 1 from table B",
                      "column 2 from table B",
                      ... additional columns
                    ],
                    "relationship": {
                        "variant": "object",
                        "type": "one_to_many"
                    },
                    ...
                },
                {
                    ... additional children
                }
            ]
        }
    }
]

Environment variables

Setup environment variables required for the application

SCHEMA='/path/to/schema.json'

ELASTICSEARCH_HOST=localhost
ELASTICSEARCH_PORT=9200

PG_HOST=localhost
PG_USER=i-am-root # this must be a postgres superuser or replication user
PG_PORT=5432
PG_PASSWORD=*****

REDIS_HOST=redis
REDIS_PORT=6379
REDIS_DB=0
REDIS_AUTH=*****

Running

Bootstrap the database (one time only)

  • $ bootstrap --config schema.json

Run pgsync as a daemon

  • $ pgsync --config schema.json --daemon

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

retake-pgsync-2.5.4.tar.gz (110.2 kB view details)

Uploaded Source

Built Distribution

retake_pgsync-2.5.4-py3-none-any.whl (58.0 kB view details)

Uploaded Python 3

File details

Details for the file retake-pgsync-2.5.4.tar.gz.

File metadata

  • Download URL: retake-pgsync-2.5.4.tar.gz
  • Upload date:
  • Size: 110.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for retake-pgsync-2.5.4.tar.gz
Algorithm Hash digest
SHA256 99b40018c6d2c8db78f2f741e266bce61c890754b7c62aad8dc3c634262cf213
MD5 ff56970f314f1e96f9fa2c94f9e72d15
BLAKE2b-256 0faff1cde56a454d8f00f1dc741d1c43c7810909e546c8e4bf76c9260424a3f3

See more details on using hashes here.

File details

Details for the file retake_pgsync-2.5.4-py3-none-any.whl.

File metadata

File hashes

Hashes for retake_pgsync-2.5.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c1c8a93387ef8bb6949d2f7bd60861d2e7d6dbe39ba8e0f0f256bf136cef50d0
MD5 2de5cde9ad4787a2bf9e4d758f72d301
BLAKE2b-256 83188c5768a5d47cc96fd2a77fce02492c80e35800af07eaf7107bae18e61376

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page