Skip to main content

Google Cloud Datastore to BigQuery sync toolkit with Pub/Sub events, ML scoring, and workflow-ready orchestration.

Project description

CloudSync MLBridge

Google Cloud Datastore to BigQuery sync toolkit with Pub/Sub events, ML scoring, and workflow-ready orchestration.

cloudsync-mlbridge helps teams move operational data from Google Cloud Datastore into BigQuery while keeping analytics tables current through event-driven sync patterns.

It is designed for modern Google Cloud architectures using Datastore, BigQuery, Pub/Sub, Google Workflows, Cloud Run, and ML-based decision systems.

Use Cases

  • Datastore to BigQuery synchronization
  • Pub/Sub event-driven change propagation
  • BigQuery analytics table loading
  • ML-assisted record scoring before analytics ingestion
  • Workflow-ready sync payload generation
  • Operational dashboards and reporting
  • Agentic AI data movement orchestration

Install

pip install cloudsync-mlbridge

Python Usage

from cloudsync_mlbridge import SyncRecord, score_record_freshness, build_bigquery_row

record = SyncRecord(
    entity_key="customer-1001",
    kind="CustomerProfile",
    operation="UPSERT",
    data={"status": "active", "region": "US"}
)

print(score_record_freshness(record))
print(build_bigquery_row(record))

CLI Usage

cloudsync-mlbridge score --entity-key customer-1001 --kind CustomerProfile --operation UPSERT

Architecture Pattern

Recommended enterprise pattern:

  1. Application writes to Datastore.
  2. Application emits a Pub/Sub event.
  3. Cloud Run, Dataflow, or Workflows receives the event.
  4. cloudsync-mlbridge normalizes the record.
  5. Optional ML scoring is applied.
  6. Record is written to BigQuery raw/current tables.
  7. Scheduled reconciliation compares Datastore export with BigQuery.

Build and Publish

python -m pip install --upgrade build twine
python -m build
twine check dist/*
twine upload dist/*

License

MIT

Author: Raghava Chellu

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cloudsync_mlbridge-0.1.0.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cloudsync_mlbridge-0.1.0-py3-none-any.whl (6.1 kB view details)

Uploaded Python 3

File details

Details for the file cloudsync_mlbridge-0.1.0.tar.gz.

File metadata

  • Download URL: cloudsync_mlbridge-0.1.0.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for cloudsync_mlbridge-0.1.0.tar.gz
Algorithm Hash digest
SHA256 38e375557306e4bb5e0a5081236702b191e79399ea3a404d78dc7ba31b719850
MD5 0257340105726f7f044e29e0d27cd8c0
BLAKE2b-256 6b02ee04ed8ab41b71a3496db125c6ce3e4d47e924bb5062082b69150825c95a

See more details on using hashes here.

File details

Details for the file cloudsync_mlbridge-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for cloudsync_mlbridge-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 683da474101f76e9c95d29d3075e584fd8b090ba1ebc901ab453cc110643b596
MD5 b29c6cbf0f75352c41770a2118f0a129
BLAKE2b-256 28b5b8144bbd80de321b659a4cf153bea5358ab99398f5601897bc2d32bbcc18

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page