Google Cloud Datastore to BigQuery sync toolkit with Pub/Sub events, ML scoring, and workflow-ready orchestration.
Project description
CloudSync MLBridge
Google Cloud Datastore to BigQuery sync toolkit with Pub/Sub events, ML scoring, and workflow-ready orchestration.
cloudsync-mlbridge helps teams move operational data from Google Cloud Datastore into BigQuery while keeping analytics tables current through event-driven sync patterns.
It is designed for modern Google Cloud architectures using Datastore, BigQuery, Pub/Sub, Google Workflows, Cloud Run, and ML-based decision systems.
Use Cases
- Datastore to BigQuery synchronization
- Pub/Sub event-driven change propagation
- BigQuery analytics table loading
- ML-assisted record scoring before analytics ingestion
- Workflow-ready sync payload generation
- Operational dashboards and reporting
- Agentic AI data movement orchestration
Install
pip install cloudsync-mlbridge
Python Usage
from cloudsync_mlbridge import SyncRecord, score_record_freshness, build_bigquery_row
record = SyncRecord(
entity_key="customer-1001",
kind="CustomerProfile",
operation="UPSERT",
data={"status": "active", "region": "US"}
)
print(score_record_freshness(record))
print(build_bigquery_row(record))
CLI Usage
cloudsync-mlbridge score --entity-key customer-1001 --kind CustomerProfile --operation UPSERT
Architecture Pattern
Recommended enterprise pattern:
- Application writes to Datastore.
- Application emits a Pub/Sub event.
- Cloud Run, Dataflow, or Workflows receives the event.
cloudsync-mlbridgenormalizes the record.- Optional ML scoring is applied.
- Record is written to BigQuery raw/current tables.
- Scheduled reconciliation compares Datastore export with BigQuery.
Build and Publish
python -m pip install --upgrade build twine
python -m build
twine check dist/*
twine upload dist/*
License
MIT
Author: Raghava Chellu
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file cloudsync_mlbridge-0.1.0.tar.gz.
File metadata
- Download URL: cloudsync_mlbridge-0.1.0.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
38e375557306e4bb5e0a5081236702b191e79399ea3a404d78dc7ba31b719850
|
|
| MD5 |
0257340105726f7f044e29e0d27cd8c0
|
|
| BLAKE2b-256 |
6b02ee04ed8ab41b71a3496db125c6ce3e4d47e924bb5062082b69150825c95a
|
File details
Details for the file cloudsync_mlbridge-0.1.0-py3-none-any.whl.
File metadata
- Download URL: cloudsync_mlbridge-0.1.0-py3-none-any.whl
- Upload date:
- Size: 6.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
683da474101f76e9c95d29d3075e584fd8b090ba1ebc901ab453cc110643b596
|
|
| MD5 |
b29c6cbf0f75352c41770a2118f0a129
|
|
| BLAKE2b-256 |
28b5b8144bbd80de321b659a4cf153bea5358ab99398f5601897bc2d32bbcc18
|