Data Revenue Connecter — read-only database profiling agent that builds a signed Data Passport

DRC Scanner — Data Revenue Connecter

A read-only database profiling agent. It runs inside your own network, profiles your database, and produces a small, human-readable, signed Data Passport — a JSON file of aggregate statistics only. Your raw data never leaves your machine. You then upload the Passport to app.datarevenue.io to auto-fill your data valuation questionnaire.

What leaves your network

Nothing during the scan. The scanner makes zero network calls while profiling. The only artifact produced is the Data Passport (drc_passport_<date>.json) — column names, row counts, and aggregate metrics. Never row values, never sample data. You can inspect the entire file before uploading it.
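
One way to inspect the file before uploading is to print every value it contains, one per line, so that anything resembling row data stands out. The following is a minimal sketch, not part of drc-scanner: the helper name `iter_leaves` and the sample fragment are hypothetical, and the real Passport schema may differ.

```python
import json
from typing import Any, Iterator


def iter_leaves(node: Any, path: str = "$") -> Iterator[tuple[str, Any]]:
    """Yield (path, value) for every scalar in a parsed JSON document."""
    if isinstance(node, dict):
        for key, value in node.items():
            yield from iter_leaves(value, f"{path}.{key}")
    elif isinstance(node, list):
        for index, value in enumerate(node):
            yield from iter_leaves(value, f"{path}[{index}]")
    else:
        # Scalars (str, int, float, bool, None) are the values a reviewer reads.
        yield path, node


if __name__ == "__main__":
    # Hypothetical Passport fragment, used only to demonstrate the walker.
    sample = json.loads(
        '{"tables": [{"name": "orders", "row_count": 1200, "columns": ["id", "total"]}]}'
    )
    for leaf_path, leaf_value in iter_leaves(sample):
        print(f"{leaf_path} = {leaf_value!r}")
```

To review a real file, replace the sample with `json.load(open(path))` for your own Passport and read through the printed lines. Empty objects and arrays produce no output in this sketch.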

Install & run

# Option 1 — pip
pip install drc-scanner
drc-scan run --connect postgresql://readonly@host:5432/proddb

# Option 2 — Docker (no Python required)
docker run -it datarevenue/scanner \
  --connect postgresql://readonly@host.docker.internal:5432/proddb

Output:

drc_passport_2026-06-11.json   (the Data Passport)
+ on-screen inventory summary and upload instructions

Create a read-only database user first

drc-scan setup --db postgres

This prints the exact SQL to create a least-privilege, read-only role so the scanner can never modify your data — copy-paste it for your DBA.
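
For orientation, a least-privilege Postgres role usually comes down to a handful of grants. The sketch below generates such statements. It is an illustration under assumptions and is not the output of `drc-scan setup`: the function name `readonly_role_sql`, the default schema `public`, and the identifier check are all hypothetical, and the role and database names are taken from the connection string shown earlier.

```python
def readonly_role_sql(role: str, database: str, schema: str = "public") -> list[str]:
    """Return standard Postgres statements for a SELECT-only login role."""
    for name in (role, database, schema):
        # Rough guard against injecting SQL through a name; not a full
        # Postgres identifier validator.
        if not name.isidentifier():
            raise ValueError(f"not a plain identifier: {name!r}")
    return [
        f"CREATE ROLE {role} LOGIN;",
        f"GRANT CONNECT ON DATABASE {database} TO {role};",
        f"GRANT USAGE ON SCHEMA {schema} TO {role};",
        f"GRANT SELECT ON ALL TABLES IN SCHEMA {schema} TO {role};",
        # Sessions opened by this role start their transactions read-only.
        f"ALTER ROLE {role} SET default_transaction_read_only = on;",
    ]


if __name__ == "__main__":
    for statement in readonly_role_sql("readonly", "proddb"):
        print(statement)
```

The sketch sets no password; a DBA would assign one separately and should prefer the SQL that the scanner itself prints.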

Safety guarantees

  • Read-only enforced in code. Every query passes through a verb allow-list; INSERT, UPDATE, DELETE, and all DDL raise a hard error before reaching the database.
  • No raw export. The Passport builder rejects any field containing row-level values.
  • Offline scan. No network egress during profiling — verifiable with a firewall rule.
  • Hashed, with signing planned. Each Passport carries a SHA-256 content hash so the platform can detect tampering; Ed25519 signing is added in the next build.
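
The verb allow-list in the first guarantee can be pictured roughly as follows. This is a minimal sketch under assumptions, not the scanner's actual code: the names `ALLOWED_VERBS`, `ReadOnlyViolation`, and `check_read_only` are hypothetical, and the set of permitted verbs is a guess.

```python
import re

# Assumed allow-list. WITH is left out because a CTE can wrap a write.
ALLOWED_VERBS = frozenset({"SELECT", "SHOW"})


class ReadOnlyViolation(Exception):
    """Raised when a statement is not provably read-only."""


def check_read_only(sql: str) -> str:
    """Return the cleaned statement, or raise before it reaches the database."""
    statement = sql.strip().rstrip(";").strip()
    # Conservative: any remaining semicolon is treated as a second statement,
    # even if it sits inside a string literal.
    if ";" in statement:
        raise ReadOnlyViolation("multiple statements are not allowed")
    match = re.match(r"[A-Za-z]+", statement)
    verb = match.group(0).upper() if match else ""
    if verb not in ALLOWED_VERBS:
        raise ReadOnlyViolation(f"verb {verb or '<none>'!r} is not on the allow-list")
    return statement


if __name__ == "__main__":
    print(check_read_only("SELECT count(*) FROM orders;"))
    try:
        check_read_only("DELETE FROM orders")
    except ReadOnlyViolation as err:
        print(f"blocked: {err}")
```

A check on the leading verb is only a first line of defense, since a SELECT can still call a function with side effects. That is why it is paired with the least-privilege database role described above.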

Status

v0.1 skeleton: Postgres connector, table/column inventory, record counts, Passport with content hash. Full metric engine, PII detection, additional connectors, and Ed25519 signing follow in subsequent builds.
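
The content hash mentioned above could work along these lines: serialize the Passport body in a canonical form, hash it with SHA-256, and store the digest alongside the body. This is a sketch under assumptions. The field name `content_hash`, the canonicalization rules, and the function names are hypothetical and may not match what the scanner or the platform actually uses.

```python
import hashlib
import json


def content_hash(body: dict) -> str:
    """SHA-256 hex digest over a canonical JSON encoding of the body."""
    canonical = json.dumps(
        body, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    return hashlib.sha256(canonical).hexdigest()


def seal(body: dict) -> dict:
    """Return a copy of the body with its hash attached."""
    return {**body, "content_hash": content_hash(body)}


def verify(passport: dict) -> bool:
    """Recompute the hash over everything except the stored digest."""
    body = {k: v for k, v in passport.items() if k != "content_hash"}
    return passport.get("content_hash") == content_hash(body)


if __name__ == "__main__":
    # Hypothetical Passport body, used only for demonstration.
    sealed = seal({"tables": [{"name": "orders", "row_count": 1200}]})
    print(verify(sealed))                       # unchanged file
    print(verify({**sealed, "tables": []}))     # edited after sealing
```

Sorting keys and fixing separators makes the digest independent of key order and whitespace. An unkeyed hash only catches edits made without recomputing the digest, which is the gap a signature such as Ed25519 closes.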
