datajunction-query

OSS Implementation of a DataJunction Query Service

These details have not been verified by PyPI

Project links

repository

Project description

DataJunction Query Service

This repository (DJQS) is an open source implementation of a DataJunction query service. It allows you to create catalogs and engines that represent sqlalchemy connections. Configuring a DJ server to use a DJQS server allows DJ to query any of the database technologies supported by sqlalchemy.

Quickstart

To get started, clone this repo and start up the docker compose environment.

git clone https://github.com/DataJunction/djqs
cd djqs
docker compose up

Creating Catalogs

Catalogs can be created using the POST /catalogs/ endpoint.

curl -X 'POST' \
  'http://localhost:8001/catalogs/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "djdb"
}'

Creating Engines

Engines can be created using the POST /engines/ endpoint.

curl -X 'POST' \
  'http://localhost:8001/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "sqlalchemy-postgresql",
  "version": "15.2",
  "uri": "postgresql://dj:dj@postgres-roads:5432/djdb"
}'

Engines can be attached to existing catalogs using the POST /catalogs/{name}/engines/ endpoint.

curl -X 'POST' \
  'http://localhost:8001/catalogs/djdb/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '[
  {
    "name": "sqlalchemy-postgresql",
    "version": "15.2"
  }
]'

Executing Queries

Queries can be submitted to DJQS for a specified catalog and engine.

curl -X 'POST' \
  'http://localhost:8001/queries/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "catalog_name": "djdb",
  "engine_name": "sqlalchemy-postgresql",
  "engine_version": "15.2",
  "submitted_query": "SELECT * from roads.repair_orders",
  "async_": false
}'

Async queries can be submitted as well.

curl -X 'POST' \
  'http://localhost:8001/queries/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "catalog_name": "djdb",
  "engine_name": "sqlalchemy-postgresql",
  "engine_version": "15.2",
  "submitted_query": "SELECT * from roads.repair_orders",
  "async_": true
}'

response

{
  "catalog_name": "djdb",
  "engine_name": "sqlalchemy-postgresql",
  "engine_version": "15.2",
  "id": "<QUERY ID HERE>",
  "submitted_query": "SELECT * from roads.repair_orders",
  "executed_query": null,
  "scheduled": null,
  "started": null,
  "finished": null,
  "state": "ACCEPTED",
  "progress": 0,
  "results": [],
  "next": null,
  "previous": null,
  "errors": []
}

The query id provided in the response can then be used to check the status of the running query and get the results once it’s completed.

curl -X 'GET' \
  'http://localhost:8001/queries/<QUERY ID HERE>/' \
  -H 'accept: application/json'

response

{
  "catalog_name": "djdb",
  "engine_name": "sqlalchemy-postgresql",
  "engine_version": "15.2",
  "id": "$QUERY_ID",
  "submitted_query": "SELECT * from roads.repair_orders",
  "executed_query": "SELECT * from roads.repair_orders",
  "scheduled": "2023-02-28T07:27:55.367162",
  "started": "2023-02-28T07:27:55.367387",
  "finished": "2023-02-28T07:27:55.502412",
  "state": "FINISHED",
  "progress": 1,
  "results": [
    {
      "sql": "SELECT * from roads.repair_orders",
      "columns": [...],
      "rows": [...],
      "row_count": 25
    }
  ],
  "next": null,
  "previous": null,
  "errors": []
}

Reflection

If running a [reflection service](https://github.com/DataJunction/djrs), that service can leverage the POST /table/{table}/columns/ endpoint of DJQS to get column names and types for a given table.

curl -X 'GET' \
  'http://localhost:8001/table/djdb.roads.repair_orders/columns/?engine=sqlalchemy-postgresql&engine_version=15.2' \
  -H 'accept: application/json'

response

{
  "name": "djdb.roads.repair_orders",
  "columns": [
    {
      "name": "repair_order_id",
      "type": "INT"
    },
    {
      "name": "municipality_id",
      "type": "STR"
    },
    {
      "name": "hard_hat_id",
      "type": "INT"
    },
    {
      "name": "order_date",
      "type": "DATE"
    },
    {
      "name": "required_date",
      "type": "DATE"
    },
    {
      "name": "dispatched_date",
      "type": "DATE"
    },
    {
      "name": "dispatcher_id",
      "type": "INT"
    }
  ]
}

DuckDB

DJQS includes an example of using DuckDB as an engine and it comes preloaded with the roads example database.

Create a djduckdb catalog and a duckdb engine.

curl -X 'POST' \
  'http://localhost:8001/catalogs/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "djduckdb"
}'

curl -X 'POST' \
  'http://localhost:8001/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "duckdb",
  "version": "0.7.1",
  "uri": "duckdb://local[*]"
}'

curl -X 'POST' \
  'http://localhost:8001/catalogs/djduckdb/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '[
  {
    "name": "duckdb",
    "version": "0.7.1"
  }
]'

Now you can submit DuckDB SQL queries.

curl -X 'POST' \
  'http://localhost:8001/queries/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "catalog_name": "djduckdb",
  "engine_name": "duckdb",
  "engine_version": "0.7.1",
  "submitted_query": "SELECT * FROM roads.us_states LIMIT 10",
  "async_": false
}'

Spark

DJQS includes an example of using Spark as an engine. To try it, start up the docker compose environment and then load the example roads database into Spark.

docker exec -it djqs /bin/bash -c "python /code/docker/spark_load_roads.py"

Next, create a djspark catalog and a spark engine.

curl -X 'POST' \
  'http://localhost:8001/catalogs/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "djspark"
}'

curl -X 'POST' \
  'http://localhost:8001/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "name": "spark",
  "version": "3.3.2",
  "uri": "spark://local[*]"
}'

curl -X 'POST' \
  'http://localhost:8001/catalogs/djspark/engines/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '[
  {
    "name": "spark",
    "version": "3.3.2"
  }
]'

Now you can submit Spark SQL queries.

curl -X 'POST' \
  'http://localhost:8001/queries/' \
  -H 'accept: application/json' \
  -H 'Content-Type: application/json' \
  -d '{
  "catalog_name": "djspark",
  "engine_name": "spark",
  "engine_version": "3.3.2",
  "submitted_query": "SELECT * FROM roads.us_states LIMIT 10",
  "async_": false
}'

Project details

These details have not been verified by PyPI

Project links

repository

Release history Release notifications | RSS feed

0.0.41

Jan 19, 2026

0.0.40

Jan 19, 2026

0.0.39

Jan 19, 2026

0.0.38

Jan 19, 2026

0.0.37

Jan 19, 2026

0.0.36

Jan 19, 2026

0.0.35

Jan 19, 2026

0.0.34

Jan 19, 2026

0.0.33

Jan 15, 2026

0.0.32

Jan 14, 2026

0.0.31

Jan 13, 2026

0.0.30

Jan 13, 2026

0.0.29

Jan 12, 2026

0.0.28

Jan 9, 2026

0.0.27

Jan 9, 2026

0.0.26

Jan 2, 2026

0.0.25

Dec 23, 2025

0.0.24

Dec 18, 2025

0.0.23

Dec 14, 2025

0.0.22

Dec 9, 2025

0.0.21

Dec 7, 2025

0.0.20

Dec 7, 2025

0.0.19

Dec 5, 2025

0.0.18

Dec 3, 2025

0.0.17

Dec 1, 2025

0.0.16

Nov 25, 2025

0.0.15

Nov 13, 2025

0.0.14

Nov 5, 2025

0.0.13

Oct 30, 2025

0.0.12

Oct 29, 2025

0.0.11

Oct 25, 2025

0.0.10

Oct 23, 2025

0.0.9

Oct 20, 2025

0.0.8

Oct 16, 2025

0.0.7

Oct 9, 2025

0.0.6

Oct 3, 2025

0.0.5

Oct 2, 2025

0.0.4

Sep 26, 2025

0.0.3

Sep 26, 2025

0.0.2

Sep 16, 2025

0.0.1a118 pre-release

Sep 8, 2025

This version

0.0.1a117 pre-release

Sep 2, 2025

0.0.1a116 pre-release

Aug 27, 2025

0.0.1a115 pre-release

Aug 25, 2025

0.0.1a114 pre-release

Aug 25, 2025

0.0.1a113 pre-release

Aug 22, 2025

0.0.1a112 pre-release

Aug 5, 2025

0.0.1a111 pre-release

Aug 2, 2025

0.0.1a110 pre-release

Jul 29, 2025

0.0.1a109 pre-release

Jul 22, 2025

0.0.1a108 pre-release

Jul 18, 2025

0.0.1a107 pre-release

Jul 15, 2025

0.0.1a106 pre-release

Jun 27, 2025

0.0.1a105 pre-release

Jun 23, 2025

0.0.1a104 pre-release

Jun 21, 2025

0.0.1a103 pre-release

Jun 19, 2025

0.0.1a102 pre-release

Jun 17, 2025

0.0.1a101 pre-release

Jun 3, 2025

0.0.1a100 pre-release

May 22, 2025

0.0.1a99 pre-release

May 20, 2025

0.0.1a98 pre-release

May 10, 2025

0.0.1a97 pre-release

Apr 29, 2025

0.0.1a96 pre-release

Apr 25, 2025

0.0.1a95 pre-release

Apr 16, 2025

0.0.1a94 pre-release

Apr 8, 2025

0.0.1a93 pre-release

Mar 26, 2025

0.0.1a92 pre-release

Mar 20, 2025

0.0.1a91 pre-release

Mar 12, 2025

0.0.1a90 pre-release

Mar 7, 2025

0.0.1a89 pre-release

Mar 3, 2025

0.0.1a88 pre-release

Feb 26, 2025

0.0.1a87 pre-release

Feb 19, 2025

0.0.1a86 pre-release

Feb 10, 2025

0.0.1a85 pre-release

Feb 1, 2025

0.0.1a84 pre-release

Jan 28, 2025

0.0.1a83 pre-release

Jan 21, 2025

0.0.1a82 pre-release

Jan 18, 2025

0.0.1a81 pre-release

Jan 6, 2025

0.0.1a80 pre-release

Dec 23, 2024

0.0.1a79 pre-release

Dec 19, 2024

0.0.1a78 pre-release

Dec 12, 2024

0.0.1a77 pre-release

Dec 5, 2024

0.0.1a76 pre-release

Dec 4, 2024

0.0.1a75 pre-release

Dec 3, 2024

0.0.1a74 pre-release

Nov 6, 2024

0.0.1a73 pre-release

Nov 4, 2024

0.0.1a72 pre-release

Oct 31, 2024

0.0.1a70 pre-release

Oct 14, 2024

0.0.1a69 pre-release

Oct 10, 2024

0.0.1a68 pre-release

Oct 3, 2024

0.0.1a67 pre-release

Sep 30, 2024

0.0.1a66 pre-release

Sep 26, 2024

0.0.1a65 pre-release

Sep 10, 2024

0.0.1a64 pre-release

Sep 4, 2024

0.0.1a63 pre-release

Aug 29, 2024

0.0.1a62 pre-release

Aug 11, 2024

0.0.1a61 pre-release

Jul 30, 2024

0.0.1a60 pre-release

Jul 26, 2024

0.0.1a59 pre-release

Jul 15, 2024

0.0.1a58 pre-release

Jul 15, 2024

0.0.1a57 pre-release

Jul 8, 2024

0.0.1a56 pre-release

Jul 2, 2024

0.0.1a55 pre-release

Jun 25, 2024

0.0.1a54 pre-release

Jun 18, 2024

0.0.1a53 pre-release

Jun 14, 2024

0.0.1a52 pre-release

Jun 10, 2024

0.0.1a51 pre-release

Jun 3, 2024

0.0.1a50 pre-release

May 29, 2024

0.0.1a49 pre-release

May 21, 2024

0.0.1a48 pre-release

May 21, 2024

0.0.1a47 pre-release

May 15, 2024

0.0.1a46 pre-release

May 1, 2024

0.0.1a45 pre-release

Apr 10, 2024

0.0.1a44 pre-release

Feb 26, 2024

0.0.1a43 pre-release

Feb 6, 2024

0.0.1a42 pre-release

Jan 29, 2024

0.0.1a41 pre-release

Jan 26, 2024

0.0.1a40 pre-release

Jan 19, 2024

0.0.1a39 pre-release

Dec 20, 2023

0.0.1a38 pre-release

Dec 14, 2023

0.0.1a37 pre-release

Dec 7, 2023

0.0.1a36 pre-release

Dec 7, 2023

0.0.1a35 pre-release

Nov 3, 2023

0.0.1a34 pre-release

Nov 2, 2023

0.0.1a33 pre-release

Oct 27, 2023

0.0.1a32 pre-release

Oct 26, 2023

0.0.1a31 pre-release

Oct 20, 2023

0.0.1a30 pre-release

Oct 12, 2023

0.0.1a1 pre-release

Oct 12, 2023

0.0.1a1.dev0 pre-release

Oct 12, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datajunction_query-0.0.1a117.tar.gz (147.0 kB view details)

Uploaded Sep 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

datajunction_query-0.0.1a117-py3-none-any.whl (23.3 kB view details)

Uploaded Sep 2, 2025 Python 3

File details

Details for the file datajunction_query-0.0.1a117.tar.gz.

File metadata

Download URL: datajunction_query-0.0.1a117.tar.gz
Upload date: Sep 2, 2025
Size: 147.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for datajunction_query-0.0.1a117.tar.gz
Algorithm	Hash digest
SHA256	`ea30784a05ca7d8d6f40074f547f585d31cc6aa1247ecd8bf524e7227b0efcd1`
MD5	`bb2bb3e0bbc6cca5ca21c21815b0c612`
BLAKE2b-256	`6ae244c5f7a5aa384831fdc734b564ba61ea4fa915b32bf5b9912ae6a2068854`

See more details on using hashes here.

File details

Details for the file datajunction_query-0.0.1a117-py3-none-any.whl.

File metadata

Download URL: datajunction_query-0.0.1a117-py3-none-any.whl
Upload date: Sep 2, 2025
Size: 23.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: python-httpx/0.28.1

File hashes

Hashes for datajunction_query-0.0.1a117-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a3c93ecfd949a477649abcd1bac61d28562df46de0bd0b836299ce8ed70db9e3`
MD5	`0005ad582e30b363012bf1aed41e1e56`
BLAKE2b-256	`4f0d5be38445fac86a85e513d1276561165fbe3cfdb89b56df9c6de6663b5f51`

See more details on using hashes here.

datajunction-query 0.0.1a117

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

DataJunction Query Service

Quickstart

Creating Catalogs

Creating Engines

Executing Queries

Reflection

DuckDB

Spark

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes