Skip to main content

A Python wrapper for the KSQL REST API

Project description

Forked from Bryan Yang’s awsome ksql-python, brushed up, and modernized

A python wrapper for the KSQL REST API. Easily interact with the KSQL REST API using this library.

Supported KSQLDB version: 0.29.0+
Supported Python version: 3.9+

Roadmap:

  • Move to Poetry/PDM

  • Asyncify the entire codebase

  • Support newer python versions

  • More Cleanup & Refactoring

  • Benchmarks

PyPI - Version https://img.shields.io/badge/license-MIT-yellow.svg

Installation

pip install ksql-python-ng==0.29.0.1

Getting Started

Setup for KSQL

This is the GITHUB page of KSQL. https://github.com/confluentinc/ksql

If you have installed open source Confluent CLI (e.g. by installing Confluent Open Source or Enterprise Platform), you can start KSQL and its dependencies with one single command:

confluent start ksql-server

Setup for ksql-python API

  • Setup for the KSQL API:

from ksql import KSQLAPI
client = KSQLAPI('http://ksql-server:8088')
  • Setup for KSQl API with logging enabled:

import logging
from ksql import KSQLAPI
logging.basicConfig(level=logging.DEBUG)
client = KSQLAPI('http://ksql-server:8088')
  • Setup for KSQL API with Basic Authentication

from ksql import KSQLAPI
client = KSQLAPI('http://ksql-server:8088', api_key="your_key", secret="your_secret")

Options

Option

Type

Required

Description

url

string

yes

Your ksql-server url. Example: http://ksql-server:8080

timeout

integer

no

Timout for Requests. Default: 5

api_key

string

no

API Key to use on the requests

secret

string

no

Secret to use on the requests

Main Methods

ksql

This method can be used for some KSQL features which are not supported via other specific methods like query, create_stream or create_stream_as. The following example shows how to execute the show tables statement:

client.ksql('show tables')
  • Example Response [{'tables': {'statementText': 'show tables;', 'tables': []}}]

query

It will execute sql query and keep listening streaming data.

client.query('select * from table1')

This command returns a generator. It can be printed e.g. by reading its values via next(query) or a for loop. Here is a complete example:

from ksql import KSQLAPI
client = KSQLAPI('http://localhost:8088')
query = client.query('select * from table1')
for item in query: print(item)
  • Example Response

    {"queryId":"44d8413c-0018-423d-b58f-3f2064b9a312","columnNames":["ORDER_ID","TOTAL_AMOUNT","CUSTOMER_NAME"],"columnTypes":["INTEGER","DOUBLE","STRING"]}
    [3,43.0,"Palo Alto"]
    [3,43.0,"Palo Alto"]
    [3,43.0,"Palo Alto"]

Query with HTTP/2

Execute queries with the new /query-stream endpoint. Documented here

To execute a sql query use the same syntax as the regular query, with the additional use_http2=True parameter.

client.query('select * from table1', use_http2=True)

A generator is returned with the following example response

{"queryId":"44d8413c-0018-423d-b58f-3f2064b9a312","columnNames":["ORDER_ID","TOTAL_AMOUNT","CUSTOMER_NAME"],"columnTypes":["INTEGER","DOUBLE","STRING"]}
[3,43.0,"Palo Alto"]
[3,43.0,"Palo Alto"]
[3,43.0,"Palo Alto"]

To terminate the query above use the close_query call. Provide the queryId returned from the query call.

client.close_query("44d8413c-0018-423d-b58f-3f2064b9a312")

Insert rows into a Stream with HTTP/2

Uses the new /inserts-stream endpoint. See documentation

rows = [
        {"ORDER_ID": 1, "TOTAL_AMOUNT": 23.5, "CUSTOMER_NAME": "abc"},
        {"ORDER_ID": 2, "TOTAL_AMOUNT": 3.7, "CUSTOMER_NAME": "xyz"}
    ]

results = self.api_client.inserts_stream("my_stream_name", rows)

An array of object will be returned on success, with the status of each row inserted.

Simplified API

create_stream/ create_table

client.create_stream(table_name=table_name,
                     columns_type=columns_type,
                     topic=topic,
                     value_format=value_format)

Options

Option

Type

Required

Description

table_name

string

yes

name of stream/table

columns_type

list

yes

ex:['viewtime bigint','userid varchar','pageid varchar']

topic

string

yes

Kafka topic

value_format

string

no

JSON (Default) or DELIMITED or AVRO

key

string

for Table

Key (used for JOINs)

  • Responses

If create table/stream succeed:

return True

If failed:

raise a CreateError(respose_from_ksql_server)

create_stream_as

a simplified api for creating stream as select

client.create_stream_as(table_name=table_name,
                        select_columns=select_columns,
                        src_table=src_table,
                        kafka_topic=kafka_topic,
                        value_format=value_format,
                        conditions=conditions,
                        partition_by=partition_by,
                        **kwargs)
CREATE STREAM <table_name>
[WITH ( kafka_topic=<kafka_topic>, value_format=<value_format>, property_name=expression ... )]
AS SELECT  <select_columns>
FROM <src_table>
[WHERE <conditions>]
PARTITION BY <partition_by>];

Options

Option

Type

Required

Description

table_name

string

yes

name of stream/table

select_columns

list

yes

you can select [*] or ['columnA', 'columnB']

src_table

string

yes

name of source table

kafka_topic

string

no

The name of the Kafka topic of this new stream(table).

value_format

string

no

DELIMITED, JSON``(Default) or ``AVRO

conditions

string

no

The conditions in the where clause.

partition_by

string

no

Data will be distributed across partitions by this column.

kwargs

pair

no

please provide key=value pairs. Please see more options.

KSQL JOINs

KSQL JOINs between Streams and Tables are not supported yet via explicit methods, but you can use the ksql method for this like the following:

client.ksql("CREATE STREAM join_per_user WITH (VALUE_FORMAT='AVRO', KAFKA_TOPIC='join_per_user') AS SELECT Time, Amount FROM source c INNER JOIN users u on c.user = u.userid WHERE u.USERID = 1")

FileUpload

upload

Run commands from a .ksql file. Can only support ksql commands and not streaming queries.

from ksql.upload import FileUpload
pointer = FileUpload('http://ksql-server:8080')
pointer.upload('rules.ksql')

Options

Option

Type

Required

Description

ksqlfile

string

yes

name of file containing the rules

  • Responses

If ksql-commands succesfully executed:

return (List of server response for all commands)

If failed:

raise the appropriate error

More Options

There are more properties (partitions, replicas, etc…) in the official document.

KSQL Syntax Reference

  • Responses

If create table/stream succeed:

return True

If failed:

raise a CreateError(respose_from_ksql_server)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ksql_python_ng-0.29.0.1.tar.gz (20.3 kB view details)

Uploaded Source

Built Distribution

ksql_python_ng-0.29.0.1-py3-none-any.whl (13.9 kB view details)

Uploaded Python 3

File details

Details for the file ksql_python_ng-0.29.0.1.tar.gz.

File metadata

  • Download URL: ksql_python_ng-0.29.0.1.tar.gz
  • Upload date:
  • Size: 20.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for ksql_python_ng-0.29.0.1.tar.gz
Algorithm Hash digest
SHA256 a508683809f4eed6781d60c48feb9bc5cb7503eb200c4d9b8d3bf043ceecd060
MD5 8d8c700d7dd2b96814d94b79f5041146
BLAKE2b-256 d6fef60ac4daf12822b36beaf94bd87f02a1cad45fcce1d8860eae86330a3bfb

See more details on using hashes here.

File details

Details for the file ksql_python_ng-0.29.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for ksql_python_ng-0.29.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3704f82fbfa66adb1f57ad5a209967ee876fc97873e91106065add0ea32b0a44
MD5 26323af117806e9cdbf1eab34069c7fc
BLAKE2b-256 e5053c7d4dd51ed39f16fe63ab0cbbb3cc7649a9570151f504243067a91d0d70

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page