Provides access to the Waii APIs

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3

Project description

Waii Python SDK Documentation

The waii-sdk-py is a Python library that allows you to interact with the Waii API. It provides a powerful SQL and AI capability to your Python applications.

Installation

To install the waii-sdk-py, you can use pip:

pip install waii-sdk-py

Importing & Initialize the SDK

>>> from waii_sdk_py import WAII
>>> from waii_sdk_py.query import *
>>> WAII.initialize(api_key="<your-api-key>")

You can get your API key from the tweakit.waii.ai (You need to register and get access from waii.ai first).

Get started

First you can print the list of available databases:

>>> print([conn.key for conn in WAII.Database.get_connections().connectors])

Then, you can activate the database connection you want to use (from one of the key in the list above)

>>> WAII.Database.activate_connection("snowflake://...&warehouse=COMPUTE_WH")

Get Database name of the active connection

>>> print([catalog.name for catalog in WAII.Database.get_catalogs().catalogs])

>>> print(WAII.Query.generate(QueryGenerationRequest(ask = "How many tables are there?")).query)

SELECT COUNT(DISTINCT table_name)
FROM waii.information_schema.tables

Run the query

>>> print(WAII.Query.run(RunQueryRequest(query = "SELECT COUNT(DISTINCT table_name) FROM waii.information_schema.tables")))

rows=[{'COUNT(DISTINCT TABLE_NAME)': 112}] more_rows=0 column_definitions=[ColumnDefinition(name='COUNT(DISTINCT TABLE_NAME)', type='FIXED')] query_uuid='01afbd1e-0001-d31e-0022-ba8700a8209e'

In order to know more details, you can check the following detailed API documentation.

Use Notebook

You can also use the SDK in Notebooks like Jupyter, you can follow the the doc here

Modules

The SDK consists of the following modules:

Database
Query
SemanticContext
History

Each module encapsulates a certain functionality of the Waii API.

Database

The Database module contains methods for handling database-related tasks.

Here are some of its methods:

Modify Connections

Database.modify_connections(params: ModifyDBConnectionRequest) -> ModifyDBConnectionResponse

This method allows to modify the database connections.

ModifyDBConnectionRequest contains the following properties:

updated: Update or add new connections
removed: Remove connections, by specifying the key of the connection
default_db_connection_key: Set the default connection (by specifying key of connection)

To add connection, you need to create DBConnection Object, which include the following fields

key: Set it to '', it will be auto generated
db_type: Acceptable values are: snowflake, postgresql, mongodb, mongodb+srv (Mongo Atlas).
description: Description of the connection
account_name: Account name of the connection, apply to Snowflake
username: Username of the connection, can be None if no username needed (such as localhost database)
password: Password of the connection, can be None if no password.
database: Database name of the connection, you must specify it. Please note that it is case sensitive for most of the databases.
warehouse: Warehouse name of the connection, apply to Snowflake
role: Role name of the connection, apply to Snowflake
host/port: Host/port of the connection, apply to postgresql and mongodb.
sample_col_values: Do you want to let Waii to sample your string/variant columns. True if you want to sample, False if you don't want to sample. Default is False. This is optional, which can help Waii to generate better queries.

Examples of creating DBConnection Object

Snowflake

DBConnection(
    key = '',
    db_type = 'snowflake',
    account_name = 'abcde-12345',
    username = 'username',
    password = 'password',
    database = 'database',
    warehouse = 'warehouse',
    role = 'role'
)

PostgreSQL

DBConnection(
    key = '',
    db_type = 'postgresql',
    username = 'username',
    password = 'password',
    database = 'database',
    host = 'localhost',
    port = 5432
)

MongoDB

DBConnection(
    key = '',
    db_type = 'mongodb', # or 'mongodb+srv' if you are using Mongo Atlas
    username = 'username',
    password = 'password',
    database = 'database',
    host = 'localhost',
    port = 27017 # if you are using Mongo Atlas, you shouldn't set port
)

Get Connections

Database.get_connections(params: GetDBConnectionRequest = GetDBConnectionRequest()) -> GetDBConnectionResponse

This method fetches the list of available database connections.

Response fields: connectors: List of DBConnection objects (No password field) connector_status: Status of the connection, are they being indexed or not.

Activate Connection

Database.activate_connection(key: str)

This method sets the scope of the current database connection.

You can run

>>> Database.get_activated_connection()

To get the current activated connection.

Get Catalogs

Database.get_catalogs(params: GetCatalogRequest = GetCatalogRequest()) -> GetCatalogResponse

This method retrieves the list of available catalogs. It includes a hierarchical list of schemas and tables.

You can run

>>> WAII.Database.get_catalogs()

To get catalogs, response fields:

CatalogDefinition:

name: Name of the catalog (database)
- schemas: List of schemas
  - name: Name of the schema
  - description: Description of the schema
    - summary: generated description of the schema
    - common_questions: list of generated common questions
    - common_tables: list of common tables used in the schema
    - tables: all tables belong to the schema
      - name: Name of the table
      - columns: List of columns
        
        name: Name of the column
        
        type: Type of the column
        
        description: Auto generated description of the column
      - comment: Comment of the table (fetched from underlying database)
      - last_altered_time: Last altered time of the table
      - refs: List of referential constraints
      - inferred_refs: List of auto-inferred referential constraints
      - inferred_constraints: List of auto-inferred constraints (pks, etc.)
      - description: Auto generated table description

Waii automatically generate table/schema descriptions when you add them, you can fetch them by using description field of table and schema from get_catalogs method.

Description of table looks like:

The CUSTOMER_ADDRESS table contains information about the addresses of customers. It includes details such as address ID, city, country, ... This table can be used to retrieve customer addresses for various purposes, such as shipping, billing, or demographic analysis.

Update Table and Schema Descriptions

You can use the following two methods to update the descriptions of tables and schemas (if you are not satisfied with the auto generated descriptions)

Database.update_table_description(params: UpdateTableDescriptionRequest) -> UpdateTableDescriptionResponse
Database.update_schema_description(params: UpdateSchemaDescriptionRequest) -> UpdateSchemaDescriptionResponse

Query

The Query module contains methods related to SQL query handling. Important: You need to activate the database connection first before using the methods in this module. Otherwise you may trying to generate query from a wrong database.

Here are some of its methods:

Generate Query

Query.generate(params: QueryGenerationRequest) -> GeneratedQuery

This method generates a SQL query based on the provided parameters.

Parameter fields:

ask: The question you want to ask Waii to generate, such as How many tables are there?
dialect: What is the dialect of the generated query, such as snowflake, postgresql, mongodb
tweak_history: We can support both asking new question, or tweak the previous question. If you want to tweak the previous question, you can set this field. (It's list of Tweak object). Each of Tweak object looks like:
- sql: The SQL query you want to tweak
- ask: The previous question you asked.
search_context: List of scopes to limit tables/schemas used to generate query (By default it searches all objects from the database). It's list of SearchContext object. Each of SearchContext object looks like:
- db_name: Catalog name (database name), default is *
- schema_name: Schema name, default is *
- table_name: Table name, default is *

Examples:

Ask a new question:

>>> WAII.Query.generate(QueryGenerationRequest(ask = "How many tables are there?"))

Ask a complex question:

>>> WAII.Query.generate(QueryGenerationRequest(ask = "Give me all stores in California that have more than 1000 customers, include the store name, store address, and the number of customers."))

Generate query with search context (limited to specific tables, schemas, etc.)

By default, Waii search all tables in the database, if you know which tables you want to use, you can specify the search context to limit the tables used to generate query.

>>> WAII.Query.generate(QueryGenerationRequest(
    ask = "How many cars we produced in Q2", 
    search_context=[SearchContext(db_name="*",
                                  schema_name="schema1",
                                  table_name="table1"),
                    SearchContext(db_name="*",
                                  schema_name="schema2",
                                  table_name="*")]))

The above query will only search tables from schema1.table1 and schema2.*

Tweak the previous question:

>>> WAII.Query.generate(QueryGenerationRequest(
    ask = "only return 10 results", 
    tweak_history=[Tweak(sql="select * from information_schema.tables", 
                         ask="give me all tables")]))

(The above tweak will return the first 10 results of the previous query)

`Query.generate` returns `GeneratedQuery` object, which contains the following fields:

uuid: UUID of the generated query
liked: Whether the query is liked or not (True/False). You can like a query by using Query.like method.
query: The generated query text
tables: Tables used in the generated query
semantic_context: Semantic context applied to the generated query
detailed_steps: Detailed steps of how the query work (in natural language)
what_changed: If you do a tweak, it will tell you what changed in the query
compilation_error: If there is any compilation error, it will show here. (Waii will try to fix the compilation error automatically, but if it tried multiple times and still cannot fix it, it will show here)

Generate Question

You can also generate questions based on your database schema, which can be useful when you want to show to your user what kind of questions can be asked to the database.

>>> Query.generate_question(GenerateQuestionRequest(schema_name='PUBLIC',
                                                    n_questions=10,
                                                    complexity=GeneratedQuestionComplexity.hard))

Parameter fields:

schema_name: The schema name you want to generate questions. This is must be specified.
n_questions: Number of questions you want to generate. Default is 10. You can choose a number between 1 to 30.
complexity: Complexity of the questions you want to generate. Default is GeneratedQuestionComplexity.hard. You can choose easy, medium, hard.

Output GenerateQuestionResponse, fields:

questions: List of generated questions. For each question it has question (string content of the question), and complexity (complexity of the question)

Transcode

>>> WAII.Query.transcode(TranscodeQueryRequest(source_dialect="postgres", source_query="select ...;", target_dialect="snowflake", ask="..."))

Parameter fields:

search_context: Same as Query.generate, you can specify the search context to limit the tables/schemas used to transcode.
source_dialect: The dialect of the source query, such as snowflake, postgresql, mongodb
source_query: The source query you want to transcode
target_dialect: The dialect of the target query, such as snowflake, postgresql, mongodb
ask: The intructions you want to ask Waii to add when transcode the query, such as place tables under schema1. It is optional.

`Query.transcode` returns `GeneratedQuery` object

Run a query (sync)

Query.run(params: RunQueryRequest) -> GetQueryResultResponse

Request fields:

query: The SQL query you want to run
session_id: Session ID of the query, default is None (which means Waii will generate a new session ID for you)
current_schema: current schema to run the query, default is None (which means Waii will use the default schema of the connection). It is SchemaName object, which has database_name (optional), schema_name fields.
session_parameter: a key value pairs to set the connection parameters, such as auto_commit, isolation_level, etc. Default is None (which means Waii will use the default connection parameters of the connection)

Response fields:

rows: List of rows returned. Each of row is json object, with column name as key, and column value as value.
more_rows: Whether there are more rows or not (True/False), it can be True when max_returned_rows is set and generated rows are more than max_returned_rows
column_definitions: List of column definitions. Each of column definition has name and type
query_uuid: UUID of the query (it is different from generate query UUID, this UUID is generated by underlying database and Snowflake)

Example:

>>> print(WAII.Query.run(RunQueryRequest(query = "SELECT COUNT(DISTINCT table_name) FROM waii.information_schema.tables")))

rows=[{'COUNT(DISTINCT TABLE_NAME)': 112}] more_rows=0 column_definitions=[ColumnDefinition(name='COUNT(DISTINCT TABLE_NAME)', type='FIXED', comment=None, sample_values=None)] query_uuid='01afbd1e-0001-d31e-0022-ba8700a8209e'

Run with default schema

from waii_sdk_py.database import *
>>> WAII.Query.run(RunQueryRequest(query='select current_schema(), current_database();', current_schema=SchemaName(schema_name='INFORMATION_SCHEMA')))

Async submit a query

In order to async submit query, you need 3 methods: Query.submit, Query.get_results, Query.cancel

Submit

Query.submit(params: RunQueryRequest) -> RunQueryResponse

Request fields same as Query.run

Response field:

query_id: the query id you need to get result of the query, or cancel the query

Get Results

Query.get_results(params: GetQueryResultRequest) -> GetQueryResultResponse

Return query result by providing uuid, same as Query.run, but you need to specify query_id instead of query in the request.

Cancel

Query.cancel(params: CancelQueryRequest) -> CancelQueryResponse

Cancel a running query by providing query_id, it is a no-op if the query is completed or failed.

Like

Query.like(params: LikeQueryRequest) -> LikeQueryResponse

This method marks a query (by specifying uuid for generated query, not id from run query) as "liked", which you can fetch it from history.

Describe

If you want to translate SQL to natural language, and explain step-by-step plans, you can use Query.describe method.

Query.describe(params: DescribeQueryRequest) -> DescribeQueryResponse

This method fetches a description of a SQL query.

Parameters:

query: The SQL query you want to describe
search_context: When describe the query, you may want to restrict which table/schema you want to use as context. Normally it is not required, and you should skip it.

Output:

summary: A summary of the query
detailed_steps: Detailed steps of how the query work (in natural language)
tables: Tables used in the query

Example:

>>> print(WAII.Query.describe(DescribeQueryRequest(query='SELECT COUNT(DISTINCT table_name) FROM information_schema.tables')))

summary='How many distinct table names are there in the information_schema.tables?' detailed_steps=['Step 1: Access the information_schema.tables table.', 'Step 2: Count the number of distinct table names.', 'Step 3: Return the count of distinct table names.'] tables=[TableName(table_name='TABLES', schema_name='INFORMATION_SCHEMA', database_name='WAII')]

Diff

This method generates a description of the difference between two SQL queries

Query.diff(params: DiffQueryRequest) -> DiffQueryResponse

Parameters:

Similar to Query.describe, you need to specify two queries you want to diff, and optionally you can specify the search context.
query: The SQL query you want to describe
previous_query: The previous SQL query you want to describe

Output:

Same as Query.describe, but it will also include what_changed field, which tells you what changed in the query.

Example:

>>> print(WAII.Query.diff(DiffQueryRequest(query='SELECT COUNT(DISTINCT table_name) FROM information_schema.tables', previous_query='SELECT COUNT(DISTINCT table_name) FROM information_schema.tables WHERE table_name LIKE \'%a%\'')))

summary='...' detailed_steps=[...] what_changed="The new query does not have any filter on 'table_name' column."

Auto Complete (Experimental)

This method allows you to automatically complete a partial query, this can be useful when you want to build a query editor with Github co-pilot-like auto complete feature.

This is an experimental feature, it may not work well in some cases.

Query.auto_complete(params: AutoCompleteRequest) -> AutoCompleteResponse

Parameters:

text: (required) The partial query you want to auto complete
cursor_offset: (required) The cursor offset of the partial query, it can be in the middle of the text (if you want to insert text in the middle. By default you should set it to
dialect: The dialect of the query, such as snowflake, postgresql, mongodb
search_context: Same as Query.describe, you can specify the search context to limit the tables/schemas used to auto complete.
max_output_tokens: (required) Maximum number of tokens returned, you should set it to a lower value like 50 to make sure it can return within a few seconds.

Output:

text completed text

Example:

>>> print(WAII.Query.auto_complete(AutoCompleteRequest(text='select from', cursor_offset=11, max_output_tokens=50)))

text='WAII.WORLD.CITY WHERE POPULATION > 1000000'

Analyze Performance (Experimental)

Note: this feature currently only support Snowflake

Query.analyze_performance(params: QueryPerformanceRequest) -> QueryPerformanceResponse

Parameters:

query_id: The query id you want to analyze performance (it should be a run query id, not generated query id

Output:

summary: A summary of the runtime of a query as well as recommendations on how to make the query run faster
recommendations: List of recommendations on how to make the query run faster
query_text: The query text of the query (based on query_id)
execution_time_ms: The execution time of the query (based on query_id)
compilation_time_ms: The compilation time of the query (based on query_id)

This method provides a summary of the runtime of a query as well as recommendations on how to make the query run faster

Examples:

>>> print(WAII.Query.analyze_performance(QueryPerformanceRequest(query_id='...')))

'summary': [
    'The metadata operation in Step 1 took 1.0 ms, which is 82% of the total execution time.',
],
'recommendations': [
    "The query is already optimal as it is a simple count distinct operation on the 'table_name' column of the 'waii.information_schema.tables' table. No further optimization is required.",
      ...
],
'query_text': "SELECT COUNT(DISTINCT table_name) FROM waii.information_schema.tables",
'execution_time_ms': 312,
'compilation_time_ms': 247

Semantic Context

The SemanticContext module contains methods related to semantic context handling.

Here are some of its methods:

Modify Semantic Context

SemanticContext.modify_semantic_context(params: ModifySemanticContextRequest) -> ModifySemanticContextResponse

This method allows you to modify the semantic context.

Get Semantic Context

SemanticContext.get_semantic_context(params: GetSemanticContextRequest = GetSemanticContextRequest()) -> GetSemanticContextResponse

This method fetches the current semantic context.

History

The History module contains methods related to history handling.

Here are some of its methods:

List

History.list(params: GetGeneratedQueryHistoryRequest = GetGeneratedQueryHistoryRequest()) -> GetGeneratedQueryHistoryResponse

This method fetches the history of generated queries.

For detailed information about the request and response objects and the available properties, refer to the respective Python files of each module.

Project details

These details have not been verified by PyPI

Project links

Homepage

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.32.0.dev1 pre-release

Jun 26, 2025

1.31.0.dev2 pre-release

Jun 19, 2025

1.31.0.dev1 pre-release

Jun 9, 2025

1.30.0

May 28, 2025

1.30.0.dev1 pre-release

May 27, 2025

1.29.2

Mar 24, 2025

1.29.1

Mar 14, 2025

1.29.0.dev2 pre-release

Mar 6, 2025

1.29.0.dev1 pre-release

Feb 26, 2025

1.28.2

Feb 11, 2025

1.28.1

Feb 4, 2025

1.28.0

Jan 31, 2025

1.25.5

Dec 10, 2024

1.25.4

Dec 4, 2024

1.25.3

Nov 25, 2024

1.25.2

Nov 15, 2024

1.25.1

Nov 13, 2024

1.23.0

Oct 13, 2024

1.22.1

Sep 13, 2024

1.22.0

Sep 13, 2024

1.22.0.dev4 pre-release

Sep 10, 2024

1.22.0.dev3 pre-release

Sep 10, 2024

1.22.0.dev2 pre-release

Aug 31, 2024

1.22.0.dev1 pre-release

Aug 28, 2024

1.21.0

Aug 23, 2024

1.21.0.dev4 pre-release

Aug 22, 2024

1.21.0.dev3 pre-release

Aug 20, 2024

1.21.0.dev2 pre-release

Aug 16, 2024

1.21.0.dev1 pre-release

Aug 16, 2024

1.21.0.dev0 pre-release

Aug 15, 2024

1.20.1

Aug 1, 2024

1.14.6

Jul 16, 2024

1.14.5

Jun 24, 2024

1.14.4

Jun 21, 2024

1.14.3

Jun 15, 2024

1.14.2

Jun 13, 2024

1.14.1

May 29, 2024

1.14.0rc0 pre-release

May 27, 2024

1.13.3

May 8, 2024

1.13.2

Apr 22, 2024

1.13.1

Apr 20, 2024

1.12.7

Apr 11, 2024

1.12.6

Apr 3, 2024

1.12.5

Apr 3, 2024

1.12.4

Apr 1, 2024

1.12.3

Mar 31, 2024

1.12.2

Mar 28, 2024

1.12.1

Mar 26, 2024

1.11.0

Mar 19, 2024

1.10.6

Feb 20, 2024

1.10.5

Feb 17, 2024

1.10.4

Feb 16, 2024

1.10.3

Feb 8, 2024

1.10.2

Feb 8, 2024

This version

1.10.1

Feb 4, 2024

1.10.0

Feb 4, 2024

1.9.1

Jan 22, 2024

1.9.0

Jan 19, 2024

1.7.9

Dec 7, 2023

1.7.8

Nov 22, 2023

1.7.7

Nov 13, 2023

1.7.6

Nov 5, 2023

1.7.5

Nov 2, 2023

1.7.4

Oct 31, 2023

1.7.3

Oct 19, 2023

1.7.2

Oct 18, 2023

1.7.1

Oct 18, 2023

0.1

Aug 9, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

waii-sdk-py-1.10.1.tar.gz (26.0 kB view details)

Uploaded Feb 4, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

waii_sdk_py-1.10.1-py3-none-any.whl (24.1 kB view details)

Uploaded Feb 4, 2024 Python 3

File details

Details for the file waii-sdk-py-1.10.1.tar.gz.

File metadata

Download URL: waii-sdk-py-1.10.1.tar.gz
Upload date: Feb 4, 2024
Size: 26.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for waii-sdk-py-1.10.1.tar.gz
Algorithm	Hash digest
SHA256	`23f41403052126722fb68c0b3ccbff64361eec687b0b8d054739ab6d3bfbee4e`
MD5	`a869f39f3471086a8824bb70aa93ecdd`
BLAKE2b-256	`4beed6320c083592f37e101f223b5807cc6b4f38e16a0c8cb4eaf913db0b0b3d`

See more details on using hashes here.

File details

Details for the file waii_sdk_py-1.10.1-py3-none-any.whl.

File metadata

Download URL: waii_sdk_py-1.10.1-py3-none-any.whl
Upload date: Feb 4, 2024
Size: 24.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for waii_sdk_py-1.10.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b6d644d6304170fb1ee4af26929f81b0c32256d0e4344e11e7c40360cd601c04`
MD5	`a53897c95921090e9624f7c59fd39d9a`
BLAKE2b-256	`f23412dece0f22c3d6688d85e03a4f7764f1bfb05d79ecb3e0e8c61c335374e2`

See more details on using hashes here.

waii-sdk-py 1.10.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Waii Python SDK Documentation

Installation

Importing & Initialize the SDK

Get started

Use Notebook

Modules

Database

Modify Connections

Snowflake

PostgreSQL

MongoDB

Get Connections

Activate Connection

Get Catalogs

Update Table and Schema Descriptions

Query

Generate Query

Query.generate returns GeneratedQuery object, which contains the following fields:

Generate Question

Transcode

Query.transcode returns GeneratedQuery object

Run a query (sync)

Async submit a query

Submit

Get Results

Cancel

Like

Describe

Diff

Auto Complete (Experimental)

Analyze Performance (Experimental)

Semantic Context

Modify Semantic Context

Get Semantic Context

History

List

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`Query.generate` returns `GeneratedQuery` object, which contains the following fields:

`Query.transcode` returns `GeneratedQuery` object