Skip to main content

This is package designed for auto-generation of contracts and tests in the soda syntax

Project description

Automatic Contract Creation

Automatic Contract Creation is a Python library that is designed to auto-generate various types of contracts based on the soda syntax using LLM.

Features

In the current version, connection to DB such as Clickhouse and Trino is configured. By connecting to the database, you can generate and/or upload a statistical report, as well as generate a soda contract.

Get Started

Requirements

trino==0.327.0
clickhouse-connect==0.6.22
pandas==2.2.3
polars==0.20.21
openai==1.37.0
regex==2024.9.11
load-dotenv==0.1.0
python=3.11.5
pyarrow==17.0.0
The rule based approach uses soda version 3.3.5 (SODAGenerateContract).
The LLM approach uses soda versions 3.1.5(LlamaGeneratorSODAContract).

Install and run

You can install the library using pip:

pip install automatic-contract-creation

Import the necessary objects. To use all the functionality, you can contact SODAGenerator

from automatic_contract_creation import SODAGenerator
from automatic_contract_creation import LlamaGenerateSuite
from automatic_contract_creation import LlamaGeneratorSODAContract
To rule based approach:

Create a connection object by passing the connector type and credits in dictionary format. Use the methods to auto-generate a contract or generate a statistical report.

To LLM approach:

Create a file in the virtual environment ".env" with its credits example

connection_name='clickhouse or trino'
host='hostname'
port=port
user='username'
password='password'
llama_api_url='your api url for llm'



token='token_omd'

use your method

print(LlamaGenerateSuite('db_name.table_name').get_suite())# prints the tests (the path is specified by the click example)
LlamaGenerateSuite('catalog.schema.table').generate_suite()# upload a yaml file with tests (the path is specified following the example of trino)
LlamaGenerateSuite('catalog.schema.table').save_profiling()# will save the profiling
print(LlamaGeneratorSODAContract('catalog.schema.table').generate_schema_contract())# will print the contract
LlamaGeneratorSODAContract('catalog.schema.table').get_soda_contracts()# will download the yaml file with the contract

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

automatic_contract_creation-2.0.tar.gz (29.4 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file automatic_contract_creation-2.0.tar.gz.

File metadata

File hashes

Hashes for automatic_contract_creation-2.0.tar.gz
Algorithm Hash digest
SHA256 0e88fa66b1b06fb38dbebb0cd9fff3f2b0b03b127a2ff74d58502ec0ef621526
MD5 a7d01c04787ef6967dbc2a4bf9df477a
BLAKE2b-256 68eca7e1f98907344d48a58fac982f1a50c62e67e43a9cff2f1207aa079e7814

See more details on using hashes here.

File details

Details for the file automatic_contract_creation-2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for automatic_contract_creation-2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 eb242e37723b05706d7eeaee9f463abccbe0245213320dd972d1f89c2004c566
MD5 025a6bdd2ef329aa82963ffc4e4aece8
BLAKE2b-256 8b97045aae458c324d155f48354668a33cade0ab4f56ee3375f23aecafa0b219

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page