NLP interface for pypreql.
Project description
PreQL NLP
Natural language interface for generating PreQL query objects.
PreQL is easier for a large language model to interact with as it requires only extract relevant concepts from a text, classifying them as metrics or dimensions, and mapping them to what is available in a model.
This makes it more testable and less prone to hallucination than generating SQL directly.
Requires setting the following environment variables
- OPENAI_API_KEY
- OPENAI_MODEL
Recommended to use "gpt-3.5-turbo" or higher as the model.
Examples
Basic BQ example
from trilogy_public_models import models
from preql import Executor, Dialects
from preql_nlp import build_query
# define the model we want to parse
environment = models["bigquery.stack_overflow"]
# set up preql executor
# default bigquery executor requires local default credentials configured
executor = Dialects.BIGQUERY.default_executor(environment= environment)
# build a query off text and the selected model
processed_query = build_query(
"How many questions are asked per year?",
environment,
)
# make sure we got reasonable outputs
for concept in processed_query.output_columns:
print(concept.name)
# and run that to get our answer
results = executor.execute_query(processed_query)
for row in results:
print(row)
Setting Up Your Environment
Recommend that you work in a virtual environment with requirements from both requirements.txt and requirements-test.txt installed. The latter is necessary to run tests (surprise).
Pypreql-nlp is python 3.10+
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pypreql_nlp-0.0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 170e080a4eeacf7884e0ff36f9bab1408893b4503b62911361402c305c027b50 |
|
MD5 | b26684c5762c57563710984437c01d5f |
|
BLAKE2b-256 | 7e9df22db51dc0313a3114ee6e2c01c2f1333c64d0c97912d7277cdeaf919db2 |