A modular text-to-SQL toolkit.

Project description

🐷 piglets

A modular library of text-to-SQL tools.

Status

piglets is currently an alpha-stage package. The API is expected to evolve before 1.0.

Get started

Install

With pip, inside a virtual environment:

pip install piglets

With uv:

uv add piglets

Install the optional dependency for the model provider you use. For OpenAI:

With pip, inside a virtual environment:

pip install "piglets[openai]"

With uv:

uv add "piglets[openai]"

Other provider extras include anthropic, google_genai, google_vertexai, bedrock, cohere, mistralai, groq, ollama, and openrouter.

Install the optional dependency for the database backend you use. For BigQuery:

With pip, inside a virtual environment:

pip install "piglets[bigquery]"

With uv:

uv add "piglets[bigquery]"

Logical planning

Use gpt-5.2 to generate three logical plans from a natural-language query and aggregate them into a single plan.

from piglets import LogicalPlanner

# initialise a logical planner
logical_planner = LogicalPlanner('gpt-5.2')

# generate 3 logical plan samples and aggregate them
logical_plan = logical_planner.plan(
    natural_language_query="What was the average number of piglets per week for Q4 2025?",
    num_samples=3,
)

# print the aggregated logical plan
for i, step in enumerate(logical_plan.logical_steps, start=1):
    print(f"Step {i}:")
    print(step)

# inspect the candidate plans used to create the aggregate
print(f"Aggregated from {len(logical_plan.sample_plans)} sample plans.")

Example output (truncated):

Step 1:
1. Identify all piglet birth (or piglet addition) events with their event dates and piglet counts.
Step 2:
2. Filter the events to the Q4 2025 date range (Oct 1, 2025 through Dec 31, 2025).
Step 3:
3. Assign each event to a calendar week within that quarter using a consistent week definition (e.g., week starting Monday or Sunday).
...
Aggregated from 3 sample plans.

Database connector

Use DatabaseConnector to inspect a supported database and return a typed schema.

from piglets import DatabaseConnector

database_connector = DatabaseConnector(
    database_type="bigquery",
    database_name="my_bigquery_dataset",
)

database = database_connector.get_database_schema()

print(database.name)
for table in database.tables:
    print(table.name)
    for column in table.columns:
        print(f"- {column.name} ({column.data_type})")

BigQuery connections use the GOOGLE_CLOUD_PROJECT_ID environment variable by default. You can also pass gcp_project_id directly:

database_connector = DatabaseConnector(
    database_type="bigquery",
    database_name="my_bigquery_dataset",
    gcp_project_id="my-gcp-project",
)
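The lookup order described above can be sketched with the standard library alone. This is a minimal illustration, not piglets' implementation: resolve_project_id is a hypothetical helper, and the assumption that an explicit argument takes precedence over the environment variable is inferred from the text rather than confirmed.

```python
import os
from typing import Optional

ENV_VAR = "GOOGLE_CLOUD_PROJECT_ID"  # variable name taken from the text above


def resolve_project_id(gcp_project_id: Optional[str] = None) -> str:
    """Hypothetical helper: prefer the explicit argument, else the environment."""
    if gcp_project_id:
        return gcp_project_id
    project_id = os.environ.get(ENV_VAR)
    if not project_id:
        # Neither source supplied a project, so fail with an actionable message.
        raise ValueError(f"Pass gcp_project_id or set {ENV_VAR}.")
    return project_id
```

Failing early with a clear message, rather than letting the database client raise later, is a design choice of this sketch only.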

Supported Databases

Arguments passed to DatabaseConnector are used to build a connection URL of the form:

dialect+driver://username:password@host:port/database

or, in the case of BigQuery:

bigquery://project_id/dataset

and, in the case of Snowflake:

snowflake://username:password@account/database

More intuitive parameter names and optional dependencies will be added shortly for all major cloud data warehouses and lakehouses.
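To make the URL forms above concrete, here is a small sketch that assembles them from keyword arguments. build_url and its parameter names are hypothetical and are not part of piglets; the example values (driver psycopg2, user ana, database farm) are made up for illustration.

```python
from typing import Optional
from urllib.parse import quote


def build_url(
    dialect: str,
    database: str,
    driver: Optional[str] = None,
    username: Optional[str] = None,
    password: Optional[str] = None,
    host: Optional[str] = None,
    port: Optional[int] = None,
) -> str:
    """Hypothetical helper: dialect+driver://username:password@host:port/database."""
    scheme = f"{dialect}+{driver}" if driver else dialect
    credentials = ""
    if username:
        credentials = quote(username, safe="")
        if password:
            # Percent-encode so characters such as "@" or "/" cannot break the URL.
            credentials += ":" + quote(password, safe="")
        credentials += "@"
    location = host or ""
    if port is not None:
        location += f":{port}"
    return f"{scheme}://{credentials}{location}/{database}"


# Generic form, with a password that needs escaping.
print(build_url("postgresql", "farm", driver="psycopg2", username="ana",
                password="p@ss/word", host="localhost", port=5432))
# BigQuery form: the project takes the host position, the dataset the database position.
print(build_url("bigquery", "my_bigquery_dataset", host="my-gcp-project"))
```

In real code, SQLAlchemy's URL.create is likely a better choice than hand-built strings, since it handles the same escaping.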

| Database type | `database_type` value | Install requirement | Notes |
| --- | --- | --- | --- |
| SQLite | `sqlite` | Included by default | Uses SQLAlchemy's built-in SQLite dialect. |
| MySQL | `mysql` | SQLAlchemy dialect included by default | Requires a compatible MySQL DBAPI driver. |
| PostgreSQL | `postgresql` | SQLAlchemy dialect included by default | Requires a compatible PostgreSQL DBAPI driver. |
| Oracle | `oracle` | SQLAlchemy dialect included by default | Requires a compatible Oracle DBAPI driver. |
| Microsoft SQL Server | `mssql` | SQLAlchemy dialect included by default | Requires a compatible SQL Server DBAPI driver. |
| BigQuery | `bigquery` | `piglets[bigquery]` | Uses `GOOGLE_CLOUD_PROJECT_ID` or `gcp_project_id` for the GCP project. |
| Snowflake | `snowflake` | `piglets[snowflake]` | Uses the Snowflake SQLAlchemy dialect and Snowflake Connector for Python. |

Dual-pathway pruning

Use Pruner to reduce a database schema with both preservation and deletion signals. The preservation pathway selects tables and columns that look useful for the query. The deletion pathway removes tables and columns that look irrelevant. dual_pathway_pruning() combines both paths into a final Database schema.

from piglets import DatabaseConnector, LogicalPlanner, Pruner

question = "Which tags saw the largest increase in average answer score from 2022 to 2023, considering only questions with at least 5 answers?"

logical_planner = LogicalPlanner("gpt-5.2")
logical_plan = logical_planner.plan(
    natural_language_query=question,
    num_samples=3,
)

database_connector = DatabaseConnector(
    database_type="bigquery",
    database_name="stack_overflow",
)
database = database_connector.get_database_schema()

pruner = Pruner(model_name="gpt-5.2")
pruned_database = pruner.dual_pathway_pruning(
    natural_language_query=question,
    database=database,
    logical_plan=logical_plan,
)

print(pruned_database.export_as_string())
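How the two pathways are reconciled is not spelled out here, so the following is only a sketch of one plausible rule: a column is dropped only when the deletion pathway flags it and the preservation pathway does not. combine_pathways, the dictionary-based schema, and the table and column names are all hypothetical stand-ins, not piglets objects.

```python
from typing import Dict, Set

Schema = Dict[str, Set[str]]  # table name -> column names


def combine_pathways(schema: Schema, preserved: Schema, deleted: Schema) -> Schema:
    """Assumed rule: drop a column only if it is marked for deletion and not preserved."""
    pruned: Schema = {}
    for table, columns in schema.items():
        keep = {
            column
            for column in columns
            if column in preserved.get(table, set())
            or column not in deleted.get(table, set())
        }
        if keep:  # a table with no surviving columns is removed entirely
            pruned[table] = keep
    return pruned


# Made-up schema loosely matching the question in the example above.
schema = {
    "posts_questions": {"id", "tags", "answer_count", "body"},
    "posts_answers": {"id", "parent_id", "score", "creation_date", "body"},
    "badges": {"id", "name"},
}
preserved = {
    "posts_questions": {"id", "tags", "answer_count"},
    "posts_answers": {"parent_id", "score", "creation_date"},
}
deleted = {
    "posts_questions": {"body"},
    "posts_answers": {"body", "score"},  # conflicts with the preservation pathway
    "badges": {"id", "name"},
}

pruned = combine_pathways(schema, preserved, deleted)
for table in sorted(pruned):
    print(table, sorted(pruned[table]))
```

Letting preservation win on conflicts keeps potentially useful columns at the cost of a slightly larger schema; the opposite tie-break is equally easy to express.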

Current scope

Database

DatabaseConnector connects to a database by database_name and returns a Database object containing Table and Column objects. The backends it accepts are listed under Supported Databases.
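The shape of the returned schema can be pictured with plain dataclasses. The classes below are stand-ins written for illustration and only mirror the attributes used in the connector example (name, tables, columns, data_type); the real piglets classes may carry more fields. The litters table is made up.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Column:
    name: str
    data_type: str


@dataclass
class Table:
    name: str
    columns: List[Column] = field(default_factory=list)


@dataclass
class Database:
    name: str
    tables: List[Table] = field(default_factory=list)


def describe(database: Database) -> List[str]:
    """Flatten a schema into the lines printed by the connector example."""
    lines = [database.name]
    for table in database.tables:
        lines.append(table.name)
        for column in table.columns:
            lines.append(f"- {column.name} ({column.data_type})")
    return lines


example = Database(
    name="my_bigquery_dataset",
    tables=[Table("litters", [Column("born_on", "DATE"), Column("piglet_count", "INT64")])],
)
print("\n".join(describe(example)))
```

A stand-in like this can be handy for testing code that walks a schema without opening a database connection.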

Planning

The first included primitive is a LogicalPlanner that turns a natural-language analytics question into an ordered list of abstract logical steps.

LogicalPlanner has a plan method that generates a single plan or, via num_samples, samples several plans and aggregates them.

Plan aggregation is available through LogicalPlans.aggregate(). Aggregated plans include a sample_plans attribute containing the candidate LogicalPlan objects used to produce the final plan.
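The relationship between an aggregated plan and its sample_plans can be illustrated with a small stand-in. The aggregation strategy piglets uses is not described here, so the majority vote below is an assumption chosen for simplicity; PlanSketch and aggregate_by_majority are hypothetical names.

```python
from collections import Counter
from dataclasses import dataclass, field
from typing import List


@dataclass
class PlanSketch:
    """Stand-in for a logical plan; field names mirror the attributes described above."""
    logical_steps: List[str]
    sample_plans: List["PlanSketch"] = field(default_factory=list)


def aggregate_by_majority(samples: List[PlanSketch]) -> PlanSketch:
    """Assumed strategy: return the step sequence that most samples agree on."""
    if not samples:
        raise ValueError("At least one sample plan is required.")
    counts = Counter(tuple(plan.logical_steps) for plan in samples)
    winning_steps, _ = counts.most_common(1)[0]
    # Keep the candidates so callers can inspect what the aggregate was built from.
    return PlanSketch(logical_steps=list(winning_steps), sample_plans=list(samples))


samples = [
    PlanSketch(["Filter to Q4 2025", "Group by week", "Average the counts"]),
    PlanSketch(["Filter to Q4 2025", "Group by week", "Average the counts"]),
    PlanSketch(["Group by month", "Average the counts"]),
]
plan = aggregate_by_majority(samples)
print(plan.logical_steps)
print(f"Aggregated from {len(plan.sample_plans)} sample plans.")
```

An exact-match vote only works when samples are worded identically, so a model-based or similarity-based merge is likely more practical for free-text steps.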

Pruning

Pruner supports preservation pruning, deletion pruning, and dual-pathway pruning. Preservation pruning returns a PreservationSet of useful tables and columns. Deletion pruning returns a DeletionSet of irrelevant tables and columns. Dual-pathway pruning combines both into a final pruned Database.

Project details

Release history

0.1.18: May 18, 2026
0.1.17: May 18, 2026
0.1.16: May 17, 2026
0.1.15: May 16, 2026
0.1.14: May 3, 2026
0.1.13: Apr 21, 2026
0.1.12: Apr 19, 2026
0.1.11 (this version): Apr 18, 2026
0.1.10: Apr 16, 2026
0.1.9: Apr 16, 2026
0.1.8: Apr 15, 2026
0.1.7: Apr 14, 2026
0.1.6: Apr 13, 2026
0.1.5: Apr 12, 2026
0.1.4: Apr 12, 2026
0.1.3: Apr 11, 2026
0.1.2: Apr 11, 2026
0.1.1: Apr 11, 2026
0.1.0: Apr 11, 2026

Download files

Source Distribution

piglets-0.1.11.tar.gz (15.8 kB)

Uploaded Apr 18, 2026 Source

Built Distribution

piglets-0.1.11-py3-none-any.whl (16.1 kB)

Uploaded Apr 18, 2026 Python 3

File details

Details for the file piglets-0.1.11.tar.gz.

File metadata

Download URL: piglets-0.1.11.tar.gz
Upload date: Apr 18, 2026
Size: 15.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for piglets-0.1.11.tar.gz
Algorithm	Hash digest
SHA256	`7626b8d36d96e9e59c4b141bd22b16e72043dab18b5480f5dac099ab081fd7d6`
MD5	`2930bd3a7ba28cc3a2ecdd61bbf8a70a`
BLAKE2b-256	`eb7bcaff383a31d4906edb067d53ebd71e06fc5ade65ac0a1096acdd4a5bec97`

Provenance

The following attestation bundles were made for piglets-0.1.11.tar.gz:

Publisher: publish.yml on mportdata/piglets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: piglets-0.1.11.tar.gz
- Subject digest: 7626b8d36d96e9e59c4b141bd22b16e72043dab18b5480f5dac099ab081fd7d6
- Sigstore transparency entry: 1338808261
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: mportdata/piglets@ca448129af5160bb60f3b0bc553e5720de76a673
- Branch / Tag: refs/tags/v0.1.11
- Owner: https://github.com/mportdata
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ca448129af5160bb60f3b0bc553e5720de76a673
- Trigger Event: push

File details

Details for the file piglets-0.1.11-py3-none-any.whl.

File metadata

Download URL: piglets-0.1.11-py3-none-any.whl
Upload date: Apr 18, 2026
Size: 16.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for piglets-0.1.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`277dfb60e0e3e06e548c23864021a9b88cb96633b9cec3dd55c5b1843433b2b8`
MD5	`09f4884258730217aa273a1c78acf189`
BLAKE2b-256	`a24037dd759cb37f362322717f139ae2e41fea17693c8872d45e63ba46196bc4`

Provenance

The following attestation bundles were made for piglets-0.1.11-py3-none-any.whl:

Publisher: publish.yml on mportdata/piglets

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: piglets-0.1.11-py3-none-any.whl
- Subject digest: 277dfb60e0e3e06e548c23864021a9b88cb96633b9cec3dd55c5b1843433b2b8
- Sigstore transparency entry: 1338808266
- Sigstore integration time: Apr 18, 2026
Source repository:
- Permalink: mportdata/piglets@ca448129af5160bb60f3b0bc553e5720de76a673
- Branch / Tag: refs/tags/v0.1.11
- Owner: https://github.com/mportdata
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ca448129af5160bb60f3b0bc553e5720de76a673
- Trigger Event: push
