LLMSQL: Benchmark for Text-to-SQL

These details have not been verified by PyPI

Project description

LLMSQL

Patched and improved version of the original large crowd-sourced dataset for developing natural language interfaces for relational databases, WikiSQL.

Our datasets are available for different scenarios on our HuggingFace page.

Overview

Install

pip3 install llmsql

This repository provides the LLMSQL Benchmark — a modernized, cleaned, and extended version of WikiSQL, designed for evaluating and fine-tuning large language models (LLMs) on Text-to-SQL tasks.

Note

The package doesn't have the dataset, it is stored on our HuggingFace page.

This package contains

Support for modern LLMs.
Tools for evaluation, inference, and finetuning.
Support for Hugging Face models out-of-the-box.
Structured for reproducibility and benchmarking.

Usage Recommendations

Modern LLMs are already strong at producing SQL queries without finetuning.
We therefore recommend that most users:

Run inference directly on the full benchmark:
- Use llmsql.LLMSQLVLLMInference (the main inference class) for generation of SQL predictions with your LLM from HF.
- Evaluate results against the benchmark with the llmsql.LLMSQLEvaluator evaluator class.
Optional finetuning:
- For research or domain adaptation, we provide finetuning script for HF models. Use llmsql finetune --help or read Finetune Readme to find more about finetuning.

[!Tip] You can find additional manuals in the README files of each folder(Inferece Readme, Evaluation Readme, Finetune Readme)

Repository Structure


WikiSQLv2/
├── evaluation/          # Scripts for downloading DB + evaluating predictions
├── inference/           # Generate SQL queries with your LLM
└── finetune/            # Fine-tuning with TRL's SFTTrainer

Quickstart

Install

Make sure you have the package installed (we used python3.11):

pip3 install llmsql

1. Run Inference

from llmsql import LLMSQLVLLMInference

# Initialize inference engine
inference = LLMSQLVLLMInference(
    model_name="Qwen/Qwen2.5-1.5B-Instruct",  # or any Hugging Face causal LM
    tensor_parallel_size=1,
)

# Run generation
results = inference.generate(
    output_file="path_to_your_outputs.jsonl",
    questions_path="data/questions.jsonl",
    tables_path="data/tables.jsonl",
    shots=5,
    batch_size=8,
    max_new_tokens=256,
    temperature=0.7,
)

2. Evaluate Results

from llmsql import LLMSQLEvaluator

evaluator = LLMSQLEvaluator(workdir_path="llmsql_workdir")
report = evaluator.evaluate(outputs_path="path_to_your_outputs.jsonl")
print(report)

Finetuning (Optional)

If you want to adapt a base model on LLMSQL:

llmsql finetune --config_file examples/example_finetune_args.yaml

This will train a model on the train/val splits with the parameters provided in the config file. You can find example config file here.

Suggested Workflow

Primary: Run inference on dataset/questions.jsonl → Evaluate with evaluation/.
Secondary (optional): Fine-tune on train/val → Test on test_questions.jsonl.

License & Citation

Please cite LLMSQL if you use it in your work:

@inproceedings{llmsql_bench,
  title={LLMSQL: Upgrading WikiSQL for the LLM Era of Text-to-SQLels},
  author={Pihulski, Dzmitry and  Charchut, Karol and Novogrodskaia, Viktoria and Koco{'n}, Jan},
  booktitle={2025 IEEE International Conference on Data Mining Workshops (ICDMW)},
  year={2025},
  organization={IEEE}
}

Project details

These details have not been verified by PyPI

Development Status
- 3 - Alpha
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.1.16

Mar 5, 2026

0.1.15

Feb 24, 2026

0.1.14

Dec 15, 2025

0.1.13

Dec 2, 2025

0.1.11

Oct 16, 2025

0.1.10

Oct 16, 2025

0.1.9

Oct 16, 2025

0.1.7

Oct 16, 2025

0.1.6

Oct 16, 2025

0.1.5

Oct 15, 2025

0.1.4

Oct 13, 2025

0.1.3

Sep 25, 2025

This version

0.1.2

Sep 24, 2025

0.1.1

Sep 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llmsql-0.1.2.tar.gz (17.6 kB view details)

Uploaded Sep 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llmsql-0.1.2-py3-none-any.whl (19.6 kB view details)

Uploaded Sep 24, 2025 Python 3

File details

Details for the file llmsql-0.1.2.tar.gz.

File metadata

Download URL: llmsql-0.1.2.tar.gz
Upload date: Sep 24, 2025
Size: 17.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llmsql-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`0c679ac1cc141edfa2bbdf5b801fa6f02fa9d89e13527b50ef69bca5b78d7349`
MD5	`0a302f03ce3d8ad9d16e57ef24a75c40`
BLAKE2b-256	`4b24fce8eaf9dc45c74c02427fdbcb90e82c0b4a5714787753e1bf4ed86fd442`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llmsql-0.1.2.tar.gz:

Publisher: publish.yml on LLMSQL/llmsql-benchmark

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llmsql-0.1.2.tar.gz
- Subject digest: 0c679ac1cc141edfa2bbdf5b801fa6f02fa9d89e13527b50ef69bca5b78d7349
- Sigstore transparency entry: 555502240
- Sigstore integration time: Sep 24, 2025
Source repository:
- Permalink: LLMSQL/llmsql-benchmark@a3bc2d3e26871aebbfb6437a158e8b19358eebac
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/LLMSQL
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a3bc2d3e26871aebbfb6437a158e8b19358eebac
- Trigger Event: push

File details

Details for the file llmsql-0.1.2-py3-none-any.whl.

File metadata

Download URL: llmsql-0.1.2-py3-none-any.whl
Upload date: Sep 24, 2025
Size: 19.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llmsql-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`949344c9ea4c450d50a00fb8d7f34164db27beadd0905cdb6422b529ac45b455`
MD5	`e4e538f10714519ed524108e5458d5fb`
BLAKE2b-256	`ce134d494f52d448ac3eaa42a15ef98c6343b3e76704a2a4156d3643a9658680`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llmsql-0.1.2-py3-none-any.whl:

Publisher: publish.yml on LLMSQL/llmsql-benchmark

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llmsql-0.1.2-py3-none-any.whl
- Subject digest: 949344c9ea4c450d50a00fb8d7f34164db27beadd0905cdb6422b529ac45b455
- Sigstore transparency entry: 555502250
- Sigstore integration time: Sep 24, 2025
Source repository:
- Permalink: LLMSQL/llmsql-benchmark@a3bc2d3e26871aebbfb6437a158e8b19358eebac
- Branch / Tag: refs/tags/v0.1.2
- Owner: https://github.com/LLMSQL
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a3bc2d3e26871aebbfb6437a158e8b19358eebac
- Trigger Event: push

llmsql 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

LLMSQL

Our datasets are available for different scenarios on our HuggingFace page.

Overview

Install

Note

This package contains

Usage Recommendations

Repository Structure

Quickstart

Install

1. Run Inference

2. Evaluate Results

Finetuning (Optional)

Suggested Workflow

License & Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance