Skip to main content

Add your description here

Project description

NL2SQLEvaluator

Roadmap

  • Add MySQL database executor
  • Add Precision, Recall, F1 metrics for ambiguity Text2SQL datasets

👷🏼‍♂️ Work in progress

Configuration Guide

🔧 Use a YAML config (with CLI overrides)

Run your experiment with a config file:

nl2sql_eval --config path/to/config.yaml

Example config.yaml

# Core
output_dir: ./outputs
seed: 42

# Dataset
relative_db_base_path: data/bird_dev/dev_databases
dataset_path: simone-papicchio/bird
dataset_name: bird-dev

# Model
model_name: Qwen3-Coder-30B
model: Qwen/Qwen3-Coder-30B-A3B-Instruct
temperature: 0.7
top_p: 0.8
top_k: 20
repetition_penalty: 1.05
max_tokens: 32000

# Weights & Biases
project: text2sql-eval
entity: spapicchio-politecnico-di-torino   # or your team
group: evals
mode: online                               # or "offline" on clusters without net
tags: [eval, seg]
notes: ""
job_type: eval

Override any value from the CLI

Command-line flags take precedence over the YAML:

nl2sql_eval --config config.yaml \
  --output_dir ./outputs/run-42 \
  --mode offline \
  --temperature 0.2 \
  --max_tokens 4096 \
  --tags eval --tags ablation

Notes

  • The config is flat (all keys at top level) so it works smoothly with the parser.
  • Lists (e.g., tags) can be provided in YAML or by repeating the flag in CLI (--tags ... multiple times).
  • Booleans accept true/false in YAML and --flag true/false in CLI.
  • This package uses TRL’s TRLParser / HF’s HfArgumentParser under the hood, so the same configuration behaviors apply.

This package relies on TrlParser so all the configurations available there can be used as well.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nl2sqlevaluator-1.0.0.tar.gz (186.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nl2sqlevaluator-1.0.0-py3-none-any.whl (36.3 kB view details)

Uploaded Python 3

File details

Details for the file nl2sqlevaluator-1.0.0.tar.gz.

File metadata

  • Download URL: nl2sqlevaluator-1.0.0.tar.gz
  • Upload date:
  • Size: 186.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.7.13

File hashes

Hashes for nl2sqlevaluator-1.0.0.tar.gz
Algorithm Hash digest
SHA256 2e7eb5fe16bf6d14b614c75f0f336318602a91c7821b2cb8f64756b2f77f54bc
MD5 884ec7a21dc2646defebc89050606f74
BLAKE2b-256 157b2759a99583d7c5c73a2f878d0b0177169e3c4766a48b3e006bb3b69e3453

See more details on using hashes here.

File details

Details for the file nl2sqlevaluator-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for nl2sqlevaluator-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a5144b45b6bda1b3ae34bed37c527b0a2d17f2bdf642b6c0c37535f6ce18ee6b
MD5 ecbfa1e72945d480b9243cc59f2d2b0f
BLAKE2b-256 19904811c98f07590930b8d4735241c41ad855b5ea6a0cda516c7824e66c62a9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page