Skip to main content

Your code says what it should do. Quell proves it.

Project description

quelltest

Your docstrings say what your code should do. Quelltest proves it.

PyPI Python License: MIT Docs

Quelltest reads your Python docstrings, Pydantic models, and PySpark schemas — extracts every testable requirement — generates a verified pytest test for each gap — and writes it to disk only after proving it catches real violations.

No LLM key required. No code leaves your machine.

pip install quelltest
quell check src/ --fix --no-llm

What "verified" means

Every generated test goes through two phases before it's written:

  1. Phase 1 — pass on correct code. The test runs against your original source. If it fails here, something is wrong with the generated test itself. Rejected.
  2. Phase 2 — fail on violated code. Quelltest injects a targeted violation (comments out the raise, weakens the Field bound, flips nullable=False). The test must fail. If it passes, it proved nothing. Rejected.

Only tests that survive both phases are written. This is the moat between Quelltest and every other test generation tool.


Use cases

Docstrings with Raises: / Returns:

def process_payment(amount: float, currency: str) -> PaymentResponse:
    """
    Process a payment.

    Args:
        amount: Payment amount in the specified currency.
        currency: ISO 4217 currency code.

    Returns:
        PaymentResponse with a unique transaction_id.

    Raises:
        ValueError: If amount is <= 0.
        ValueError: If currency is not supported.
    """
quell check src/payments.py --fix --no-llm
# → writes tests/test_payments.py with 2 verified MUST_RAISE tests

Generated test (simplified):

def test_process_payment_raises_valueerror_amount():
    """MUST_RAISE ValueError — auto-generated by Quelltest 0.6.9."""
    with pytest.raises(ValueError):
        process_payment(amount=0, currency="USD")

This test is verified: it passes on the original process_payment and fails when the raise ValueError is commented out.


Pydantic v2 Field constraints

from pydantic import BaseModel, Field
from typing import Literal

class PaymentRequest(BaseModel):
    amount: float = Field(gt=0, le=10_000)
    currency: Literal["USD", "EUR", "GBP", "INR"]
    idempotency_key: str = Field(min_length=8, max_length=64)
quell check src/models.py --fix --no-llm
# → BOUNDARY test for amount (>0, ≤10000)
# → ENUM_VALID test for currency (must be one of 4 values)
# → BOUNDARY test for idempotency_key (length 8–64)

PySpark schemas

from pyspark.sql.types import StructType, StructField, DoubleType, StringType

payment_schema = StructType([
    StructField("amount", DoubleType(), nullable=False),
    StructField("currency", StringType(), nullable=False),
    StructField("transaction_id", StringType(), nullable=True),
])
quell check src/schemas.py --fix --no-llm
# → NOT_NULL test for amount
# → NOT_NULL test for currency
# → TYPE_CHECK test for schema structure

Requirement coverage in CI

# Fail the pipeline if requirement coverage drops below 80%
quell check src/ --ci --threshold 0.80

# Post a gap report as a PR comment
quell pr 42 --comment

GitHub Actions workflow (generated by quell install --pr):

- name: Quelltest requirement coverage
  run: |
    pip install quelltest
    quell check src/ --ci --threshold 0.80

Reproduce a bug from plain English

quell reproduce "payment function accepts zero amount and does nothing"
# → finds the relevant function
# → generates a failing test that proves the bug
# → test is verified against the current (buggy) code

All commands

Command LLM? What it does
quell check src/ No Scan for requirement gaps
quell check src/ --fix No Generate + verify tests for every gap
quell check src/ --fix --no-llm Never Same, rule engine only
quell check src/ --ci --threshold 0.8 No Fail CI if coverage < 80%
quell pr <N> --comment No Post gap report as PR comment
quell score --badge No Print coverage score + SVG badge
quell reproduce "<bug>" Yes Turn bug description into failing test
quell prove src/file.py No Show requirement coverage for one file
quell install --pr No Write GitHub Actions workflow
quell auth login Login for LLM features

How it works

Spec readers (docstring, Pydantic, PySpark)
        │
        ▼
  list[Requirement]
        │
        ▼
Coverage checker (AST scan of test files — no test execution)
        │
        ▼
Rule engine (deterministic, no LLM) → covers ~75% of requirements
LLM engine (fallback, only if configured) → covers complex cases
        │
        ▼
Verification engine — THE MOAT
  Phase 1: test passes on original code
  Phase 2: test fails on violated code
  Always restores source in finally block
        │
        ▼
Writer (libcst injection — preserves formatting)
  Backup before write. Validate CST. Restore on failure.

Constraint kinds

Kind Detected from Violation injected
MUST_RAISE Raises: in docstring Comments out the raise
MUST_RETURN Returns: in docstring Returns None instead
BOUNDARY Field(gt=0, le=100) Sets value outside bound
ENUM_VALID Literal["a", "b"] Passes "__invalid__"
NOT_NULL nullable=False in StructField Passes None
TYPE_CHECK PySpark StructType Wrong column type

Why not just use an LLM?

LLM-generated tests match current behavior, not stated requirements. If your code has a bug (accepts zero amounts), an LLM writing tests against the live code will write a test that passes amount=0 — because that's what the code currently does.

Quelltest reads the requirement (Raises: ValueError: If amount is <= 0) and generates a test that proves the requirement is enforced. If the code is buggy, the test fails. That's the point.


Comparison

Quelltest pytest-cov Hypothesis LLM test gen
Reads existing specs Partial
No LLM required
Verifies tests catch violations
Requirement-level coverage
Preserves code formatting ✅ (libcst)
CI gate

Installation

pip install quelltest

Requires Python 3.11+. Works on Linux, macOS, Windows.

Optional extras

pip install quelltest[pyspark]   # PySpark schema scanning
pip install quelltest[mcp]       # MCP server for AI agents

Configuration

quell init   # adds [tool.quell] to pyproject.toml
[tool.quell]
llm_provider = "anthropic"        # "anthropic" | "openai" | "ollama" | "none"
llm_model    = "claude-sonnet-4-5"
enable_docstring = true
enable_types     = true
enable_mutations = false          # mutmut integration (opt-in)
auto_write       = false
score_threshold  = 0.0

LLM API key — only needed for quell reproduce and LLM fallback:

export ANTHROPIC_API_KEY=sk-ant-...
# or
export OPENAI_API_KEY=sk-...
# or login via browser (no key needed)
quell auth login

Python SDK

from quell import Quell

q = Quell()

# Scan for gaps (no LLM)
result = q.check("src/")
print(f"Score: {result.score:.0%}{len(result.uncovered)} gaps")

# Reproduce a bug (uses LLM)
q.reproduce("payment accepts zero amount")

# Project score with badge
score = q.score()
print(f"Requirement coverage: {score.percentage}%")

Full SDK reference: quell.buildsbyshashank.tech/docs/sdk/overview


Development

git clone https://github.com/shashank7109/quelltest_lib.git
cd quelltest_lib
uv sync --dev

uv run pytest tests/ -v
uv run ruff check . --fix
uv run mypy quell/

# Run quelltest on itself
uv run quell check quell/ --no-llm

Links


License

MIT — see LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

quelltest-0.9.3.tar.gz (288.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

quelltest-0.9.3-py3-none-any.whl (106.1 kB view details)

Uploaded Python 3

File details

Details for the file quelltest-0.9.3.tar.gz.

File metadata

  • Download URL: quelltest-0.9.3.tar.gz
  • Upload date:
  • Size: 288.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for quelltest-0.9.3.tar.gz
Algorithm Hash digest
SHA256 7401128e4a412a4abae9ab493e733bb361ad6dc1b0b89db5320df0783b28c664
MD5 d5cf971c9b473827898f133464f9993f
BLAKE2b-256 b99c211c51f206dac345edce0516e6da5b7e962d56720c831767a1865666a5a1

See more details on using hashes here.

File details

Details for the file quelltest-0.9.3-py3-none-any.whl.

File metadata

  • Download URL: quelltest-0.9.3-py3-none-any.whl
  • Upload date:
  • Size: 106.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.9

File hashes

Hashes for quelltest-0.9.3-py3-none-any.whl
Algorithm Hash digest
SHA256 05daf0dbf83d442d20f58aaeb9475312982c2b497fefe711c950bca060973f6b
MD5 0ab61d7deeef8bf6d4f3f487f0da6777
BLAKE2b-256 98adc18597885b44b420a5640a3c88c3a42ac7834679927cc77a6cc947004756

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page