Skip to main content

Graph-guided, runtime-proven, LLM-assisted PR test generation for Python projects

Project description

SoftGNN Advisor

Graph-guided, runtime-proven, LLM-assisted PR testing

Know what changed. Know what tests hit it. Generate what is missing.

Tests Release License: MIT Python LLM


Install

Recommended for CLI use:

pipx install softgnn-advisor

Or install into your current environment:

pip install softgnn-advisor

Optional extras:

pip install "softgnn-advisor[llm]"   # Gemini / OpenAI-compatible generation
pip install "softgnn-advisor[gnn]"   # PyTorch Geometric ranking
pip install "softgnn-advisor[all]"   # full stack

Then run:

softgnn setup /path/to/your-repo --project my-app
softgnn generate --project my-app

For local development from source:

git clone https://github.com/minhquang0407/softgnn-advisor.git
cd softgnn-advisor
python -m venv .venv
.venv\Scripts\activate  # Windows
pip install -e ".[all]"
softgnn --help

What is SoftGNN?

SoftGNN Advisor is an experimental CLI that combines a code graph, runtime test graph, and LLM test generation to help you understand PR impact and generate missing pytest tests.

Most AI testing tools stop at:

read changed file -> ask LLM for tests -> run pytest

SoftGNN aims for a stronger loop:

scan PR -> find impacted code -> map tests that actually execute it -> generate missing tests -> verify -> refresh runtime proof

Core thesis: a generated test is not truly useful until it passes pytest and proves it hits the intended code.


The pipeline

flowchart LR
    A[PR Diff] --> B[Code Graph]
    B --> C[Impact Scan]
    C --> D[Missing Runtime Coverage]
    D --> E[LLM Semantic Test Generation]
    E --> F[Schema + Safety Validation]
    F --> G[Transactional Patch]
    G --> H[Pytest Verify]
    H --> I[Runtime Test Mapping]
    I --> J[PR Scan Confirmation]

    H -- failure --> K[LLM Repair]
    K --> F

Why it is different

Capability Naive LLM test generation SoftGNN Advisor
Reads changed code
Generates pytest tests
Validates structured LLM output
Patches transactionally
Rolls back failed generated tests
Maps tests to functions at runtime
Confirms PR coverage after generation
Supports Gemini/OpenAI-compatible LLMs varies

Features

  • PR impact scanning between Git revisions.
  • Code graph extraction from Python source files.
  • Runtime test mapping from pytest execution to source functions.
  • Missing runtime coverage detection for impacted functions.
  • LLM-assisted semantic pytest generation.
  • Native Gemini provider and OpenAI-compatible provider.
  • Structured JSON validation before writing tests.
  • Safety validation against unsafe generated code patterns.
  • Transactional patching with generated block markers.
  • Pytest verification and bounded generated-test repair loop.
  • Runtime refresh after successful generation.
  • PR scan confirmation after runtime refresh.

Verified demo

On a local social-link-prediction repo, SoftGNN used Gemini to generate behavior tests for:

FUNC:is_edge_index_sorted

Result:

pytest: 6 passed
runtime mode: per-test
runtime edges: 336
persisted: True
missing coverage before: 0
missing coverage after: 0

Fallback without an LLM produced only a shallow smoke test:

assert callable(is_edge_index_sorted)

Gemini-assisted generation produced behavior checks for sorted edges, unsorted source order, unsorted target order, single-edge input, and invalid-shape errors.

Read the full demo: docs/examples/social-link-demo.md


Install

git clone https://github.com/minhquang0407/softgnn-advisor.git
cd softgnn-advisor
python -m venv .venv

Windows PowerShell:

.\.venv\Scripts\Activate.ps1
pip install -r requirements.txt

Linux/macOS:

source .venv/bin/activate
pip install -r requirements.txt

PyTorch / PyTorch Geometric installs can be platform-specific. If installation fails, follow the official PyTorch and PyG installation guides for your environment.


Configure an LLM provider

Gemini

$env:SOFTGNN_LLM_PROVIDER="gemini"
$env:SOFTGNN_LLM_MODEL="gemini-3-flash"
$env:SOFTGNN_LLM_API_KEY="YOUR_GEMINI_API_KEY"

If your API account uses another model ID:

$env:SOFTGNN_LLM_MODEL="gemini-2.5-flash"

OpenAI-compatible endpoint

$env:SOFTGNN_LLM_PROVIDER="openai-compatible"
$env:SOFTGNN_LLM_BASE_URL="http://localhost:11434/v1"
$env:SOFTGNN_LLM_MODEL="qwen2.5-coder:7b"
$env:SOFTGNN_LLM_API_KEY="optional-if-your-endpoint-needs-it"

Generation strategies:

template  -> deterministic templates only
llm       -> require configured LLM unless fallback is allowed
auto      -> try LLM first, fallback to templates when unavailable

Quickstart

After setup, use generate for the full beginner workflow:

softgnn setup C:\repo\my-app --project my-app
softgnn generate --project my-app

generate is the one-shot command:

plan -> save latest_plan.json -> apply saved plan

During apply, SoftGNN:

writes generated tests under tests/
runs pytest with streaming output
repairs failing generated tests
keeps passing generated tests
rolls back failing generated tests
refreshes runtime coverage for kept tests
saves apply feedback to ~/.softgnn/<project>/apply_runs/<run_id>/result.json

By default, generate also replans failed/rolled-back targets once:

plan -> apply -> replan failed targets with apply feedback -> apply retry plan

To disable the extra replanning pass and save LLM tokens:

softgnn generate --project my-app --replan-iters 0

Want to review proposed tests before patching? Use plan then apply:

softgnn setup C:\repo\my-app --project my-app
softgnn plan --project my-app    # generate + save a reusable plan, no writes
softgnn apply --project my-app   # load saved plan, write tests, verify, rollback/map

apply is intentionally pure: it does not generate fresh tests when no saved plan exists. If there is no saved plan, run plan first or use generate.

Template-only generation, without an LLM:

softgnn generate --project my-app --no-llm

Advanced commands are still available (prepare, pr-scan, generate-tests, test-map).

Mental model:

setup/prepare need the repo path once
plan decides what to write
apply executes an existing plan and owns rollback
generate runs plan + apply, with one default replan attempt

Daily commands after setup:

Goal Command
One-shot plan + apply + verify softgnn generate --project my-app
One-shot without replan softgnn generate --project my-app --replan-iters 0
Review before patching softgnn plan --project my-app
Apply reviewed plan softgnn apply --project my-app
Inspect change impact softgnn scan --project my-app
Runtime test map softgnn map --project my-app
Health check softgnn doctor --project my-app
Impact of one symbol softgnn impact --project my-app FUNC:foo
Developer triage softgnn triage --project my-app "bug description"

More details: docs/quickstart.md

The full guide covers:

simple CLI workflow
plan cache and apply-from-plan
one-shot generate workflow
Git PR workflow
no-Git filesystem snapshot workflow
first-run full-scan workflow
explicit target workflow
runtime coverage mapping
patch/verify/repair/partial-rollback flow
apply feedback and failed-target replanning

Safety model

SoftGNN is conservative by default:

writes tests/ only
wraps generated code in markers
validates LLM output before patching
runs pytest before accepting generated tests
rolls back failed generated edits by default
never requires committing API keys

Generated test blocks are marked:

# <softgnn-generated target="FUNC:example" start>
...
# <softgnn-generated target="FUNC:example" end>

Recommended workflow:

run on a feature branch
start with --mode plan
use --mode patch after review
inspect git diff before commit

CLI highlights

python softgnn.py pr-scan --project social-link --repo-path "C:\path\to\repo" --base main --head HEAD
python softgnn.py test-map --project social-link --repo-path "C:\path\to\repo" --mode per-test --persist
python softgnn.py generate-tests --project social-link --repo-path "C:\path\to\repo" --mode plan --generation-strategy auto

Project status

Current release: v0.1.15

This is a developer preview. Generated tests should be reviewed before commit. Production-code fixes are intentionally out of scope for v0.1.


Roadmap

Short version:

M4  Runtime-Proven Test Generation
M5  Graph Impact Report / Dashboard
M6  Learned Test Prioritization / GNN Ranking
M7  Multi-Agent Quality Swarm
M8  Large-scale repo automation
M9  Controlled production-code fixes

Read more:


License

MIT License. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

softgnn_advisor-0.1.15.tar.gz (90.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

softgnn_advisor-0.1.15-py3-none-any.whl (92.2 kB view details)

Uploaded Python 3

File details

Details for the file softgnn_advisor-0.1.15.tar.gz.

File metadata

  • Download URL: softgnn_advisor-0.1.15.tar.gz
  • Upload date:
  • Size: 90.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for softgnn_advisor-0.1.15.tar.gz
Algorithm Hash digest
SHA256 a7c686fe6c2151a3faca295c905dc9c123a5e2e944d305b5f2ae93bdfa947341
MD5 56a3a623f7829c6190d42ed34ed587c5
BLAKE2b-256 5274b4705bd8d1a7d4ab03c7413ed503b0b4cb2a5dbe17e7fef0e16566c35b95

See more details on using hashes here.

File details

Details for the file softgnn_advisor-0.1.15-py3-none-any.whl.

File metadata

File hashes

Hashes for softgnn_advisor-0.1.15-py3-none-any.whl
Algorithm Hash digest
SHA256 ac827626355f2912ea138330d242c24850bafdd91ecec4e6158fd1e5176009bb
MD5 0a3af4a52d820d46c6a3168025a7a6b8
BLAKE2b-256 82612402ef155f143f15b7397f3959da766eba04c11a906daec026778368a9d2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page