Skip to main content

Python code quality tool with LLM-aware rules, plugin system, and enterprise features

Project description

prefact

PyPI version Python 3.10+ License: Apache-2.0 Code style: black

Automatic Python prefactoring toolkit — detect, fix, and validate common code issues introduced by LLMs and humans alike.

The Problem

img.png

When using LLMs for code generation, they often silently change import paths from absolute to deep relative:

# ❌ LLM introduces this
from ....llm.generator import generate_strategy
from ....loaders.yaml_loader import save_strategy_yaml

# ✅ You wanted this
from planfile.llm.generator import generate_strategy
from planfile.loaders.yaml_loader import save_strategy_yaml

prefact automatically detects, fixes, and validates such issues in a three-phase pipeline.

Features

Rule ID Auto-fix Description
Relative → Absolute imports relative-imports Converts from ....x import y to from pkg.x import y
Unused imports unused-imports Removes imports never referenced in the module
Duplicate imports duplicate-imports Removes the same name imported twice
Wildcard imports wildcard-imports 🔍 Flags from x import *
Unsorted imports sorted-imports 🔍 Flags import blocks not ordered stdlib→3rd-party→local
String concatenation string-concat 🔍 Flags "Hello " + name → suggests f-strings
Missing return types missing-return-type 🔍 Flags public functions without return type hints

✅ = auto-fix · 🔍 = scan-only (report)

Performance Improvements

  • Parallel Processing: Scans files in parallel when enabled
  • Smart Filtering: Automatically skips large files (>100KB) and empty files
  • Optimized Scanning: Excludes test directories and examples by default
  • Deduplication: Prevents duplicate tickets and TODO entries

Examples

The examples/ directory contains comprehensive examples for different use cases:

Example Description
sample-project Realistic project with all issues demonstrated
01-individual-rules Each rule explained with before/after code
02-multiple-rules Combining multiple rules for comprehensive cleanup
03-output-formats Console vs JSON output examples
04-custom-rules Writing your own prefactoring rules
05-ci-cd GitHub Actions, GitLab CI, Azure DevOps configs
06-api-usage Using prefact programmatically from Python

Quick Example

# Try the sample project
cd examples/sample-project
prefact scan --path . --config prefact.yaml
prefact fix --path . --config prefact.yaml

See examples/README.md for a detailed guide to all examples.

Installation

pip install -e .

# with dev dependencies (pytest)
pip install -e ".[dev]"

Quick Start

# Generate config file
prefact init

# List all available rules
prefact rules

# Scan only (no changes)
prefact scan --path ./my_project --package mypackage

# Fix + validate (with backups)
prefact fix --path ./my_project --package mypackage

# Dry-run (show what would change)
prefact fix --path ./my_project --package mypackage --dry-run

# Check a single file
prefact check ./my_project/src/mypackage/core/service.py --package mypackage

# JSON output for CI
prefact fix --path . --format json -o report.json

📚 Want to see prefact in action? Check out our comprehensive examples with real-world scenarios!

Pipeline Architecture

┌─────────┐      ┌─────────┐      ┌────────────┐
│  SCAN   │ ──→  │   FIX   │ ──→  │  VALIDATE  │
│         │      │         │      │            │
│ Detect  │      │ Apply   │      │ Syntax OK? │
│ issues  │      │ fixes   │      │ Regressions│
│ per rule│      │ + backup│      │ preserved? │
└─────────┘      └─────────┘      └────────────┘
  1. Scan — each rule walks the AST / CST and emits Issue objects
  2. Fix — rules with auto-fix transform the source (via libcst for formatting-safe changes)
  3. Validate — post-fix checks: syntax valid, no regressions, import counts preserved

Configuration

Create prefact.yaml (auto-generated via prefact init):

package_name: planfile

include:
  - "**/*.py"

exclude:
  - "**/venv/**"
  - "**/build/**"
  - "**/tests/**"
  - "**/test*/**"
  - "**/examples/**"

tools:
  parallel: true
  cache: true
  performance:
    max_workers: 4

rules:
  relative-imports:
    enabled: true
    severity: warning
  unused-imports:
    enabled: true
    severity: info
  duplicate-imports:
    enabled: true
  wildcard-imports:
    enabled: true
    severity: error
  sorted-imports:
    enabled: false
  string-concat:
    enabled: true
  missing-return-type:
    enabled: false

Autonomous Mode

Prefact includes an autonomous mode that automatically:

  • Scans your project for issues
  • Generates TODO.md with all found issues
  • Creates tickets in planfile.yaml for tracking
  • Updates CHANGELOG.md with fixes
# Run full autonomous workflow
prefact -a

# Or skip tests/examples for faster runs
prefact -a --skip-tests --skip-examples

Performance Improvements

Recent updates have significantly improved performance:

  • Parallel Processing: Scans files using multiple workers (configurable)
  • Smart Filtering: Skips large files (>100KB) and files with minimal content
  • Optimized Exclusions: Automatically excludes test directories and examples
  • Deduplication: Prevents duplicate tickets and TODO entries across runs

Python API

from pathlib import Path
from prefact.config import Config
from prefact.engine import RefactoringEngine

config = Config(
    project_root=Path("./my_project"),
    package_name="planfile",
    dry_run=False,
    backup=True,
)

engine = RefactoringEngine(config)
result = engine.run()

print(f"Found {result.total_issues} issues")
print(f"Fixed {result.total_fixed}")
print(f"All valid: {result.all_valid}")

Writing Custom Rules

Extend BaseRule and use the @register decorator:

from prefact.rules import BaseRule, register
from prefact.models import Issue, Fix, ValidationResult

@register
class MyCustomRule(BaseRule):
    rule_id = "my-custom-rule"
    description = "Does something useful."

    def scan_file(self, path, source):
        # Return list[Issue]
        ...

    def fix(self, path, source, issues):
        # Return (fixed_source, list[Fix])
        ...

    def validate(self, path, original, fixed):
        # Return ValidationResult
        ...

CI/CD Integration

# GitHub Actions
- name: prefact check
  run: |
    pip install ./prefact
    prefact scan --path . --format json -o prefact-report.json
    prefact fix --path . --dry-run

Running Tests

pip install -e ".[dev]"
pytest -v

License

Apache License 2.0 - see LICENSE for details.

Author

Created by Tom Sapletta - tom@sapletta.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

prefact-0.1.23.tar.gz (91.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

prefact-0.1.23-py3-none-any.whl (108.1 kB view details)

Uploaded Python 3

File details

Details for the file prefact-0.1.23.tar.gz.

File metadata

  • Download URL: prefact-0.1.23.tar.gz
  • Upload date:
  • Size: 91.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for prefact-0.1.23.tar.gz
Algorithm Hash digest
SHA256 20889070e5457bb39201efb75f073c1c4ad8af68307ab3b7397a0edbe14a2c07
MD5 e59c17419e4157ccbccf7e667a8c611f
BLAKE2b-256 234f61c84603fea14acdba84cd4cec11cdc207066405d9761bbf558184cbabcc

See more details on using hashes here.

File details

Details for the file prefact-0.1.23-py3-none-any.whl.

File metadata

  • Download URL: prefact-0.1.23-py3-none-any.whl
  • Upload date:
  • Size: 108.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for prefact-0.1.23-py3-none-any.whl
Algorithm Hash digest
SHA256 c90238b8831a4f59d191c3a3645c00ff3ac7ebab71b6be68137e03f578ad9672
MD5 1a66b55e2a055f2b0252f4324011417e
BLAKE2b-256 03310925afaed5d2c6844b30898a35b5c36f6d24c1bc1283d6e5363fc8df47e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page