AI-Generated Code Scanner — detects bugs, security vulnerabilities, and quality issues that AI coding assistants introduce
Project description
PyNeat: AI-Generated Code Cleaner
PyNeat 3.0.0 is a code scanning and cleanup tool built specifically for AI-generated code. Unlike generic linters, PyNeat targets the patterns that AI coding assistants systematically produce — phantom packages, hallucinated parameters, resource leaks, OWASP vulnerabilities, AI-specific security risks — and cleans them up automatically. Supports 9 languages.
What It Does
AI assistants are fast — but they generate code with predictable problems:
- Phantom imports — generic names like
utils,helpers,aithat don't exist - Fake parameters —
param1=x,fake=True,dummy_argthat do nothing - Resource leaks —
open()withoutwith,requestswithout timeout - Boundary errors —
list[0]without empty check,.split()[0]without validation - Redundant I/O — same API call 3+ times with identical arguments
- OWASP Top 10 — command injection, SQL injection, pickle RCE, weak crypto
- AI-specific risks — prompt injection, system prompt leakage, hallucinated API calls, tool call collisions
- Debug artifacts —
print(),pdb,console.logleft behind - Naming chaos — mixed camelCase/snake_case in the same file
- Identity comparisons —
is 200instead of== 200 - Type checks —
type(x) == listinstead ofisinstance(x, list)
PyNeat detects all of these and auto-fixes what it safely can.
Multi-Language Support
PyNeat handles 9 languages natively:
| Language | Auto-fix | Security scan |
|---|---|---|
| Python | ✅ | ✅ |
| JavaScript | ✅ | ✅ |
| TypeScript | ✅ | ✅ |
| Go | ✅ | ✅ |
| Java | ✅ | ✅ |
| Rust | ✅ | ✅ |
| C# | ✅ | ✅ |
| PHP | ✅ | ✅ |
| Ruby | ✅ | ✅ |
For maximum speed on large multi-language codebases, enable the Rust backend.
Quick Start
# Install
pip install pyneat
# Scan for issues
pyneat check your_file.py
# Clean AI-generated code patterns
pyneat clean your_file.py --dry-run --diff
# Auto-fix (with backup)
pyneat clean your_file.py --in-place --backup
For Python API usage and examples, see docs/quickstart.md.
3-Tier Package System
PyNeat uses three packages to balance safety vs. aggressiveness:
| Package | Use when |
|---|---|
safe (default) |
You want zero-risk fixes. Always-on rules that won't break code. |
conservative |
You want cleaner code. Adds unused import removal, f-string conversion, debug cleanup. |
destructive |
You want a full sweep. Aggressive refactoring — review changes before committing. |
Safe Package (Default)
Runs automatically, no flags needed:
| Rule | What it fixes |
|---|---|
IsNotNoneRule |
x != None → x is not None (PEP8) |
RangeLenRule |
range(len()) anti-pattern |
TypingRule |
Missing type annotations |
CodeQualityRule |
Magic numbers, empty except blocks |
PerformanceRule |
Inefficient loops |
SecurityScannerRule |
os.system, pickle, secrets, command injection, weak crypto |
Conservative Package
pyneat clean your_file.py --package conservative
Adds: unused import removal, .format() → f-string, @dataclass suggestions, magic number detection, safe debug cleanup (--safe-debug-clean).
Destructive Package
pyneat clean your_file.py --package destructive
Adds: import rewriting/reordering, naming convention enforcement (PascalCase), nested if flattening (Arrow Anti-pattern), TODO/FIXME removal, redundant expression simplification, dead code removal, --aggressive-clean (strip ALL print() calls), --enable-all for all rules at once.
Security Scanning
SecurityScannerRule runs in all packages automatically.
Core Security Rules (SEC-001 to SEC-060)
| Vulnerability | Auto-fix |
|---|---|
yaml.load() without Loader |
→ SafeLoader |
Empty except: pass |
→ raise |
Command injection (os.system, subprocess shell=True) |
Warning |
| SQL injection (string concatenation) | Warning |
pickle.loads() (RCE risk) |
Warning |
eval/exec dynamic execution |
Warning |
Weak crypto (random for tokens, md5/sha1) |
Warning |
Hardcoded secrets (api_key, password) |
Warning |
Template injection (render_template_string) |
Warning |
Path traversal (open() with user input) |
Warning |
| XXE (unsafe XML parsing) | Warning |
Debug mode (DEBUG=True) |
Warning |
| LDAP injection | Warning |
| SSRF / Open redirect | Warning |
| CORS misconfiguration | Warning |
NEW Security Rules (SEC-061 to SEC-072)
| Rule ID | Vulnerability | Severity | Description |
|---|---|---|---|
| SEC-061 | Missing Subresource Integrity (SRI) | Medium | External <script>/<link> without integrity attribute |
| SEC-062 | Missing Content-Type Validation | High | File upload without Content-Type verification |
| SEC-063 | Missing Rate Limiting | Medium | Sensitive endpoints without rate limiting |
| SEC-064 | Weak JWT Secret Key | Critical | Weak or hardcoded JWT secret |
| SEC-065 | Incomplete Session Destruction | Medium | Logout without full session cleanup |
| SEC-066 | Timing Attack Vulnerability | Medium | == used instead of timing-safe comparison |
| SEC-067 | Weak Server-side Validation | High | Only client-side validation, no server check |
| SEC-068 | Client-side Price Calculation | High | Price calculated on client sent to server |
| SEC-069 | Dangerous Dependencies | Medium | Outdated or vulnerable package versions |
| SEC-070 | Missing Docker Vulnerability Scan | Medium | Docker image without vulnerability scanning |
| SEC-071 | Sensitive Data in JWT | High | JWT payload contains sensitive data |
| SEC-072 | Missing CSP Nonce | Medium | Inline <script> without CSP nonce |
Extended Security Rules (SEC-073 to SEC-105+)
33 additional rules organized by OWASP Top 10 2021:
| Category | Rules | Description |
|---|---|---|
| A01: Broken Access Control | SEC-073 to SEC-075 | IDOR, privilege escalation |
| A02: Cryptographic Failures | SEC-076 to SEC-078 | Weak hash, ECB mode, hardcoded keys |
| A03: Injection | SEC-079 to SEC-082 | LDAP, XPath, SSTI, command injection |
| A05: Security Misconfiguration | SEC-083 to SEC-084 | Debug mode, CORS |
| A07: Authentication Failures | SEC-085 to SEC-086 | Weak password, brute force |
| A08: Software Integrity | SEC-087 to SEC-088 | Insecure deserialization, HTTP without TLS |
| A09: Security Logging | SEC-089 | Sensitive info in logs |
| A10: SSRF | SEC-090 | Server-side request forgery |
| Additional | SEC-091 to SEC-105 | XXE, path traversal, race condition, ReDoS, etc. |
Run pyneat check your_file.py --severity --cvss for detailed scan with CVSS scores and CWE/OWASP references.
AI Security Scanner (NEW)
Detects security risks specific to AI-generated code and AI applications:
| AI Vulnerability | Severity | Rule | Description |
|---|---|---|---|
| Prompt Injection | Critical | AI-010 | "Ignore previous instructions", "forget everything" |
| Context Confusion | Medium | AI-011 | Multi-turn conversation context confusion attacks |
| Proxy Injection | High | AI-012 | Tool call injection in AI agents |
| Missing Confidence Threshold | Medium | AI-020 | LLM output without confidence checking |
| Missing Fact Check | High | AI-021 | No fact verification for AI-generated content |
| Unguarded Sensitive Operation | High | AI-022 | Sensitive operations without guardrails |
| Verbose Error Exposure | Medium | AI-030 | Detailed errors exposing model internals |
| Missing API Rate Limit | Medium | AI-031 | AI API calls without rate limiting |
| Over-detailed System Info | Medium | AI-032 | Excessive system information in responses |
| Adversarial Input | Critical | AI-040 | Homoglyph attacks, injection patterns |
| Unicode Homograph Attack | Medium | AI-041 | Unicode confusable characters in AI inputs |
| System Prompt Leakage | High | AI-050 | Exposed system prompts in responses |
| Tool Call Collision | Medium | AI-051 | Conflicting tool names in AI agents |
| Missing Output Guardrails | High | AI-052 | AI without content filtering guardrails |
| Toxic Output Risk | Medium | AI-053 | Potentially harmful AI-generated content |
| Temperature Misuse | Low | AI-060 | Unsafe temperature parameter settings |
| Context Window Mismanagement | Medium | AI-061 | Context overflow handling issues |
| Hallucinated API Calls | High | AI-070 | Non-existent API endpoints in generated code |
Rust Backend
For large codebases, the Rust scanner (pyneat-rs) delivers 50x-100x speedup:
pip install pyneat[rust]
pyneat clean your_file.py --rust
Uses tree-sitter for AST parsing, pre-compiled regex patterns, and Rayon for parallel processing. No GIL contention for true parallelism.
Rust Backend Features
- LN-AST (Language-Neutral AST): Unified AST format for all 9 languages
- 191 Rules: 71 core + 120 language-specific rules
- Auto-fix Engine: Atomic, conflict-aware code transformations
- SARIF 2.1.0 Export: Full compliance with GitHub Security Lab format
- Python Bindings: PyO3 integration for seamless Python usage
- LSP Server: Real-time IDE diagnostics via Language Server Protocol
- CI/CD Integrations: GitHub, GitLab, SonarQube native support
Installation
pip install pyneat-cli
Or from source:
git clone https://github.com/khanhnam-nathan/Pyneat.git
cd Pyneat
pip install -e .
CLI Reference
PyNeat exposes 8 commands:
| Command | Description |
|---|---|
pyneat clean |
Clean a single file |
pyneat clean-dir |
Clean all files in a directory |
pyneat check |
Security scan (no auto-fix) |
pyneat rules |
List all available rules |
pyneat explain |
Detailed explanation of a rule (CWE, OWASP, fix steps) |
pyneat ignore |
Ignore a rule (per-file or globally) |
pyneat report |
Export security report (JSON/SARIF/HTML) |
pyneat security-db |
Manage CVE and GitHub Advisory databases |
Additional flags:
| Flag | Description |
|---|---|
--enable-all |
Enable all rules at once (destructive package) |
--export-manifest |
Auto-export PYNAGENT manifest on exit |
--dry-run |
Preview changes without writing |
--diff |
Show diff before applying |
--backup |
Backup file before modifying |
--in-place |
Modify file directly |
--fail-on |
Exit with error on specific severity threshold |
--baseline |
Ignore known issues from baseline file |
--parallel |
Number of parallel threads |
Clean a single file
# Safe package (default) — zero risk
pyneat clean your_file.py
# Preview without writing
pyneat clean your_file.py --dry-run --diff
# In-place with backup
pyneat clean your_file.py --in-place --backup
# Conservative — cleaner code
pyneat clean your_file.py --package conservative
# Destructive — full sweep
pyneat clean your_file.py --package destructive
Clean a directory
pyneat clean-dir ./src --dry-run --diff
pyneat clean-dir ./src --pattern "*.py" --in-place --backup --parallel
Security scan
pyneat check your_file.py --severity --cvss
pyneat check ./src --fail-on critical --format sarif --output report.sarif
Explain a rule
pyneat explain SEC-001
Shows: problem description, fix constraints, common mistakes, verification steps, documentation links.
Ignore a rule
# Ignore one instance at specific file + line
pyneat ignore SEC-003 --file app.py --line 42 --reason "already sanitized"
# Ignore globally for entire project
pyneat ignore SEC-003 --global --reason "not applicable to our codebase"
Export report
pyneat report ./src -f sarif -o security.sarif # GitHub Code Scanning
pyneat report ./src -f json -o report.json # Custom integration
pyneat report ./src -f html -o report.html # Human-readable
pyneat report ./src -f codeclimate -o cc.json # Code Climate
pyneat report ./src -f junit -o junit.xml # JUnit XML
Manage security databases
pyneat security-db --status # Show CVE/GHSA database status
pyneat security-db --update # Update to latest CVE + GitHub Advisory
pyneat security-db --force # Force update (ignore cache age)
Interactive Feature Menu
After every check, clean, rules, or report, PyNeat shows a smart feature menu:
┌─────────────────────────────────────────────────────────────┐
│ EXPLORE MORE FEATURES │
└─────────────────────────────────────────────────────────────┘
[A] 🔒 Security Check
Quét lỗ hổng: SQL injection, path traversal, hardcoded secrets...
→ pyneat check file.py
[B] 🧹 Clean Code
Thêm type hints, xóa unused imports, số magic, debug prints...
→ pyneat clean file.py
[C] 📖 Explain Rule
Nguyên nhân, cách fix, CWE/OWASP, verification steps...
→ pyneat explain SEC-001
[D] 📊 Export Report (JSON/SARIF)
Tích hợp CI/CD: GitHub Code Scanning, GitLab SAST...
→ pyneat report . -f sarif -o security.sarif
[q] Exit - return to terminal
[Enter] Skip this menu
Python API
from pyneat import clean_code, clean_file, analyze_code
from pyneat import RuleEngine, CodeFile, RuleConfig
# Clean code string
result = clean_code("x == None") # "x is not None"
# Clean a file
from pathlib import Path
result = clean_file(Path("app.py"), in_place=True)
print(f"Made {len(result.changes_made)} changes")
# Analyze without fixing
report = analyze_code("x == None; print('debug')")
for issue in report['issues']:
print(f" - {issue}")
Python API — Custom engine
from pyneat import RuleEngine, CodeFile
from pyneat.rules import IsNotNoneRule, DebugCleaner
engine = RuleEngine([
IsNotNoneRule(),
DebugCleaner(mode="safe"),
])
result = engine.process_code_file(CodeFile(path=Path("demo.py"), content=source))
Configuration
Add to pyproject.toml:
[tool.pyneat]
package = "safe" # safe, conservative, destructive
# Conservative
enable_unused_imports = true
enable_fstring = false
enable_dataclass = false
enable_magic_numbers = false
debug_clean_mode = "off" # off, safe, aggressive
# Destructive (caution!)
enable_import_cleaning = false
enable_naming = false
enable_refactoring = false
enable_comment_clean = false
enable_redundant = false
enable_dead_code = false
enable_match_case = false
# CI/CD
export_manifest = false
Pre-commit Integration
repos:
- repo: local
hooks:
- id: pyneat-clean
name: PyNeat AI Code Cleaner
entry: pyneat clean --package conservative --in-place
language: system
types: [python]
pass_filenames: true
args: ['--dry-run']
# Linux/macOS
bash scripts/setup-pre-commit.sh
# Windows
scripts\setup-pre-commit.bat
GitHub Actions
name: PyNeat Code Quality
on: [push, pull_request]
jobs:
pyneat:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v4
- uses: actions/setup-python@v5
- name: Install PyNeat
run: pip install pyneat
- name: Run PyNeat
run: pyneat clean-dir . --dry-run
Full template at .github/workflows/ci.yml.
Manifest Export — CI/CD Integration
ManifestExporterwrites.pyneat.manifest.jsonwith all markersexport_to_sarif()— SARIF 2.1.0 format (GitHub Security, Azure DevOps)export_to_codeclimate()— Code Climate formatexport_to_markdown()— Human-readable report
MarkerCleanup — Stale Marker Removal
MarkerCleanupclass removes markers after issues are fixedremove_stale_markers()— only removes markers not in remaining_issuesremove_all_markers()— strips all PYNAGENT comments
VSCode Extension
PyNeat is available as a VSCode/Cursor extension:
- Real-time diagnostics for Python, JavaScript, TypeScript
- Quick Fix — auto-fix with one click
- Hover info — severity, CWE, fix constraints, verification steps
- Context menu — Apply Fix, Send to AI Agent, Ignore, Add Comment
- Save-triggered scan — runs automatically when you save
Install from .vsix or search the marketplace (coming soon).
Examples
Check out the examples/ directory for ready-to-use scripts:
| Example | Description |
|---|---|
| basic_usage.py | Scan and clean a single file |
| security_scan.py | Security scanning with SARIF export |
| batch_processing.py | Process entire projects |
| custom_rule.py | Create and use custom rules |
| pre_commit_integration.py | Integrate with pre-commit hooks |
Run an example:
python examples/basic_usage.py
Documentation
| Document | Description |
|---|---|
| docs/quickstart.md | 5-minute getting started guide |
| docs/faq.md | Frequently asked questions |
| docs/architecture.md | Technical architecture |
| docs/writing-rules.md | Creating custom rules |
| docs/github-actions-guide.md | CI/CD integration guide |
| CONTRIBUTING.md | Contribution guidelines |
| CODE_OF_CONDUCT.md | Community code of conduct |
Development
# Install dev dependencies
pip install -e ".[dev]"
# Run tests
pytest tests/
# Build distribution
python -m build
Architecture: 7-Layer Protection System
| Layer | Component | Description |
|---|---|---|
| 1 | AST Guard | Validates code structure before processing |
| 2 | Semantic Guard | Preserves code semantics during transformations |
| 3 | Type Shield | Prevents type-related regressions |
| 4 | Atomic Operations | Ensures atomic transformations |
| 5 | Scope Guard | Isolates changes within safe boundaries |
| 6 | Type Checking | Validates with mypy/pyright |
| 7 | Fuzz Testing | Stress tests with malformed inputs |
Editions & Commercial Support
PyNeat uses a dual-licensing / freemium model.
PyNeat Community (current, free)
- License: GNU AGPLv3
- Engine: Pure Python + Rust hybrid (
pyneat-rs) - Best for: Individual developers, students, small projects
- Rust coverage: ~30% of rules (security + quality)
PyNeat Standard (on request)
- Engine: Full Rust (
pyneat-rs) for extreme performance - Features: Multi-threading, 50x-100x faster, deep CI/CD integration
- Best for: Mid-sized teams, 1,000+ files
PyNeat Enterprise (on request)
- Features: Everything in Standard + Custom Ruleset API, Audit Reports, Dedicated SLA
- Best for: Large enterprises
Commercial License Exemption: If you cannot comply with AGPLv3 (e.g., proprietary SaaS, closed-source embedding), contact the author for a commercial license.
Contact: khanhnam.copywriting@gmail.com
License
PyNeat is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.
PyNeat is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License along with this program. If not, see https://www.gnu.org/licenses/.
AGPLv3 with Commercial Exception: Commercial use of this software (e.g., bundling in paid products, SaaS services) is permitted, provided that you comply with the open source obligations under AGPLv3 §11. Contact the author for alternative licensing arrangements.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pyneat_cli-3.1.0.tar.gz.
File metadata
- Download URL: pyneat_cli-3.1.0.tar.gz
- Upload date:
- Size: 861.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b943ba725a19e18ff685d45f5523c7b79e985aef16ea6a19294f0fe507100fb9
|
|
| MD5 |
805f1b2653f9f97f91395162f354f21f
|
|
| BLAKE2b-256 |
0a3030f9c347c2298d1f77c6b867d46185445de0aeeb29d83eab9a7334c1bfb6
|
File details
Details for the file pyneat_cli-3.1.0-cp312-cp312-win_amd64.whl.
File metadata
- Download URL: pyneat_cli-3.1.0-cp312-cp312-win_amd64.whl
- Upload date:
- Size: 2.2 MB
- Tags: CPython 3.12, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cd33b52a2ff55e16065a96dc5f62740a2aee71d0c49d477b1e8eca484a3a74bb
|
|
| MD5 |
8459436699cfe0b9899ec458b0c6cc05
|
|
| BLAKE2b-256 |
cf0b716c3a7c8263e85b4f164759b9b4ca9281703c7f15045aed8e4419851a53
|