Skip to main content

Semantic security verification CLI for scanner alerts and AI-written code

Project description

CodeSight

Security CLI for scanner alerts, code review, and CI reports.

CodeSight started as a direct code scanner. Now the main lane is stronger: take alerts from Semgrep, CodeQL, or another SARIF tool, open the matching source files, collect evidence, and decide what deserves attention.

It still scans files and folders directly. The bigger value is verification: less noise, clearer proof, and reports that can go back into CI.

PyPI CI Python 3.10+ License: MIT Docs

What It Is Now

CodeSight has three useful paths:

  • guided terminal UI for people who do not want to remember commands
  • direct scan for one file, a folder, or a git diff
  • SARIF verify for scanner alerts that need real source context

The verify path is the important one.

Scanners are good at breadth. They find suspicious lines fast. CodeSight checks what is behind the alert:

  • where the input comes from
  • which sink or trust boundary it reaches
  • what guard is missing
  • why the verdict is exploitable, likely exploitable, uncertain, or dismissed
  • what fix would actually reduce risk

No evidence, no confident verdict.

Quick Start

Guided UI:

pip install codesight
codesight

From the repo:

python -m codesight

Pick a path in the menu:

  • scan a file
  • scan a folder
  • review a git diff
  • verify a SARIF file
  • build a proof bundle
  • run judge and skeptic mode

Direct commands still work:

codesight security src/app.py
codesight scan src --task security --output sarif > codesight.sarif
codesight diff --task security

Local model:

ollama serve
codesight security src/app.py --provider ollama

Verify Scanner Alerts

Run your scanner first:

semgrep scan --config auto --sarif > semgrep.sarif

Then let CodeSight import the alert and attach local source context:

codesight verify semgrep.sarif --source . --output markdown

Useful verify modes:

codesight verify semgrep.sarif --source . --preview-context
codesight verify semgrep.sarif --source . --fail-on likely_exploitable
codesight verify semgrep.sarif --source . --judge --skeptic --profile auto --provider openai
codesight verify semgrep.sarif --source . --artifact-dir .codesight-proof

Plain import mode is conservative. It keeps scanner alerts as uncertain. Judge mode can promote, downgrade, or dismiss alerts. Skeptic mode checks serious verdicts again before CI has to trust them.

Typical summary:

Blocked: 0 exploitable issue(s)
Likely exploitable: 0
Needs review: 42
Dismissed: 0

Try the local fixture:

codesight verify examples/semgrep-verify/semgrep.sarif \
  --source examples/semgrep-verify/project \
  --output markdown

Framework fixtures are in examples/framework-profiles.

Evidence Format

CodeSight uses structured verdicts:

  • exploitable
  • likely_exploitable
  • uncertain
  • probably_false_positive
  • not_exploitable

Example:

### CS-AUTH-001: Tenant isolation bypass

- Verdict: `exploitable`
- Severity: `high`
- Confidence: `high`
- Exploitability: `91/100`
- Location: `api/projects.py:88`
- CWE: `CWE-862`

#### Evidence

- Source: `request.path_params["org_id"]`
- Sink: `Project.query.filter_by(org_id=org_id)`
- Missing guard: no membership check before project lookup

#### Evidence path

1. `api/projects.py:82` - route accepts org_id from the request path
2. `api/projects.py:88` - query trusts org_id before checking membership

Commands

Core:

  • codesight
  • codesight ui
  • codesight security <file>
  • codesight scan <dir> --task security
  • codesight diff --task security
  • codesight verify <scanner.sarif> --source .
  • codesight benchmark

Secondary:

  • codesight review <file>
  • codesight bugs <file>
  • codesight docs <file>
  • codesight explain <file>
  • codesight refactor <file>

Providers

  • OpenAI: OPENAI_API_KEY
  • Anthropic: ANTHROPIC_API_KEY
  • Google Vertex AI: GOOGLE_CLOUD_PROJECT and ADC
  • Ollama: local ollama serve
  • OpenAI-compatible: custom label from codesight config

OpenAI-compatible presets include OpenRouter, Groq, Together AI, Mistral, xAI, Fireworks, DeepSeek, Perplexity, Cerebras, Cohere, and Azure AI Foundry.

Output

codesight security app.py --output markdown
codesight security app.py --output json
codesight security app.py --output sarif > codesight.sarif
codesight verify semgrep.sarif --source . --output sarif > verified.sarif

SARIF can be uploaded to GitHub code scanning.

Privacy

CodeSight does not need a hosted account or repo connection.

  • Ollama keeps analysis local.
  • BYOK providers use your own key.
  • Project config cannot set api_key, base_url, or default_provider.
  • Project config discovery is restricted to $HOME.
  • Large files can be compressed into code maps before prompting.

Cloud providers still receive the selected code context. Use Ollama when code must stay on the machine.

Benchmarks

The built-in benchmark is a smoke test: 10 vulnerable Python cases and 2 clean false-positive traps. It is useful for checking provider behavior and prompt drift. It is not a public claim that CodeSight is better than another scanner.

codesight benchmark --models gpt-5.4 llama3
codesight benchmark --json > benchmark-results.json

Public benchmark claims need the exact cases, commands, expected verdicts, raw results, model, provider, and run date.

GitHub Action

Verify scanner SARIF:

- run: |
    python -m pip install semgrep
    semgrep scan --config auto --sarif --output semgrep.sarif

- uses: AvixoSec/codesight@v0.3.1
  with:
    mode: verify
    path: .
    sarif-input: semgrep.sarif
    output: sarif
    fail-on: exploitable
    judge: "true"
    skeptic: "true"
    profile: auto

Direct scan:

- uses: AvixoSec/codesight@v0.3.1
  with:
    provider: openai
    api-key: ${{ secrets.OPENAI_API_KEY }}
    task: security
    path: .
    output: sarif

Development

git clone https://github.com/AvixoSec/codesight.git
cd codesight
pip install -e ".[dev]"
pytest tests -v
ruff check .

License

MIT. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

codesight-0.3.1.tar.gz (71.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

codesight-0.3.1-py3-none-any.whl (69.6 kB view details)

Uploaded Python 3

File details

Details for the file codesight-0.3.1.tar.gz.

File metadata

  • Download URL: codesight-0.3.1.tar.gz
  • Upload date:
  • Size: 71.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for codesight-0.3.1.tar.gz
Algorithm Hash digest
SHA256 887b7c971271fe05033b20c5fc4693c840e585196c713f2652169be44dc94015
MD5 2022723b1402bc4ae7267c574f110562
BLAKE2b-256 d534c9674b9bf8c2e0e62dc79e0a2f8a9f420852625998d03b4b8ce7debb2ee9

See more details on using hashes here.

Provenance

The following attestation bundles were made for codesight-0.3.1.tar.gz:

Publisher: publish.yml on AvixoSec/codesight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file codesight-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: codesight-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 69.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for codesight-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 efe57ea9b1a757caff9d73c6f9a45aa673fc69f0c142e6d877630bcfadcedc74
MD5 47c225d444563b97bf2977cabbdda5cb
BLAKE2b-256 1f55c679176df1d70e62cf87094f0269a58db462f4b835270ca02e7e4b19f311

See more details on using hashes here.

Provenance

The following attestation bundles were made for codesight-0.3.1-py3-none-any.whl:

Publisher: publish.yml on AvixoSec/codesight

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page