Local-first desktop SEO crawler

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

xseo

A local-first SEO crawler for your desktop and your terminal. Audit your site on your own machine — no cloud, no accounts, no data leaves your computer.

xseo crawls a website, extracts on-page SEO signals, detects common issues and content duplication, and shows the results in a clean desktop UI — or runs headless from the command line so you can pipe reports into other tools and gate CI on SEO regressions. Everything runs locally and persists to a single SQLite file under ~/.xseo/.

xseo crawling a site live

xseo issues view

Features

Live crawling with a real-time progress view and a threaded background worker that keeps the UI responsive.
Polite by default — respects robots.txt and applies a configurable per-request delay so you don't hammer the sites you audit.
On-page extraction of titles, meta descriptions, headings, canonicals, robots directives, internal/external links, and more via selectolax.
Issue detection for missing/duplicate titles and descriptions, thin content, heading problems, broken links, mixed content (http resources on https pages), missing Open Graph tags, missing JSON-LD structured data, non-self-referential hreflang, pages missing from the sitemap, and other common SEO defects.
Duplicate content detection through content hashing and grouped read models.
Sortable result tables for pages, issues, and duplicate groups, with a double-click page detail dialog.
Headless CLI (xseo crawl <url>) that runs the same engine without the GUI — JSON, CSV, HTML, and SARIF reports, an xseo diff to compare crawls, and a --fail-on exit code, so you can gate CI on SEO regressions.
GitHub Action that runs an audit on every push and uploads findings to the Security → Code scanning tab via SARIF.
CSV export for every result view, so you can pipe findings into spreadsheets or other tools.
Local persistence in SQLite at ~/.xseo/xseo.sqlite3. The last crawl is restored automatically on launch.
Clean architecture — domain, application, and adapter layers are strictly separated, with ports/adapters for HTTP, persistence, export, and the UI.

Screenshots

Configure a crawl, then watch progress stream in live:

Control	Progress

Review crawled pages, detected issues, and duplicate content groups:

Pages	Duplicates

Double-click any page for full detail — headings, links, redirects, and per-page issues:

Page detail dialog

Tech stack

Python 3.12+
PySide6 for the desktop UI
httpx for HTTP fetching
selectolax for fast HTML parsing
SQLite for local storage
pytest, hypothesis, and pytest-qt for unit, property-based, and UI tests

Install

Download a ready-to-run build (no Python needed)

Download the build for your OS, unzip it, and run the xseo executable inside:

OS	Download
macOS (Apple Silicon)	xseo-macos-arm64.zip
Windows (x64)	xseo-windows-x64.zip
Linux (x64)	xseo-linux-x64.zip

These links always point to the most recent release. You can also browse every build on the Releases page.

The builds are not code-signed yet, so the OS may warn you the first time:

macOS: right-click the app → Open → Open (or System Settings → Privacy & Security → Open Anyway).

Windows: on the SmartScreen prompt, click More info → Run anyway.

Install from PyPI (for Python users)

pipx install xseo   # isolated, recommended
# or
pip install xseo

Then launch with xseo-ui. Requires Python 3.12 or newer.

From source

python3 -m pip install -e '.[test]'

Run

Launch the desktop UI:

xseo-ui

Or from the source tree:

python3 -m xseo.ui.app

Enter a URL, click Start Crawl, and watch the progress tab fill in. When the crawl finishes, browse pages, issues, and duplicate groups in their respective tabs. Double-click any page row for full detail, or export any view to CSV.

Command line

xseo also runs headless — no GUI, scriptable, and CI-friendly. It uses the same crawl engine and the same SQLite store as the desktop app, so you can audit from the terminal and still open the results in the UI later.

xseo crawl https://example.com/

Crawling https://example.com/ …

Crawled 142 pages, found 38 issues

  HIGH       3  broken_internal_link
  MEDIUM    12  meta_description_missing
  LOW       22  thin_content

Write a report in whichever format fits your workflow, or pipe JSON straight to another tool:

xseo crawl https://example.com/ --out report.json        # full JSON report
xseo crawl https://example.com/ --out - | jq '.summary'  # JSON to stdout
xseo crawl https://example.com/ --format csv  --out issues.csv
xseo crawl https://example.com/ --format html --out report.html   # shareable, self-contained
xseo crawl https://example.com/ --format sarif --out report.sarif # GitHub code scanning

Use it as a build gate — --fail-on makes the command exit non-zero when an issue at or above the given severity is found, so a regression breaks CI:

xseo crawl https://example.com/ --fail-on high

Compare two crawls to see exactly what changed — which issues are new and which were fixed:

xseo crawl https://example.com/ --out before.json
# … ship some changes …
xseo crawl https://example.com/ --out after.json
xseo diff before.json after.json --fail-on-new medium

Other flags: --limit, --delay, --timeout, --no-robots, --all-hosts, --db. Run xseo crawl --help for the full list.

GitHub Action

Run an audit on every push and surface findings in the Security → Code scanning tab:

# .github/workflows/seo.yml
name: SEO audit
on: [push]
permissions:
  security-events: write   # required to upload SARIF
jobs:
  seo:
    runs-on: ubuntu-latest
    steps:
      - uses: yuripinto/xseo@v1
        id: xseo
        with:
          url: https://example.com/
          fail-on: high          # break the build on high-severity issues
      - uses: github/codeql-action/upload-sarif@v3
        if: always()
        with:
          sarif_file: ${{ steps.xseo.outputs.report }}

Verify

python3 -m compileall src tests
python3 -m pytest -q

The current suite has 193 tests covering domain logic, adapters, integration, property-based invariants, CLI behavior, report rendering, and UI smoke tests.

Project layout

src/xseo/
├── domain/         # entities, value objects, ports, validation, events
│   ├── crawler/    # frontier + crawl engine
│   ├── extraction/ # HTML extraction
│   ├── analysis/   # SEO issue detection
│   └── duplicates/ # content duplicate detection
├── application/    # services, commands, queries, read models
├── adapters/       # HTTP, persistence, export, background worker, event bridge
└── ui/             # PySide6 app, widgets, controller

Contributing

Contributions are welcome — see CONTRIBUTING.md for dev setup, how to run the checks (ruff + pytest), and the project conventions.

About

I built xseo because I needed it. I was starting a new project and wanted a fast way to scan it for SEO issues without uploading URLs to a third-party tool, paying for another subscription, or fighting a heavy web dashboard. I wanted something that ran on my desktop, was honest about what it found, and stored results in a file I owned — so I wrote it, and I'm sharing it in case it's useful to anyone else who wants a small, local, hackable SEO crawler.

This is an early prototype. It works end-to-end and I use it on my own projects, but expect rough edges. Issues and PRs are welcome.

Built by Yuri Silva — @yurisilvapi on X/Twitter.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

yurisilva

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.7.0

Jun 4, 2026

0.6.0

Jun 4, 2026

0.5.0

Jun 4, 2026

0.4.0

Jun 4, 2026

0.3.0

Jun 3, 2026

0.2.0

May 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xseo-0.7.0.tar.gz (68.3 kB view details)

Uploaded Jun 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

xseo-0.7.0-py3-none-any.whl (88.8 kB view details)

Uploaded Jun 4, 2026 Python 3

File details

Details for the file xseo-0.7.0.tar.gz.

File metadata

Download URL: xseo-0.7.0.tar.gz
Upload date: Jun 4, 2026
Size: 68.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for xseo-0.7.0.tar.gz
Algorithm	Hash digest
SHA256	`179c6df2226d2e430b5790acb961da66e7730f1225dac86cfb15a6d3d2bb4830`
MD5	`3a9f2d6626c79ee9426829f4cb97652d`
BLAKE2b-256	`63cf2cbcda1bc13c250aac77267fa827ad20c80364ff81a74dd77bcaeaae3741`

See more details on using hashes here.

Provenance

The following attestation bundles were made for xseo-0.7.0.tar.gz:

Publisher: release-please.yml on yuripinto/xseo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: xseo-0.7.0.tar.gz
- Subject digest: 179c6df2226d2e430b5790acb961da66e7730f1225dac86cfb15a6d3d2bb4830
- Sigstore transparency entry: 1724872777
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: yuripinto/xseo@d335452f683296c574249dbca7dc2e470985e61c
- Branch / Tag: refs/heads/main
- Owner: https://github.com/yuripinto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-please.yml@d335452f683296c574249dbca7dc2e470985e61c
- Trigger Event: push

File details

Details for the file xseo-0.7.0-py3-none-any.whl.

File metadata

Download URL: xseo-0.7.0-py3-none-any.whl
Upload date: Jun 4, 2026
Size: 88.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for xseo-0.7.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`76017e798eca5df92582f15d2aa26344d005a7e63a21dbebc6e32c2d9116a89e`
MD5	`a6b5f69f8402c4ecc6c9ad71eda7bee1`
BLAKE2b-256	`e07ed5192efa70312b2ad0dbf2d58dbb0bdd02c2c94c4098e6bdcf5745e7e62a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for xseo-0.7.0-py3-none-any.whl:

Publisher: release-please.yml on yuripinto/xseo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: xseo-0.7.0-py3-none-any.whl
- Subject digest: 76017e798eca5df92582f15d2aa26344d005a7e63a21dbebc6e32c2d9116a89e
- Sigstore transparency entry: 1724872852
- Sigstore integration time: Jun 4, 2026
Source repository:
- Permalink: yuripinto/xseo@d335452f683296c574249dbca7dc2e470985e61c
- Branch / Tag: refs/heads/main
- Owner: https://github.com/yuripinto
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release-please.yml@d335452f683296c574249dbca7dc2e470985e61c
- Trigger Event: push

xseo 0.7.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

xseo

Features

Screenshots

Tech stack

Install

Download a ready-to-run build (no Python needed)

Install from PyPI (for Python users)

From source

Run

Command line

GitHub Action

Verify

Project layout

Contributing

About

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance