worai

Command-line toolkit for WordLift operations and SEO checks. Pronunciation: "waw-RYE".

Docs: https://docs.wordlift.io/worai/

Install

  • pipx install worai
  • pip install worai

If you plan to run seocheck, install Playwright browsers:

  • playwright install chromium

Quick Start

  • worai --help
  • worai seocheck https://example.com/sitemap.xml
  • worai google-search-console --site sc-domain:example.com --client-secrets ./client_secrets.json
  • worai <command> --help

Configuration

Config file (TOML) discovery order, highest priority first:

  • --config
  • WORAI_CONFIG
  • ./worai.toml
  • ~/.config/worai/config.toml
  • ~/.worai.toml
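
The discovery order above can be sketched in Python. This is only an illustration, not worai's actual code: the function names are hypothetical, and it assumes that the first candidate that exists on disk is used. The real tool may instead report an error when an explicit --config path is missing.

```python
from pathlib import Path
from typing import Mapping, Optional


def candidate_config_paths(
    cli_config: Optional[str],
    env: Mapping[str, str],
    cwd: Path,
    home: Path,
) -> list[Path]:
    """Return candidate config paths, highest priority first."""
    candidates: list[Path] = []
    if cli_config:  # 1. value passed via --config
        candidates.append(Path(cli_config))
    if env.get("WORAI_CONFIG"):  # 2. WORAI_CONFIG environment variable
        candidates.append(Path(env["WORAI_CONFIG"]))
    candidates.append(cwd / "worai.toml")  # 3. ./worai.toml
    candidates.append(home / ".config" / "worai" / "config.toml")  # 4.
    candidates.append(home / ".worai.toml")  # 5. ~/.worai.toml
    return candidates


def find_config(
    cli_config: Optional[str],
    env: Mapping[str, str],
    cwd: Path,
    home: Path,
) -> Optional[Path]:
    """Return the first candidate that is an existing file, or None."""
    for path in candidate_config_paths(cli_config, env, cwd, home):
        resolved = path.expanduser()  # expand a leading "~" if present
        if resolved.is_file():
            return resolved
    return None
```

A typical call would be find_config(None, os.environ, Path.cwd(), Path.home()), with os imported by the caller.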

Profiles:

  • [profile.<name>] with --profile or WORAI_PROFILE

Common keys:

  • wordlift.api_key
  • gsc.client_secrets
  • gsc.token

Supported environment variables:

  • WORAI_CONFIG — path to a config TOML file (overrides discovery order).
  • WORAI_PROFILE — profile name under [profile.<name>].
  • WORAI_LOG_LEVEL — default log level (debug|info|warning|error).
  • WORAI_LOG_FORMAT — default log format (text|json).
  • WORDLIFT_KEY — WordLift API key for entity operations.
  • WORDLIFT_API_KEY — alternate WordLift API key name (also accepted by some commands).
  • GSC_CLIENT_SECRETS — path to OAuth client secrets JSON for GSC.
  • GSC_TOKEN — path to store the OAuth token.
  • GSC_OUTPUT — default output CSV path for GSC export.
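
Because two variable names are accepted for the WordLift API key, a wrapper script may want to resolve the key itself before calling worai. The sketch below is hypothetical: the function name is made up, and the precedence (WORDLIFT_KEY, then WORDLIFT_API_KEY, then the wordlift.api_key config value) is an assumption rather than documented behaviour.

```python
from typing import Any, Mapping, Optional


def resolve_api_key(
    env: Mapping[str, str],
    config: Mapping[str, Any],
) -> Optional[str]:
    """Pick an API key from the environment, falling back to the config."""
    # Assumed precedence: environment variables before the config file.
    for name in ("WORDLIFT_KEY", "WORDLIFT_API_KEY"):
        value = env.get(name, "").strip()
        if value:
            return value
    # Config fallback: the [wordlift] table, key "api_key".
    value = config.get("wordlift", {}).get("api_key")
    return value or None
```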

Example environment setup:

export WORDLIFT_KEY="wl_..."
export WORAI_CONFIG="$HOME/worai.toml"
export WORAI_PROFILE="dev"
export GSC_CLIENT_SECRETS="$HOME/client_secrets.json"

Example worai.toml:

[defaults]
log_level = "info"

[wordlift]
api_key = "wl_..."

[gsc]
client_secrets = "/path/to/client_secrets.json"
token = "/path/to/gsc_token.json"

Commands

  • seocheck — run SEO checks against sitemap URLs.
  • google-search-console — export GSC page metrics to CSV.
  • dedupe — deduplicate WordLift entities by schema:url.
  • canonicalize-duplicate-pages — choose canonical URLs using GSC KPIs.
  • delete-entities-from-csv — delete entities listed in a CSV.
  • find-faq-page-wrong-type — find/patch FAQPage type issues.
  • find-missing-names — list pages missing schema:name/headline.
  • find-url-by-type — extract schema:url by type from RDF.
  • link-groups — build/apply LinkGroup data from CSV.
  • patch — patch entities from RDF.
  • structured-data — generate JSON-LD/YARRRML mappings or materialize RDF from YARRRML.
  • validate — validate RDF against SHACL shapes.
  • upload-entities-from-turtle — upload .ttl files with resume.

Command help:

  • worai <command> --help

Autocompletion:

  • worai --install-completion
  • worai --show-completion

Examples

seocheck

  • worai seocheck https://example.com/sitemap.xml
  • worai seocheck https://example.com/sitemap.xml --output-dir ./seocheck-report --save-html
  • worai seocheck https://example.com/sitemap.xml --output-dir ./seocheck-report --no-open-report
  • worai seocheck https://example.com/sitemap.xml --user-agent "Mozilla/5.0 ..."
  • worai seocheck https://example.com/sitemap.xml --sitemap-fetch-mode browser
  • worai seocheck https://example.com/sitemap.xml --no-report-ui
  • worai seocheck https://example.com/sitemap.xml --recheck-failed --recheck-from ./seocheck-report
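
For scheduled or CI runs, the commands above can be assembled from Python. This is a minimal sketch with hypothetical helper names; it uses only the --output-dir and --no-open-report flags shown above, and it makes no assumption about what worai's exit code means beyond passing it back to the caller.

```python
import subprocess
from typing import Optional


def seocheck_command(
    sitemap_url: str,
    output_dir: Optional[str] = None,
    open_report: bool = True,
) -> list[str]:
    """Build the argument list for a `worai seocheck` run."""
    cmd = ["worai", "seocheck", sitemap_url]
    if output_dir:
        cmd += ["--output-dir", output_dir]
    if not open_report:
        # Useful on headless machines where no report should be opened.
        cmd.append("--no-open-report")
    return cmd


def run_seocheck(
    sitemap_url: str,
    output_dir: Optional[str] = None,
    open_report: bool = True,
) -> int:
    """Run the command and return the process exit code unchanged."""
    cmd = seocheck_command(sitemap_url, output_dir, open_report)
    return subprocess.run(cmd, check=False).returncode
```

Passing the arguments as a list avoids shell quoting problems with URLs and paths.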

google-search-console

  • worai google-search-console --site sc-domain:example.com --client-secrets ./client_secrets.json

canonicalize-duplicate-pages

  • worai canonicalize-duplicate-pages --input gsc_pages.csv --output canonical_targets.csv --kpi-window 28d --kpi-metric clicks
  • worai canonicalize-duplicate-pages --input gsc_pages.csv --entity-type Product

dedupe

  • worai dedupe --dry-run

find-faq-page-wrong-type

  • worai find-faq-page-wrong-type ./data.ttl --dry-run --replace-type
  • worai find-faq-page-wrong-type ./data.ttl --patch --replace-type

find-missing-names

  • worai find-missing-names ./data.ttl

find-url-by-type

  • worai find-url-by-type ./data.ttl schema:Service schema:Product

link-groups

  • worai link-groups ./links.csv --format turtle
  • worai link-groups ./links.csv --apply --dry-run --concurrency 4

patch

  • worai patch ./data.ttl --dry-run --add-types

structured-data

  • worai structured-data create https://example.com/article Review --output-dir ./structured-data
  • worai structured-data create https://example.com/article --type Review --output-dir ./structured-data
  • worai structured-data create https://example.com/article --type Review --debug
  • worai structured-data create https://example.com/article --type Review --max-xhtml-chars 40000 --max-nesting-depth 2
  • worai structured-data generate https://example.com/sitemap.xml --yarrrml ./mapping.yarrrml --output-dir ./out
  • worai structured-data generate https://example.com/page --yarrrml ./mapping.yarrrml --format jsonld

validate

  • worai validate --shape review-snippet --shape schema-review ./data.jsonld
  • worai validate --format raw https://api.wordlift.io/data/example.jsonld

upload-entities-from-turtle

  • worai upload-entities-from-turtle ./entities --recursive --limit 50

Troubleshooting

  • Playwright missing browsers:
    • playwright install chromium
  • YARRRML conversion:
    • npm install -g @rmlio/yarrrml-parser
  • RML execution:
    • morph-kgc is included in project dependencies
  • OAuth token issues:
    • Remove the token file and re-run worai google-search-console.
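
A quick preflight check can catch some of these problems before a long run. The sketch below is a hypothetical helper, not part of worai. It only tests whether executables are on PATH, assuming the npm package installs a yarrrml-parser command; it cannot tell whether the Chromium browser itself has been downloaded.

```python
import shutil
from typing import Callable, Iterable, Optional


def missing_executables(
    names: Iterable[str],
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the names that cannot be found on PATH, in input order."""
    return [name for name in names if which(name) is None]


if __name__ == "__main__":
    missing = missing_executables(["worai", "playwright", "yarrrml-parser"])
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
```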
