Skip to main content

A CLI tool to parse sitemaps and extract URL metadata into a CSV format.

Project description

sitemapxml

sitemapxml is a powerful and fast command-line tool that extracts all URLs from a given XML sitemap, fetches each URL, and generates a comprehensive CSV report containing:

  • Extract all sitemap URLs
  • HTTP Status Code
  • Title Tag
  • Meta Description
  • Content Length
  • Canonical URL

Installation

Install via pip:

pip install sitemapxml

Usage

Simply run the CLI command and pass the URL of the sitemap:

sitemapxml https://example.com/sitemap.xml

This will automatically create a sitemap_report.csv file in your current directory containing all the extracted metrics. You can also specify an output file:

sitemapxml https://example.com/sitemap.xml -o my_report.csv

Author

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sitemapxml-0.1.1.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sitemapxml-0.1.1-py3-none-any.whl (4.1 kB view details)

Uploaded Python 3

File details

Details for the file sitemapxml-0.1.1.tar.gz.

File metadata

  • Download URL: sitemapxml-0.1.1.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sitemapxml-0.1.1.tar.gz
Algorithm Hash digest
SHA256 774498a7503e9ebfbc7588b149d649c56f6557c1bfd2396bf941b7f39c38547b
MD5 e12d094e26a85bfe146e2a415eb3af81
BLAKE2b-256 d107af9aacd9c795cd12222594d8c7055ddb661e2b3d33893f807929ce54232d

See more details on using hashes here.

File details

Details for the file sitemapxml-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: sitemapxml-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 4.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sitemapxml-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 9eb80efb7961aba5b7395629bba274e892b4128332267b471df7df311449f372
MD5 15ef012e17536d3961529877d6e218ab
BLAKE2b-256 ec3c18ce4092a1a833f05ed0a216b3057e74d65d4f1b92b9ef68a807124dc2ed

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page