Skip to main content

A CLI tool to parse sitemaps and extract URL metadata into a CSV format.

Project description

sitemapxml

sitemapxml is a powerful and fast command-line tool that extracts all URLs from a given XML sitemap, fetches each URL, and generates a comprehensive CSV report containing:

  • Extract all sitemap URLs
  • HTTP Status Code
  • Title Tag
  • Meta Description
  • Content Length
  • Canonical URL

Installation

Install via pip:

pip install sitemapxml

Usage

Simply run the CLI command and pass the URL of the sitemap:

sitemapxml https://example.com/sitemap.xml

This will automatically create a sitemap_report.csv file in your current directory containing all the extracted metrics. You can also specify an output file:

sitemapxml https://example.com/sitemap.xml -o my_report.csv

Author

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sitemapxml-0.1.0.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sitemapxml-0.1.0-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file sitemapxml-0.1.0.tar.gz.

File metadata

  • Download URL: sitemapxml-0.1.0.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sitemapxml-0.1.0.tar.gz
Algorithm Hash digest
SHA256 1bcfef93b8d6eac52c99ac238069165f87a4913f1f7f962ffc13c02ab281c03c
MD5 1a676c8c76736e6e32aba19f3a3a61ee
BLAKE2b-256 e3ca4840a8ce73031441a7aea502021607570b781c3de37100acd04ca7bf195f

See more details on using hashes here.

File details

Details for the file sitemapxml-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: sitemapxml-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for sitemapxml-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 9f09d3fc2278f3a9d48aa374aa2017198016a3b64e974dbf7b06478c1309599b
MD5 24834cb31c8fc47ba5feeab6e68ae784
BLAKE2b-256 6c420b7a560b44533256f36fc1d163a2e79acfdac132997b233cff88e526ce9e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page