Skip to main content

A command-line tool to generate sitemaps for a given URL.

Project description

Guidewd Sitemap Generator

Guidewd Sitemap Generator is a command-line tool designed to generate XML sitemaps for SEO optimization.
It crawls websites, extracts URLs, and generates a sitemap that can be submitted to search engines like Google.


Features

  • Crawl a website to generate a sitemap
  • Respect robots.txt rules (optional)
  • Limit the number of pages to crawl
  • Lightweight and easy to use from the command line
  • Generates a standard XML sitemap format for SEO purposes

Installation

You can install the package using pip:

pip install guidewd-sitemap-generator

Usage

After installation, you can run the guidewd-sitemap-generator command directly from your terminal.

  • Command-Line Interface:
guidewd-sitemap-generator <starting-url> --max-pages <max-pages> --respect-robots
  • <starting-url>: The URL to start crawling from (e.g., https://example.com).
  • --max-pages <max-pages>: Optional. Maximum number of pages to crawl (default: 200).
  • --respect-robots: Optional. If specified, the tool will respect the site's robots.txt rules.

Example

guidewd-sitemap-generator https://example.com --max-pages 100 --respect-robots

This will crawl the website starting from https://example.com, limit the crawl to 100 pages, and respect the robots.txt file.


Input

  • A valid website URL to start the crawl.

Output

  • An XML sitemap file (default: sitemap.xml) that contains a list of crawled URLs.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

guidewd-sitemap-generator-0.1.1.tar.gz (16.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

guidewd_sitemap_generator-0.1.1-py3-none-any.whl (16.8 kB view details)

Uploaded Python 3

File details

Details for the file guidewd-sitemap-generator-0.1.1.tar.gz.

File metadata

File hashes

Hashes for guidewd-sitemap-generator-0.1.1.tar.gz
Algorithm Hash digest
SHA256 4dd7147f48534ef4b95f994ac4d30842a8f1fef531758487b556c0c342aaf616
MD5 e578ca921ac4545a1e8036e639ceb909
BLAKE2b-256 b76765590d67c3920ce866d2b0474ac4b632f17163729c55407dca865afa712f

See more details on using hashes here.

File details

Details for the file guidewd_sitemap_generator-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for guidewd_sitemap_generator-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 93528eec63bbe8f24b0ed5917382809a924f22eac3ea5aa5f133c71727b5fb2d
MD5 b8b86cb57bc88da1b787c2e8019e107d
BLAKE2b-256 3caeb59f62cc8e80cdbc6b815e31cc0c12c723f4a52974a7eb72197b785cb7cd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page