A command-line tool to generate sitemaps for a given URL.
Project description
Guidewd Sitemap Generator
Guidewd Sitemap Generator is a command-line tool designed to generate XML sitemaps for SEO optimization.
It crawls websites, extracts URLs, and generates a sitemap that can be submitted to search engines like Google.
Features
- Crawl a website to generate a sitemap
- Respect
robots.txtrules (optional) - Limit the number of pages to crawl
- Lightweight and easy to use from the command line
- Generates a standard XML sitemap format for SEO purposes
Installation
You can install the package using pip:
pip install guidewd-sitemap-generator
Usage
After installation, you can run the guidewd-sitemap-generator command directly from your terminal.
- Command-Line Interface:
guidewd-sitemap-generator <starting-url> --max-pages <max-pages> --respect-robots
<starting-url>: The URL to start crawling from (e.g.,https://example.com).--max-pages <max-pages>: Optional. Maximum number of pages to crawl (default: 200).--respect-robots: Optional. If specified, the tool will respect the site'srobots.txtrules.
Example
guidewd-sitemap-generator https://example.com --max-pages 100 --respect-robots
This will crawl the website starting from https://example.com, limit the crawl to 100 pages, and respect the robots.txt file.
Input
- A valid website URL to start the crawl.
Output
- An XML sitemap file (default:
sitemap.xml) that contains a list of crawled URLs.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file guidewd-sitemap-generator-0.1.1.tar.gz.
File metadata
- Download URL: guidewd-sitemap-generator-0.1.1.tar.gz
- Upload date:
- Size: 16.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4dd7147f48534ef4b95f994ac4d30842a8f1fef531758487b556c0c342aaf616
|
|
| MD5 |
e578ca921ac4545a1e8036e639ceb909
|
|
| BLAKE2b-256 |
b76765590d67c3920ce866d2b0474ac4b632f17163729c55407dca865afa712f
|
File details
Details for the file guidewd_sitemap_generator-0.1.1-py3-none-any.whl.
File metadata
- Download URL: guidewd_sitemap_generator-0.1.1-py3-none-any.whl
- Upload date:
- Size: 16.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
93528eec63bbe8f24b0ed5917382809a924f22eac3ea5aa5f133c71727b5fb2d
|
|
| MD5 |
b8b86cb57bc88da1b787c2e8019e107d
|
|
| BLAKE2b-256 |
3caeb59f62cc8e80cdbc6b815e31cc0c12c723f4a52974a7eb72197b785cb7cd
|