Fetch a given sitemap and retrieve all URLs in it.
Project description
fetch-sitemap
Fetch a given sitemap and retrieve all URLs in it.
usage: fetch-sitemap
[-h]
[--basic-auth BASIC_AUTH]
[-l LIMIT]
[-c CONCURRENCY_LIMIT]
[-t REQUEST_TIMEOUT]
[--report-path REPORT_PATH]
sitemap_url
Fetch a given sitemap and retrieve all URLs in it.
positional arguments:
sitemap_url URL of the sitemap to fetch
options:
-h, --help show this help message and exit
--basic-auth BASIC_AUTH
Basic auth information. Use: 'username:password'.
-l LIMIT, --limit LIMIT
Max number of URLs to fetch from the given sitemap.xml. Default: All
-c CONCURRENCY_LIMIT, --concurrency-limit CONCURRENCY_LIMIT
Max number of concurrent requests. Default: 10
-t REQUEST_TIMEOUT, --request-timeout REQUEST_TIMEOUT
Timeout for fetching a URL. Default: 30
--random Append a random string like ?12334232343 to each URL to bypass frontend cache. Default: False
--report-path REPORT_PATH
Store results in a CSV file. Example: ./report.csv
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fetch-sitemap-2.tar.gz
(5.3 kB
view hashes)
Built Distribution
Close
Hashes for fetch_sitemap-2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | dd995ce163c2ef2395b96f56f3b6de4ccd3714fbc3a81c258fb6a22e5b3483ab |
|
MD5 | da2429b4cfe20f4f68b95a5cb8ccc8ac |
|
BLAKE2b-256 | fe9997d83539d80b964604fbcf0a1db2bc6dd6d98b5ab7385d93f6f5d45d6d4a |