A performant library for parsing and crawling sitemaps

Project description

Ultimate Sitemap Parser (USP) is a performant and robust Python library for parsing and crawling sitemaps.

Features

Installation

pip install ultimate-sitemap-parser

or using Anaconda:

conda install -c conda-forge ultimate-sitemap-parser

Usage

from usp.tree import sitemap_tree_for_homepage

tree = sitemap_tree_for_homepage('https://www.example.org/')

for page in tree.all_pages():
    print(page.url)

sitemap_tree_for_homepage() returns a tree of AbstractSitemap subclass objects that represent the sitemap hierarchy found on the website; the documentation includes a reference of the AbstractSitemap subclasses. AbstractSitemap.all_pages() returns a generator, so pages can be iterated over efficiently without loading the entire tree into memory.
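Because all_pages() is a generator, a caller can stop early and only pay for the pages it actually reads. The sketch below illustrates that idea with itertools.islice. The helper first_page_urls() and the stand-in generator fake_pages() are hypothetical names introduced for illustration and are not part of USP; the only thing assumed about a page object is the .url attribute used in the usage example above.

```python
from itertools import islice
from types import SimpleNamespace
from typing import Iterable, Iterator, List


def first_page_urls(pages: Iterable, limit: int) -> List[str]:
    """Collect the URLs of at most `limit` pages without exhausting `pages`."""
    return [page.url for page in islice(pages, limit)]


def fake_pages(count: int, log: List[int]) -> Iterator[SimpleNamespace]:
    """Hypothetical stand-in for tree.all_pages(); records each page it yields."""
    for i in range(count):
        log.append(i)  # track how many pages were actually produced
        yield SimpleNamespace(url=f"https://www.example.org/page-{i}")


produced: List[int] = []
sample = first_page_urls(fake_pages(1000, produced), limit=3)
print(sample)         # the first three URLs
print(len(produced))  # how many pages the generator had to produce
```

With the real library, the same helper could presumably be called as first_page_urls(tree.all_pages(), 10) to peek at a large site without walking every sitemap entry.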

For more examples and details, see the documentation.


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultimate_sitemap_parser-1.8.0.tar.gz (39.4 kB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ultimate_sitemap_parser-1.8.0-py3-none-any.whl (44.2 kB)

Uploaded Python 3

File details

Details for the file ultimate_sitemap_parser-1.8.0.tar.gz.

File metadata

  • Download URL: ultimate_sitemap_parser-1.8.0.tar.gz
  • Upload date:
  • Size: 39.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ultimate_sitemap_parser-1.8.0.tar.gz
Algorithm Hash digest
SHA256 b89e173a7a30ae8d3fbf5c51e8b72985f2f1240e338064a315f35652c89442bc
MD5 1f4a209176912c392f94930f2bf288e6
BLAKE2b-256 8e87b4767b5181b0a6b7aafa5021b6b37b3eea104ba8ad80d3b527e8f423c85c

See more details on using hashes here.
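One way to use these digests is to recompute the SHA256 of a downloaded file and compare it with the listed value. The following is a minimal sketch using only the Python standard library; the helper names sha256_hex() and file_matches() are hypothetical, and the local file path passed to file_matches() is whatever location the archive was saved to.

```python
import hashlib
import io
from typing import BinaryIO

# SHA256 digest listed above for ultimate_sitemap_parser-1.8.0.tar.gz
EXPECTED_SHA256 = "b89e173a7a30ae8d3fbf5c51e8b72985f2f1240e338064a315f35652c89442bc"


def sha256_hex(stream: BinaryIO, chunk_size: int = 65536) -> str:
    """Hash a binary stream in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file at `path` has the expected SHA256 digest."""
    with open(path, "rb") as handle:
        return sha256_hex(handle) == expected.lower()


# Small in-memory check that the helper agrees with hashlib directly
assert sha256_hex(io.BytesIO(b"abc")) == hashlib.sha256(b"abc").hexdigest()
```

A call such as file_matches("ultimate_sitemap_parser-1.8.0.tar.gz") would then return True only if the local copy matches the published digest.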

Provenance

The following attestation bundles were made for ultimate_sitemap_parser-1.8.0.tar.gz:

Publisher: publish.yml on GateNLP/ultimate-sitemap-parser

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ultimate_sitemap_parser-1.8.0-py3-none-any.whl.

File hashes

Hashes for ultimate_sitemap_parser-1.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 438dcfe8aa8efc4e587b567a4abc8b1d2486c52b5ca52d5c81520f08e0434449
MD5 f640936fdaa288676300527aea33c1f6
BLAKE2b-256 43966d3eee0013dfebd45b2e3650d4db96c27471f30115dbfcb9e0a002406c1a

Provenance

The following attestation bundles were made for ultimate_sitemap_parser-1.8.0-py3-none-any.whl:

Publisher: publish.yml on GateNLP/ultimate-sitemap-parser
