Skip to main content

A performant library for parsing and crawling sitemaps

Project description

PyPI - Python Version PyPI - Version Conda Version Pepy Total Downloads

Ultimate Sitemap Parser (USP) is a performant and robust Python library for parsing and crawling sitemaps.

Features

Installation

pip install ultimate-sitemap-parser

or using Anaconda:

conda install -c conda-forge ultimate-sitemap-parser

Usage

from usp.tree import sitemap_tree_for_homepage

tree = sitemap_tree_for_homepage('https://www.example.org/')

for page in tree.all_pages():
    print(page.url)

sitemap_tree_for_homepage() will return a tree of AbstractSitemap subclass objects that represent the sitemap hierarchy found on the website; see a reference of AbstractSitemap subclasses. AbstractSitemap.all_pages() returns a generator to efficiently iterate over pages without loading the entire tree into memory.

For more examples and details, see the documentation.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultimate_sitemap_parser-1.8.1.tar.gz (39.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ultimate_sitemap_parser-1.8.1-py3-none-any.whl (44.6 kB view details)

Uploaded Python 3

File details

Details for the file ultimate_sitemap_parser-1.8.1.tar.gz.

File metadata

  • Download URL: ultimate_sitemap_parser-1.8.1.tar.gz
  • Upload date:
  • Size: 39.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ultimate_sitemap_parser-1.8.1.tar.gz
Algorithm Hash digest
SHA256 6cf5ae0cfd83a2af5650402fc0ec462c83ac7558ef781fba9c012d1db123344f
MD5 f86809572fae152c1a3c7551860711b6
BLAKE2b-256 c3954b4bfd9bc572fdfd5c0ff70c85cd1b220181aaa6eb7cc70a0d2255444c10

See more details on using hashes here.

Provenance

The following attestation bundles were made for ultimate_sitemap_parser-1.8.1.tar.gz:

Publisher: publish.yml on GateNLP/ultimate-sitemap-parser

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ultimate_sitemap_parser-1.8.1-py3-none-any.whl.

File metadata

File hashes

Hashes for ultimate_sitemap_parser-1.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7fe9347983ca9ca1f8960e7d5bf08cb49679b20d32532a9c7843e20fe31c0840
MD5 1cbb58c559e497e9d334a991b52d92ec
BLAKE2b-256 c7bca34d6033f30bef4cdefe8597106ea375479a2006631042ce28e4f2133522

See more details on using hashes here.

Provenance

The following attestation bundles were made for ultimate_sitemap_parser-1.8.1-py3-none-any.whl:

Publisher: publish.yml on GateNLP/ultimate-sitemap-parser

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page