Skip to main content

Powerful and flexible web scraping Search Engine

Project description

soupsavvy

Powerful and flexible web scraping Search Engine

PyPI Python Versions Codecov Docs link

Table of Contents

About

With many web scraping libraries available, each with unique interfaces and conventions, soupsavvy provides conistent and easy way of building selection workflows.

With soupsavvy, developers can focus on data extraction workflows instead of wrestling with library-specific quirks and inconsistencies. Eliminate complexity and introduce scalability and maintainability to your web scraping projects.

Key Features

soupsavvy introduces the concept of Selector, a declarative, consistent, and reusable search procedure that is based on following principles:

  • Decoupling: Selection logic is abstracted away from DOM node and traversal implementations.
  • Framework-Agnostic: Operates consistently with any supported library.
  • Flexibile & Extensibile: Lightweight, reusable components used to build complex selection workflows.

Installation

soupsavvy is published on PyPi and can be installed via pip:

pip install soupsavvy

Documentation

Full documentation can be found at documentation.

Demos

For more information about the package, its concepts and usage, read Demos section of the documentation. It's step by step guide to the most important features of the package.

Contributing

If you'd like to contribute to soupsavvy, feel free to check out the GitHub repository and submit pull requests into one of development branches. Any feedback, bug reports, or feature requests are welcome! In case of any doubts, follow Contribution Guidelines

License

soupsavvy is licensed under MIT License, allowing for both personal and commercial use. See the LICENSE file for more information.


Happy scraping!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soupsavvy-1.1.0.tar.gz (64.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

soupsavvy-1.1.0-py3-none-any.whl (87.4 kB view details)

Uploaded Python 3

File details

Details for the file soupsavvy-1.1.0.tar.gz.

File metadata

  • Download URL: soupsavvy-1.1.0.tar.gz
  • Upload date:
  • Size: 64.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for soupsavvy-1.1.0.tar.gz
Algorithm Hash digest
SHA256 b047d00d3fccea9fd886aaca97d5b9eca3f95330acd2cf379c8c23e229a98cd6
MD5 1dd145b7448178cae3c09f968bfc2bd9
BLAKE2b-256 a40a707b40ce1545ba3ac959a512779227634dfef6d020a33624e4aeec8f0795

See more details on using hashes here.

Provenance

The following attestation bundles were made for soupsavvy-1.1.0.tar.gz:

Publisher: production_release.yml on sewcio543/soupsavvy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file soupsavvy-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: soupsavvy-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 87.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for soupsavvy-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 04b4fb249168d4997bdc0885e051e68d0ca29f1f21a124b6f44c5163ecd585cc
MD5 892f87265edc7d7b1f32652b8f717ff2
BLAKE2b-256 7d8c93aadb219461169ad946ff992006bda692026ac3a28cb4b1cfa4abffc710

See more details on using hashes here.

Provenance

The following attestation bundles were made for soupsavvy-1.1.0-py3-none-any.whl:

Publisher: production_release.yml on sewcio543/soupsavvy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page