Skip to main content

Powerful and flexible search engine for BeautifulSoup

Project description

soupsavvy

Powerful and flexible web scraping Search Engine

PyPI Python Versions Codecov Docs link

Table of Contents

About

With many web scraping libraries available, each with unique interfaces and conventions, soupsavvy provides conistent and easy way of building selection workflows.

With soupsavvy, developers can focus on data extraction workflows instead of wrestling with library-specific quirks and inconsistencies. Eliminate complexity and introduce scalability and maintainability to your web scraping projects.

Key Features

soupsavvy introduces the concept of Selector, a declarative, consistent, and reusable search procedure that is based on following principles:

  • Decoupling: Selection logic is abstracted away from DOM node and traversal implementations.
  • Framework-Agnostic: Operates consistently with any supported library.
  • Flexibile & Extensibile: Lightweight, reusable components used to build complex selection workflows.

Installation

soupsavvy is published on PyPi and can be installed via pip:

pip install soupsavvy

Documentation

Full documentation can be found at documentation.

Demos

For more information about the package, its concepts and usage, read Demos section of the documentation. It's step by step guide to the most important features of the package.

Contributing

If you'd like to contribute to soupsavvy, feel free to check out the GitHub repository and submit pull requests into one of development branches. Any feedback, bug reports, or feature requests are welcome! In case of any doubts, follow Contribution Guidelines

License

soupsavvy is licensed under MIT License, allowing for both personal and commercial use. See the LICENSE file for more information.


Happy scraping!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

soupsavvy-1.0.0.tar.gz (56.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

soupsavvy-1.0.0-py3-none-any.whl (76.2 kB view details)

Uploaded Python 3

File details

Details for the file soupsavvy-1.0.0.tar.gz.

File metadata

  • Download URL: soupsavvy-1.0.0.tar.gz
  • Upload date:
  • Size: 56.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for soupsavvy-1.0.0.tar.gz
Algorithm Hash digest
SHA256 b9fa22abcdd8f8964ddf49acb51c72dda62795d51d9e429a5551b5abaf697c67
MD5 ee6eb8a64008739adf1afbf1daa77161
BLAKE2b-256 b2e84364dc8bf9c7f98c83358ef0600c239a3d4b380e77330e83581e8c0591c0

See more details on using hashes here.

Provenance

The following attestation bundles were made for soupsavvy-1.0.0.tar.gz:

Publisher: production_release.yml on sewcio543/soupsavvy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file soupsavvy-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: soupsavvy-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 76.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.0.1 CPython/3.12.8

File hashes

Hashes for soupsavvy-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bd94e0a60545df69adf4d7275406c0b5d943664c322d2d994f1c60878b974f76
MD5 7d9f47a7f82b0ca22f64c1617ec9b7db
BLAKE2b-256 4ea1658f0aae9009cb51587a80f003e6f3cfd5f61c1caf000c86b6314f20e77a

See more details on using hashes here.

Provenance

The following attestation bundles were made for soupsavvy-1.0.0-py3-none-any.whl:

Publisher: production_release.yml on sewcio543/soupsavvy

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page