scraping stuff

raggy

A Python library for scraping and document processing.

Installation

pip install raggy

For additional features:

pip install "raggy[scrapling]"  # Enhanced web scraping via Scrapling
pip install "raggy[chroma]"     # ChromaDB support
pip install "raggy[tpuf]"       # TurboPuffer support
pip install "raggy[pdf]"        # PDF processing
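
To confirm that the install succeeded, one option is to ask the standard library's importlib.metadata which version is present. This is a minimal sketch: installed_version is a hypothetical helper name used for illustration, not part of raggy.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


# Prints the version string if raggy is installed, otherwise None.
print(installed_version("raggy"))
```

The same helper can be pointed at the distributions pulled in by an extra to see whether that extra was installed.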

Read the docs

What is it?

A Python library for:

  • scraping the web to produce rich documents
  • putting these documents in vectorstores
  • querying the vectorstores to find documents similar to a query
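
The three steps above can be pictured with a toy, standard-library-only sketch. It does not use raggy's API: Document, ToyVectorstore, embed, and cosine are hypothetical names, the "scraped" documents are hard-coded with placeholder URLs instead of fetched, and word counts stand in for real embeddings.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass


@dataclass
class Document:
    text: str
    source: str


def embed(text: str) -> Counter:
    # Stand-in for a real embedding model: a bag-of-words count vector.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


class ToyVectorstore:
    def __init__(self) -> None:
        self._items = []  # list of (vector, Document) pairs

    def add(self, documents) -> None:
        for doc in documents:
            self._items.append((embed(doc.text), doc))

    def query(self, text: str, top_k: int = 1):
        q = embed(text)
        ranked = sorted(
            self._items, key=lambda item: cosine(q, item[0]), reverse=True
        )
        return [doc for _, doc in ranked[:top_k]]


# Step 1 (scraping) is replaced by hard-coded documents in this sketch.
docs = [
    Document("Install the package with pip and import it.",
             "https://example.com/install"),
    Document("Vector stores index embeddings for similarity search.",
             "https://example.com/vectorstores"),
]

# Step 2: put the documents in a vectorstore.
store = ToyVectorstore()
store.add(docs)

# Step 3: query for the document most similar to a question.
best = store.query("how do I search a vector index by similarity?")[0]
print(best.source)
```

A real vectorstore such as ChromaDB or TurboPuffer replaces the word-count vectors with learned embeddings and the linear scan with an index, but the add-then-query shape is the same.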

[!TIP] See this example to chat with any website, or this example to chat with any GitHub repo.

License and Dependencies

[!IMPORTANT] This project is licensed under the MIT License - see the LICENSE file for details.

When installing the optional [scrapling] dependency, please note that Scrapling is licensed under the BSD-3-Clause license. By using this optional feature, you agree to comply with Scrapling's license terms.

Contributing

We welcome contributions! See our contributing guide for details.
