scraping stuff
Project description
raggy
A Python library for scraping and document processing.
Installation
pip install raggy
For additional features:
pip install raggy[scrapling] # Enhanced web scraping via Scrapling
pip install raggy[chroma] # ChromaDB support
pip install raggy[tpuf] # TurboPuffer support
pip install raggy[pdf] # PDF processing
Read the docs
What is it?
A Python library for:
- scraping the web to produce rich documents
- putting these documents in vectorstores
- querying the vectorstores to find documents similar to a query
[!TIP] See this example to chat with any website, or this example to chat with any GitHub repo.
License and Dependencies
[!IMPORTANT] This project is licensed under the MIT License - see the LICENSE file for details.
When installing the optional
[scrapling]
dependency, please note that Scrapling is licensed under the BSD-3-Clause license. By using this optional feature, you agree to comply with Scrapling's license terms.
Contributing
We welcome contributions! See our contributing guide for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file raggy-0.2.3.tar.gz
.
File metadata
- Download URL: raggy-0.2.3.tar.gz
- Upload date:
- Size: 672.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f1993be40397817df36db99f78a8e5d3e12060b46c2bf664f6001d9155ed197 |
|
MD5 | 14dd1e200773a7ff565fd3e8dcdc7a71 |
|
BLAKE2b-256 | d448ba0e7fa8129e3da07074caa3dda96a92eb06b8a25e513edd63487abfb228 |
File details
Details for the file raggy-0.2.3-py3-none-any.whl
.
File metadata
- Download URL: raggy-0.2.3-py3-none-any.whl
- Upload date:
- Size: 30.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 34e47a9d19d0c74df91a14fe0a639ec1381b9003559c2935079136494f142247 |
|
MD5 | 2555554c6554f2270123924ec7af6c95 |
|
BLAKE2b-256 | 99451e0c69d6f21491bdebbe6fe6ab75ef02b0b9a527d36779d955ba528fb4bc |