Skip to main content

scraping stuff

Project description

raggy

a Python library for scraping and document processing

installation

uv add raggy

add extras to load different document types:

uv add 'raggy[chroma]'     # ChromaDB support
uv add 'raggy[tpuf]'       # TurboPuffer support
uv add 'raggy[pdf]'        # PDF processing

read the docs

what is it?

a simple-to-use Python library for:

  • scraping the web to produce rich documents
  • putting these documents in vectorstores
  • querying the vectorstores to find documents similar to a query

[!TIP] See this example to chat with any website, or this example to chat with any GitHub repo.

license

this project is licensed under the Apache 2.0 License - see the LICENSE file for details.

contributing

I welcome contributions! See the contributing guide for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

raggy-0.3.5.tar.gz (820.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

raggy-0.3.5-py3-none-any.whl (37.5 kB view details)

Uploaded Python 3

File details

Details for the file raggy-0.3.5.tar.gz.

File metadata

  • Download URL: raggy-0.3.5.tar.gz
  • Upload date:
  • Size: 820.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for raggy-0.3.5.tar.gz
Algorithm Hash digest
SHA256 5bdcf23faa1ef95d5228b74ac658dff5d2fd4b0977cbd70b92a48baf3a6031a6
MD5 c9710995b7c59132eab2b79a68337a46
BLAKE2b-256 0a41d4e2be550728ee8e1a6c14c4b8ab61152d36b5e680436a15eb5562ed64af

See more details on using hashes here.

File details

Details for the file raggy-0.3.5-py3-none-any.whl.

File metadata

  • Download URL: raggy-0.3.5-py3-none-any.whl
  • Upload date:
  • Size: 37.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for raggy-0.3.5-py3-none-any.whl
Algorithm Hash digest
SHA256 21922d8263bfb1a2d1984c07ff96f6216d39e8b0ed92b3170d4270d983b12993
MD5 a6570584e10106ed1147f008b289146e
BLAKE2b-256 037e693c2a66b0256b6971f83ee6520129f20fc06e8600a857a46777c3e75a88

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page