
An end-to-end RAG solution for web content.

Project description

ragscrape

scraper

The scraper assumes you are running it inside a Python virtual environment.

To create the virtual environment:

python3 -m venv .venv
source .venv/bin/activate
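
As a quick sanity check that the activation step took effect, the following minimal sketch reports whether the current interpreter is running inside a virtual environment. The helper name `in_virtualenv` is hypothetical and not part of this project; it relies only on the standard-library `sys` module.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the environment directory,
    # while sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

If this prints `False` after running the activation command, the shell is probably still using the system interpreter.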

To install the requirements:

pip install -r requirements.txt

To run the scraper:

python3 app.py

The scraper currently demonstrates a few basic scraping techniques, and the website provides a convenient way to compare them.
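
To illustrate what comparing basic scraping techniques can look like, here is a minimal sketch that extracts text from the same HTML in two ways: a naive regular expression that strips tags, and the standard-library `html.parser`. This is not the project's actual code; `SAMPLE_HTML`, `extract_with_regex`, and `extract_with_parser` are hypothetical names chosen for the example, and the techniques implemented in `app.py` may differ.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample page used only for this comparison.
SAMPLE_HTML = """<html><head><title>Demo</title><style>p {color: red;}</style></head>
<body><h1>Hello</h1><p>First <b>bold</b> paragraph.</p><script>var x = 1;</script></body></html>"""


def extract_with_regex(html: str) -> str:
    """Technique 1: strip anything that looks like a tag."""
    text = re.sub(r"<[^>]+>", " ", html)
    # Collapse runs of whitespace left behind by the removed tags.
    return " ".join(text.split())


class _TextExtractor(HTMLParser):
    """Collects visible text, ignoring <script> and <style> contents."""

    SKIPPED = {"script", "style"}

    def __init__(self) -> None:
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def extract_with_parser(html: str) -> str:
    """Technique 2: walk the document with a real HTML parser."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


if __name__ == "__main__":
    print("regex :", extract_with_regex(SAMPLE_HTML))
    print("parser:", extract_with_parser(SAMPLE_HTML))
```

The regex version is shorter but leaks the CSS and JavaScript source into the output, while the parser version can skip those elements. That trade-off is typical when choosing text extraction for a RAG pipeline.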

Download files

Source Distribution

saldor-0.0.1.tar.gz (5.0 kB view details)

Uploaded Source

Built Distribution

saldor-0.0.1-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file saldor-0.0.1.tar.gz.

File metadata

  • Download URL: saldor-0.0.1.tar.gz
  • Upload date:
  • Size: 5.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.9 Darwin/23.5.0

File hashes

Hashes for saldor-0.0.1.tar.gz
Algorithm Hash digest
SHA256 8bb417f4175537428323a1b3681dac6024e00ae62b93126e023dd1fc0a2e48b5
MD5 b9b2143d77632a128e59d2e5560f9b61
BLAKE2b-256 f2e9efc4214c1915a44b25ebc137f32d24c42dce23d1d5cae21fadcd38332926

File details

Details for the file saldor-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: saldor-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 6.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.9.9 Darwin/23.5.0

File hashes

Hashes for saldor-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 622e260030cdcd9aaff9a61a0b00242d79fd51419574552fcc52731b61efcece
MD5 77fa770a479bdd9dcd8acf4f4d2537eb
BLAKE2b-256 65b8a904af59197cfcbd9dd61bd1125af7e71a5c120fe1978230cb48d10656b9
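
The published digests can be used to check that a downloaded file is intact. The following is a minimal sketch using the standard-library `hashlib`; the helper names `sha256_of` and `matches_published_hash` are hypothetical, and the example assumes the source distribution was saved to the current directory under its original file name.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # SHA256 digest listed above for saldor-0.0.1.tar.gz.
    expected = "8bb417f4175537428323a1b3681dac6024e00ae62b93126e023dd1fc0a2e48b5"
    # Assumed local path; adjust to wherever the file was downloaded.
    archive = Path("saldor-0.0.1.tar.gz")
    if archive.exists():
        print("hash matches:", matches_published_hash(archive, expected))
    else:
        print("file not found:", archive)
```

When installing with pip, a similar check can be enforced by pinning hashes in a requirements file, so a manual check like this is mainly useful for files downloaded by hand.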
