An end to end RAG solution for web content.
Project description
ragscrape
scraper
The scraper is written assuming you are running a python virtual environment.
To create the virtual environment:
python3 -m venv .venv
source .venv/bin/activate
To install the requirements:
pip install -r requirements.txt
To run the scraper:
python3 app.py
The scraper right now shows a few different basic scraping techniques and the website provides a nice way to compare them.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file saldor-0.0.1.tar.gz.
File metadata
- Download URL: saldor-0.0.1.tar.gz
- Upload date:
- Size: 5.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.9.9 Darwin/23.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8bb417f4175537428323a1b3681dac6024e00ae62b93126e023dd1fc0a2e48b5
|
|
| MD5 |
b9b2143d77632a128e59d2e5560f9b61
|
|
| BLAKE2b-256 |
f2e9efc4214c1915a44b25ebc137f32d24c42dce23d1d5cae21fadcd38332926
|
File details
Details for the file saldor-0.0.1-py3-none-any.whl.
File metadata
- Download URL: saldor-0.0.1-py3-none-any.whl
- Upload date:
- Size: 6.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.3 CPython/3.9.9 Darwin/23.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
622e260030cdcd9aaff9a61a0b00242d79fd51419574552fcc52731b61efcece
|
|
| MD5 |
77fa770a479bdd9dcd8acf4f4d2537eb
|
|
| BLAKE2b-256 |
65b8a904af59197cfcbd9dd61bd1125af7e71a5c120fe1978230cb48d10656b9
|