Agentic Web Scraper
Project description
Agentic web scraping framework with a built-in chat CLI.
What is Scrapurrr?
Scrapurrr is a Python framework for building agentic web scraping and automation apps, with a ready-to-use interactive CLI called chatpurrr.
As a framework, you define a Pydantic schema, point it at a URL, and get back typed data. It handles browser rendering, anti-detection, pagination, and LLM-powered extraction automatically.
As a CLI tool, you run chatpurrr and talk to it in natural language. Navigate pages, inspect elements, extract data, all from your terminal.
Built on PatchRight and CloakBrowser for undetected browsing. Supports 100+ LLM providers via LiteLLM.
Install
pip install scrapurrr
Get Started
Use the chat CLI:
chatpurrr
Use the framework:
from scrapurrr import Scrapurrr
Documentation
- Chatpurrr - Interactive CLI usage, slash commands, setup
- Framework - Library API, extraction, agent mode, element inspection, configuration
Usage Policy
Scrapurrr is intended for legitimate use cases: data collection from public sources, authorized testing, research, and personal automation. Users are responsible for complying with the terms of service of any website they interact with. Respect robots.txt, rate limits, and applicable laws.
License
MIT. See LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scrapurrr-0.5.3.tar.gz.
File metadata
- Download URL: scrapurrr-0.5.3.tar.gz
- Upload date:
- Size: 281.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a22be556feb8ea837c2d92adff626742ac46e1332b0a0893cc0817ec4929052d
|
|
| MD5 |
6cc85c62800436a7e7ba75fe97f65956
|
|
| BLAKE2b-256 |
69b3dcf30711e86814bc207ffe641c180f801066eee6cc2748293c7d8c78f0da
|
File details
Details for the file scrapurrr-0.5.3-py3-none-any.whl.
File metadata
- Download URL: scrapurrr-0.5.3-py3-none-any.whl
- Upload date:
- Size: 97.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
29667956b97eab332dcd747f438c0e1a1f35d89df9ae3f695503ea921b1c2067
|
|
| MD5 |
d31d0c36ccedeb56e01127b9c586b19e
|
|
| BLAKE2b-256 |
8f149317fa3eb3fb676adf7b1317672aeee832bf51e540cac0aa4858735ba1d1
|