Add your description here
Project description
bfg-crawl
A simple web crawler that fetches pages and stores them in SQLite.
Features
- Multi-threaded crawling
- Rate limiting support
- SQLite database storage
Install
pip install bfg-crawl
Usage
from crawl import Crawler, RateLimitingLoader
loader = ALoaderForThisWebsite()
crawler = Crawler("pages.db", RateLimitingLoader(loader), concurrency=5)
crawler.run()
The crawler reads URLs from a SQLite database and saves the page content back to the database.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
bfg_crawl-0.1.0.tar.gz
(2.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bfg_crawl-0.1.0.tar.gz.
File metadata
- Download URL: bfg_crawl-0.1.0.tar.gz
- Upload date:
- Size: 2.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7c418b18f6941a4651cc7235938db0cb38428fc1a9187c75d33f5c7af41c0956
|
|
| MD5 |
287d24494085dde2f1a8428391f13141
|
|
| BLAKE2b-256 |
1427c1b2cbc9878a7afb62ed805a25affc0d2713b3203f43af55c5e7d3c2e5a3
|
File details
Details for the file bfg_crawl-0.1.0-py3-none-any.whl.
File metadata
- Download URL: bfg_crawl-0.1.0-py3-none-any.whl
- Upload date:
- Size: 4.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a4eec45bbf80aa9e83e9f5a219ff8d96cd6a1d5686f1bcbc3ea543ff184f3290
|
|
| MD5 |
8ef033835071c2d7fbd65977b6fd2ca5
|
|
| BLAKE2b-256 |
df5432a888d25487abc9a80671ee3ad4ecb57db8b69cc19b62ef582d47300e4f
|