Add your description here
Project description
bfg-crawl
A simple web crawler that fetches pages and stores them in SQLite.
Features
- Multi-threaded crawling
- Rate limiting support
- SQLite database storage
Install
pip install bfg-crawl
Usage
from crawl import Crawler, RateLimitingLoader
loader = ALoaderForThisWebsite()
crawler = Crawler("pages.db", RateLimitingLoader(loader), concurrency=5)
crawler.run()
The crawler reads URLs from a SQLite database and saves the page content back to the database.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
bfg_crawl-0.1.2.tar.gz
(2.7 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bfg_crawl-0.1.2.tar.gz.
File metadata
- Download URL: bfg_crawl-0.1.2.tar.gz
- Upload date:
- Size: 2.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fc926d571be8f2ca33a78d423fb41a3d321eab7380baa2f1b7638f4ed9f00a5a
|
|
| MD5 |
ae65c8a7a80c8575d8ffaf30f12d5389
|
|
| BLAKE2b-256 |
27cf3534220f621c9f31bccf4927192f337def95860eb49c5f60457bbf40ba36
|
File details
Details for the file bfg_crawl-0.1.2-py3-none-any.whl.
File metadata
- Download URL: bfg_crawl-0.1.2-py3-none-any.whl
- Upload date:
- Size: 4.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
df287b4ba6a75479cc1d1785ce2424118973512d4325da74459e1ca2875d59c9
|
|
| MD5 |
0ceb440435cc8bb43f00a05dea956d56
|
|
| BLAKE2b-256 |
0d2c088e4beb3b5c45e760febe4c6dfeb0df103514d0710e031149ab0816c819
|