Async Python 3.6+ web scraping micro-framework based on asyncio.
Project description
Ruia
🕸️ Async Python 3.6+ web scraping micro-framework based on asyncio.
⚡ Write less, run faster.
Overview
Ruia is an async web scraping micro-framework, written with asyncio
and aiohttp
,
aims to make crawling url as convenient as possible.
Write less, run faster:
- Documentation: 中文文档 |documentation
- Organization: python-ruia
- Plugin: awesome-ruia(Any contributions you make are greatly appreciated!)
Features
- Easy: Declarative programming
- Fast: Powered by asyncio
- Extensible: Middlewares and plugins
- Powerful: JavaScript support
Installation
# For Linux & Mac
pip install -U ruia[uvloop]
# For Windows
pip install -U ruia
# New features
pip install git+https://github.com/howie6879/ruia
Tutorials
- Overview
- Installation
- Define Data Items
- Spider Control
- Request & Response
- Customize Middleware
- Write a Plugins
TODO
- Cache for debug, to decreasing request limitation, ruia-cache
- Provide an easy way to debug the script, ruia-shell
- Distributed crawling/scraping
Contribution
Ruia is still under developing, feel free to open issues and pull requests:
- Report or fix bugs
- Require or publish plugins
- Write or fix documentation
- Add test cases
!!!Notice: We use black to format the code.
Thanks
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ruia-0.8.5.tar.gz
(25.4 kB
view details)
Built Distribution
ruia-0.8.5-py2.py3-none-any.whl
(30.3 kB
view details)
File details
Details for the file ruia-0.8.5.tar.gz
.
File metadata
- Download URL: ruia-0.8.5.tar.gz
- Upload date:
- Size: 25.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4c94e40e48ba6829dfcc45852d6a99a1187f1c3309d8b6858c080b7844b14a72 |
|
MD5 | 21ee60bb4fdbbf9014d26ff1e9944027 |
|
BLAKE2b-256 | de1bb3f516402d1e17f01487ae0dc19daa9c9f968beb21eb593b9903623bc5d3 |
File details
Details for the file ruia-0.8.5-py2.py3-none-any.whl
.
File metadata
- Download URL: ruia-0.8.5-py2.py3-none-any.whl
- Upload date:
- Size: 30.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7da42157fd2a1437a4cd8dd0be1bb24a2aae1e656bdfc132a8cf1b1b7e779b46 |
|
MD5 | 7886b9eacb917c86828675f50decf41c |
|
BLAKE2b-256 | 12f516d254cf0a6cd1b088e56992b8a2e6f73e19f3b0b8d5024720ecf6789d4b |