Async Python 3.6+ web scraping micro-framework based on asyncio.
Project description
Ruia
🕸️ Async Python 3.6+ web scraping micro-framework based on asyncio.
⚡ Write less, run faster.
Overview
Ruia is an async web scraping micro-framework, written with asyncio and aiohttp,
aims to make crawling url as convenient as possible.
Write less, run faster:
- Documentation: 中文文档 |documentation
- Organization: python-ruia
- Plugin: awesome-ruia(Any contributions you make are greatly appreciated!)
Features
- Easy: Declarative programming
- Fast: Powered by asyncio
- Extensible: Middlewares and plugins
- Powerful: JavaScript support
Installation
# For Linux & Mac
pip install -U ruia[uvloop]
# For Windows
pip install -U ruia
# New features
pip install git+https://github.com/howie6879/ruia
Tutorials
- Overview
- Installation
- Define Data Items
- Spider Control
- Request & Response
- Customize Middleware
- Write a Plugins
TODO
- Cache for debug, to decreasing request limitation, ruia-cache
- Provide an easy way to debug the script, ruia-shell
- Distributed crawling/scraping
Contribution
Ruia is still under developing, feel free to open issues and pull requests:
- Report or fix bugs
- Require or publish plugins
- Write or fix documentation
- Add test cases
!!!Notice: We use black to format the code.
Thanks
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ruia-0.8.5.tar.gz.
File metadata
- Download URL: ruia-0.8.5.tar.gz
- Upload date:
- Size: 25.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4c94e40e48ba6829dfcc45852d6a99a1187f1c3309d8b6858c080b7844b14a72
|
|
| MD5 |
21ee60bb4fdbbf9014d26ff1e9944027
|
|
| BLAKE2b-256 |
de1bb3f516402d1e17f01487ae0dc19daa9c9f968beb21eb593b9903623bc5d3
|
File details
Details for the file ruia-0.8.5-py2.py3-none-any.whl.
File metadata
- Download URL: ruia-0.8.5-py2.py3-none-any.whl
- Upload date:
- Size: 30.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7da42157fd2a1437a4cd8dd0be1bb24a2aae1e656bdfc132a8cf1b1b7e779b46
|
|
| MD5 |
7886b9eacb917c86828675f50decf41c
|
|
| BLAKE2b-256 |
12f516d254cf0a6cd1b088e56992b8a2e6f73e19f3b0b8d5024720ecf6789d4b
|