A light and fast python Web crawler framework based on asyncio.
Project description
Distributed🌍 - Asynchronous🏃 - Light☁️ - Fast⚡️ - Easy😄
AirSpider🕷️, a Light and Fast Python Web Crawler Framework Based on Redis🕷️
Overview👀
- AirSpider is a high-performance asynchronous crawler framework for developers 🚀
- Based on Redis: task distribution, task deduplication, and distributed ☁️
Requirements☁️
- Python 3.6➕
- Works on Linux, Windows, macOS🍎
Features🌲
- Quick to Start ☑️
- Low Coupling ☑️
- High Cohesion ☑️
- Easy Expansion ☑️
- Orderly Workflow ☑️
Installation🔨
---------------------------
# For Linux && MacOS🔥
pip3 install airspider
---------------------------
# For Windows🔥
pip3 install airspider
---------------------------
-
Documents🔥
- Introduction:介绍airspider
- Tutorials:使用airspider快速编写一个稳健的爬虫
- Plugins:编写airspider扩展
Topics
TODO✈️
- Complete Plugins of Redis🔥
- Complete Distributed Architecture☁️
Contributing👬
AirSpider🕷️ is still under Developing🔨
Feel free to open issues💬 and pull requests💗
- Report or Fix bugs🌈
- Build Powerful plugins🔥
- Make documentation Better📖
- Add Examples of Crawling 🕷️
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
AirSpider-2.0.1.tar.gz
(16.6 kB
view hashes)
Built Distribution
Close
Hashes for AirSpider-2.0.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6613ec6bb84876ac9c9c95e264ae9be42790f83cc4c58fdf88be77a4064eb749 |
|
MD5 | 39f2990f9d43fcd7d69461340d2603b6 |
|
BLAKE2b-256 | 4c78b2c390b31a874a5711c649f486f19dca1ee394d297db439d11f4a173ee54 |