Skip to main content

Async Python 3.6+ web scraping micro-framework based on asyncio.

Project description

Ruia logo

Ruia

🕸️ Async Python 3.6+ web scraping micro-framework based on asyncio.

⚡ Write less, run faster.

travis codecov PyPI - Python Version PyPI Downloads gitter

Overview

Ruia is an async web scraping micro-framework, written with asyncio and aiohttp, aims to make crawling url as convenient as possible.

Write less, run faster:

Features

  • Easy: Declarative programming
  • Fast: Powered by asyncio
  • Extensible: Middlewares and plugins
  • Powerful: JavaScript support

Installation

# For Linux & Mac
pip install -U ruia[uvloop]

# For Windows
pip install -U ruia

# New features
pip install git+https://github.com/howie6879/ruia

Tutorials

  1. Overview
  2. Installation
  3. Define Data Items
  4. Spider Control
  5. Request & Response
  6. Customize Middleware
  7. Write a Plugins

TODO

  • Cache for debug, to decreasing request limitation, ruia-cache
  • Provide an easy way to debug the script, ruia-shell
  • Distributed crawling/scraping

Contribution

Ruia is still under developing, feel free to open issues and pull requests:

  • Report or fix bugs
  • Require or publish plugins
  • Write or fix documentation
  • Add test cases

!!!Notice: We use black to format the code.

Thanks

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ruia-0.8.5.tar.gz (25.4 kB view details)

Uploaded Source

Built Distribution

ruia-0.8.5-py2.py3-none-any.whl (30.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file ruia-0.8.5.tar.gz.

File metadata

  • Download URL: ruia-0.8.5.tar.gz
  • Upload date:
  • Size: 25.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for ruia-0.8.5.tar.gz
Algorithm Hash digest
SHA256 4c94e40e48ba6829dfcc45852d6a99a1187f1c3309d8b6858c080b7844b14a72
MD5 21ee60bb4fdbbf9014d26ff1e9944027
BLAKE2b-256 de1bb3f516402d1e17f01487ae0dc19daa9c9f968beb21eb593b9903623bc5d3

See more details on using hashes here.

File details

Details for the file ruia-0.8.5-py2.py3-none-any.whl.

File metadata

  • Download URL: ruia-0.8.5-py2.py3-none-any.whl
  • Upload date:
  • Size: 30.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.7

File hashes

Hashes for ruia-0.8.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 7da42157fd2a1437a4cd8dd0be1bb24a2aae1e656bdfc132a8cf1b1b7e779b46
MD5 7886b9eacb917c86828675f50decf41c
BLAKE2b-256 12f516d254cf0a6cd1b088e56992b8a2e6f73e19f3b0b8d5024720ecf6789d4b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page