Skip to main content

Asynchronous web crawler built on asyncio

Project description

AIOCrawler

Build Status Coverage Status Codacy Badge PyPI version

Asynchronous web crawler built on asyncio

Installation

pip install pyaiocrawler

Usage

Generating sitemap

from aiocrawler import AIOCrawler

crawler = AIOCrawler('https://www.google.com', depth=3)
sitemap = await crawler.generate_sitemap()

Configuring the crawler

from aiocrawler import AIOCrawler

crawler = AIOCrawler(
    init_url='https://www.google.com',
    depth=3,
    concurrency=300,
    user_agent='My Amazing Crawler'
)

Extending the crawler

WIP

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyaiocrawler-0.1.2.tar.gz (3.2 kB view hashes)

Uploaded Source

Built Distribution

pyaiocrawler-0.1.2-py3-none-any.whl (4.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page