Skip to main content

Crawlo: A high-performance asynchronous Python web crawling framework with distributed support.。

Project description

Crawlo Logo

Crawlo

一个基于 asyncio 的现代化、高性能 Python 异步爬虫框架。

快速开始核心特性文档示例

✨ 快速开始(3步上手)

1. 安装

pip install crawlo

2. 创建爬虫

crawlo startproject myproject
cd myproject
crawlo genspider example example.com

3. 运行

crawlo run example

👉 查看5分钟快速上手教程 →


🚀 核心特性

⚡ 高性能异步架构

  • 基于 asyncio + aiohttp,充分利用异步 I/O
  • 智能并发控制,自动优化吞吐量

🛡️ 强大的反反爬能力

  • 智能混合下载器:自动切换协议/浏览器引擎
  • Cloudflare 自动绕过:内置多种绕过策略
  • 隐身浏览器集成:camoufox/playwright/drissionpage
  • 自适应选择器:元素自愈,网站改版自动适配

🤖 AI 集成(MCP Server)

  • Claude/Cursor 直接调用 Crawlo 抓取能力
  • 智能抓取模式:basic/stealth/max-stealth

📊 智能调度系统

  • 优先级队列、自动重试、智能限速
  • 多维度自适应背压系统:实时调控,防止队列溢出

🔄 灵活的配置模式

模式 适用场景 Redis要求
Standalone 单机开发测试 不需要
Distributed 多节点分布式 必需
Auto 智能检测(推荐) 可选

👉 详细了解配置模式 →


📚 文档

🎯 按角色阅读

你是? 推荐阅读
新手 5分钟快速上手安装指南
开发者 配置指南调度指南
运维 配置模式详解检查点系统

📖 完整文档导航

👉 浏览完整文档 →


💡 示例项目

查看 examples/ 目录:

  • 基础示例 - 快速上手
  • 高级示例 - 复杂场景
  • 生产级示例 - 可直接用于生产

👉 查看所有示例 →


🤝 贡献

欢迎提交 Issue 和 Pull Request!

  1. Fork 本仓库
  2. 创建特性分支 (git checkout -b feature/AmazingFeature)
  3. 提交更改 (git commit -m 'Add some AmazingFeature')
  4. 推送到分支 (git push origin feature/AmazingFeature)
  5. 提交 Pull Request

📄 许可证

本项目采用 MIT 许可证 - 查看 LICENSE 文件了解详情。


⭐ 如果这个项目对你有帮助,请给我们一个 Star!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crawlo-1.6.5.tar.gz (755.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crawlo-1.6.5-py3-none-any.whl (1.1 MB view details)

Uploaded Python 3

File details

Details for the file crawlo-1.6.5.tar.gz.

File metadata

  • Download URL: crawlo-1.6.5.tar.gz
  • Upload date:
  • Size: 755.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for crawlo-1.6.5.tar.gz
Algorithm Hash digest
SHA256 715a70d1d4c6e36a2ff02bde6357b41f493d8ff52061e46b80ad8a54ee4c6edc
MD5 e5c350e16beeb4ad12401725279b5d79
BLAKE2b-256 15ae4c79d3894099c72c7416e63a1f64ef35b99604c60c24b526c9948152c322

See more details on using hashes here.

File details

Details for the file crawlo-1.6.5-py3-none-any.whl.

File metadata

  • Download URL: crawlo-1.6.5-py3-none-any.whl
  • Upload date:
  • Size: 1.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for crawlo-1.6.5-py3-none-any.whl
Algorithm Hash digest
SHA256 6f3453bd7fe901e709ec1e4c5686c53cee23bad2be4fb63125a9ff3c72eb6eef
MD5 47cff5bceb73f006fb905668b7d55e51
BLAKE2b-256 a034ed16dd22ca57fe1594b83c745e71d89027308beae2366d4f51a6fb0ea8a9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page