AI Engineering Trending - 爬取 GitHub Trending,过滤 AI 项目
Project description
AI Engineering Trending
English | 简体中文
A CLI tool for discovering trending AI Engineering projects from GitHub.
What is AI Engineering?
AI Engineering focuses on practical engineering applications of AI/LLM technologies, including:
- LLM SDK & API Clients - OpenAI, Anthropic, Gemini, Mistral SDKs
- Agent Frameworks - LangChain, LlamaIndex, AutoGPT, CrewAI
- RAG Tools - Retrieval-augmented generation, document processing
- Vector Databases - Chroma, Pinecone, Weaviate, Milvus
- AI Gateways - LiteLLM, OpenRouter, unified API proxies
- AI Code Assistants - Cursor, Copilot, Claude Code
- AI Observability - Langfuse, LangSmith, monitoring tools
- LLM Inference - vLLM, Ollama, inference servers
This tool filters out research papers, datasets, tutorials, and model weights to focus on production-ready projects.
Installation
# Install from PyPI
pip install github-ai-scraper
# Or install from source
pip install -e ".[dev]"
Windows Quick Start
Double-click install.bat after cloning.
If ai-scraper is not recognized, run:
py -m ai_scraper.cli --help
Quick Start
# Show this week's trending AI Engineering projects
ai-scraper trending
# Today's trending
ai-scraper trending --period daily
# This month's trending
ai-scraper trending --period monthly
# Save as Markdown
ai-scraper trending --save output.md
# Show verbose (filtered projects)
ai-scraper trending -v
Commands
| Command | Description |
|---|---|
ai-scraper trending |
Show trending AI Engineering projects |
ai-scraper trending --period daily |
Today's trending |
ai-scraper trending --period monthly |
This month's trending |
ai-scraper trending --limit 50 |
Show top 50 projects |
ai-scraper trending --save output.md |
Save as Markdown |
ai-scraper trending -v |
Show filtered projects |
ai-scraper config show |
Show current configuration |
ai-scraper db stats |
Show database statistics |
ai-scraper db clean --vacuum |
Optimize database |
Configuration
Create ai-scraper.yaml to customize:
github:
token: ${GITHUB_TOKEN} # Optional, for higher rate limits
database:
path: ./data/ai_scraper.db
trending:
languages:
- python
- typescript
- javascript
- go
- rust
timeout: 30
summary:
enabled: false
provider: anthropic
api_key: ${ANTHROPIC_API_KEY}
model: claude-3-5-haiku-20241022
AI Chinese Summaries
Enable AI-powered Chinese summaries for repository descriptions:
pip install "github-ai-scraper[ai]"
set ANTHROPIC_API_KEY=your_api_key
ai-scraper trending --ai-summary
Project Structure
github-ai-scraper/
├── src/ai_scraper/
│ ├── cli.py # CLI entry point
│ ├── config.py # Configuration management
│ ├── classifier.py # AI Engineering classification
│ ├── scraper/
│ │ └── trending.py # GitHub Trending scraper
│ ├── output/
│ │ ├── exporter.py # Markdown exporter
│ │ └── summarizer.py # AI summary generator
│ ├── models/
│ │ └── repository.py # Data models
│ └── storage/
│ └── database.py # SQLite storage
├── tests/ # Test suite
└── pyproject.toml # Package metadata
How It Works
- Scrape GitHub Trending - Fetches trending repos from GitHub's trending page
- Filter Engineering Projects - Removes tutorials, datasets, model weights, etc.
- Classify AI Projects - Uses keyword/topic matching to identify AI Engineering projects
- Sort by Growth - Orders by star growth rate
- Export Results - Outputs to console or Markdown file
Development
pip install -e ".[dev]"
pytest tests/ -v
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file github_ai_scraper-0.2.0.tar.gz.
File metadata
- Download URL: github_ai_scraper-0.2.0.tar.gz
- Upload date:
- Size: 237.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d9ca27082c905d77365f3fcb00b359d4048ad18b4ed28b30e882e345d1520091
|
|
| MD5 |
013cb01dfc3ee3bbdb73e28e1c15f05b
|
|
| BLAKE2b-256 |
e8c50e575d28be7a5e803218cb603ea7b17448c7becad0310738c9fd1e708f40
|
File details
Details for the file github_ai_scraper-0.2.0-py3-none-any.whl.
File metadata
- Download URL: github_ai_scraper-0.2.0-py3-none-any.whl
- Upload date:
- Size: 26.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
44cf96c8b100674715a7c96eea546921ddb74439843cac4e71f3cccef913274c
|
|
| MD5 |
5517f468bcce571b0bdafc9515551425
|
|
| BLAKE2b-256 |
c446ff43d5b0c8091514f7ddc81e64e068f9dbc16968376df612ea979255b89b
|