AI Engineering Trending - Scrape GitHub Trending and filter for AI projects

AI Engineering Trending

A CLI tool for discovering trending AI Engineering projects from GitHub.

What is AI Engineering?

AI Engineering focuses on practical engineering applications of AI/LLM technologies, including:

  • LLM SDK & API Clients - OpenAI, Anthropic, Gemini, Mistral SDKs
  • Agent Frameworks - LangChain, LlamaIndex, AutoGPT, CrewAI
  • RAG Tools - Retrieval-augmented generation, document processing
  • Vector Databases - Chroma, Pinecone, Weaviate, Milvus
  • AI Gateways - LiteLLM, OpenRouter, unified API proxies
  • AI Code Assistants - Cursor, Copilot, Claude Code
  • AI Observability - Langfuse, LangSmith, monitoring tools
  • LLM Inference - vLLM, Ollama, inference servers

This tool filters out research papers, datasets, tutorials, and model weights to focus on production-ready projects.

Installation

# Install from PyPI
pip install github-ai-scraper

# Or install from source (run from the root of a cloned repository)
pip install -e ".[dev]"

Windows Quick Start

Double-click install.bat after cloning.

If ai-scraper is not recognized, run:

py -m ai_scraper.cli --help

Quick Start

# Show this week's trending AI Engineering projects
ai-scraper trending

# Today's trending
ai-scraper trending --period daily

# This month's trending
ai-scraper trending --period monthly

# Save as Markdown
ai-scraper trending --save output.md

# Verbose output (shows filtered projects)
ai-scraper trending -v

Commands

Command                                 Description
ai-scraper trending                     Show trending AI Engineering projects
ai-scraper trending --period daily      Today's trending
ai-scraper trending --period monthly    This month's trending
ai-scraper trending --limit 50          Show top 50 projects
ai-scraper trending --save output.md    Save as Markdown
ai-scraper trending -v                  Show filtered projects
ai-scraper config show                  Show current configuration
ai-scraper db stats                     Show database statistics
ai-scraper db clean --vacuum            Optimize database
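
Since the tool stores its data in SQLite, the --vacuum option presumably corresponds to SQLite's VACUUM statement, which rebuilds the database file and reclaims space left behind by deleted rows. That mapping is an assumption, not something confirmed by the project. A minimal sketch of the operation, with a hypothetical vacuum helper and a throwaway demo table:

```python
import os
import sqlite3
import tempfile


def vacuum(db_path):
    """Rebuild the SQLite file and return (size_before, size_after) in bytes."""
    before = os.path.getsize(db_path)
    # isolation_level=None means autocommit; VACUUM cannot run inside a transaction.
    conn = sqlite3.connect(db_path, isolation_level=None)
    try:
        conn.execute("VACUUM")
    finally:
        conn.close()
    return before, os.path.getsize(db_path)


# Demo on a temporary database (hypothetical table name and contents).
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "demo.db")
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE repos (name TEXT)")
    conn.executemany(
        "INSERT INTO repos VALUES (?)", [("x" * 200,) for _ in range(5000)]
    )
    conn.commit()
    conn.execute("DELETE FROM repos")  # frees pages, but the file keeps its size
    conn.commit()
    conn.close()
    before, after = vacuum(path)
    print(f"{before} -> {after} bytes")
```

Deleting rows alone does not shrink the file under SQLite's default settings; the freed pages are only returned to the filesystem when the file is rebuilt.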

Configuration

Create ai-scraper.yaml to customize:

github:
  token: ${GITHUB_TOKEN}  # Optional, for higher rate limits

database:
  path: ./data/ai_scraper.db

trending:
  languages:
    - python
    - typescript
    - javascript
    - go
    - rust
  timeout: 30

summary:
  enabled: false
  provider: anthropic
  api_key: ${ANTHROPIC_API_KEY}
  model: claude-3-5-haiku-20241022
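
The ${GITHUB_TOKEN} and ${ANTHROPIC_API_KEY} placeholders suggest that values are read from environment variables when the file is loaded. How the tool actually resolves them is not documented here, so the following is only a sketch of one way such expansion can work. The expand_env helper is hypothetical, and the dictionary stands in for the parsed YAML:

```python
import os
import re

# Matches ${NAME} placeholders such as ${GITHUB_TOKEN}.
_PLACEHOLDER = re.compile(r"\$\{([A-Za-z_][A-Za-z0-9_]*)\}")


def expand_env(value, env=None):
    """Recursively replace ${NAME} in strings with values from env.

    Unset variables expand to an empty string (an assumption of this sketch).
    """
    env = os.environ if env is None else env
    if isinstance(value, str):
        return _PLACEHOLDER.sub(lambda m: env.get(m.group(1), ""), value)
    if isinstance(value, dict):
        return {k: expand_env(v, env) for k, v in value.items()}
    if isinstance(value, list):
        return [expand_env(v, env) for v in value]
    return value  # booleans, numbers, None pass through unchanged


# Stand-in for part of the parsed configuration file.
config = {
    "github": {"token": "${GITHUB_TOKEN}"},
    "summary": {"enabled": False, "api_key": "${ANTHROPIC_API_KEY}"},
}

# An explicit env mapping keeps the example independent of the real environment.
resolved = expand_env(config, env={"GITHUB_TOKEN": "ghp_example"})
print(resolved["github"]["token"])
```

Keeping secrets in environment variables rather than in the YAML file means ai-scraper.yaml can be committed without exposing tokens.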

AI Chinese Summaries

Enable AI-powered Chinese summaries for repository descriptions:

pip install "github-ai-scraper[ai]"
# Windows (cmd)
set ANTHROPIC_API_KEY=your_api_key
# macOS/Linux
export ANTHROPIC_API_KEY=your_api_key
ai-scraper trending --ai-summary

Project Structure

github-ai-scraper/
├── src/ai_scraper/
│   ├── cli.py              # CLI entry point
│   ├── config.py           # Configuration management
│   ├── classifier.py       # AI Engineering classification
│   ├── scraper/
│   │   └── trending.py     # GitHub Trending scraper
│   ├── output/
│   │   ├── exporter.py     # Markdown exporter
│   │   └── summarizer.py   # AI summary generator
│   ├── models/
│   │   └── repository.py   # Data models
│   └── storage/
│       └── database.py     # SQLite storage
├── tests/                  # Test suite
└── pyproject.toml          # Package metadata

How It Works

  1. Scrape GitHub Trending - Fetches trending repos from GitHub's trending page
  2. Filter Engineering Projects - Removes tutorials, datasets, model weights, etc.
  3. Classify AI Projects - Uses keyword/topic matching to identify AI Engineering projects
  4. Sort by Growth - Orders by star growth rate
  5. Export Results - Outputs to console or Markdown file
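
Steps 2 to 4 above can be sketched roughly as follows. This is an illustration under stated assumptions, not the project's classifier: the Repo fields, the keyword sets, and the rank function are all hypothetical, and the number of stars gained in the period stands in for the star growth rate.

```python
import re
from dataclasses import dataclass, field


@dataclass
class Repo:
    name: str
    description: str
    stars_gained: int  # stars gained during the trending period (assumed metric)
    topics: list = field(default_factory=list)


# Hypothetical keyword sets; the real classifier's lists are not shown in this README.
EXCLUDE_KEYWORDS = {"tutorial", "dataset", "paper", "weights"}
AI_KEYWORDS = {"llm", "agent", "rag", "inference", "vector"}


def tokens(repo):
    """Lowercased word tokens from the name, description, and topics."""
    text = " ".join([repo.name, repo.description, *repo.topics]).lower()
    return set(re.findall(r"[a-z0-9]+", text))


def rank(repos, limit=None):
    """Drop non-engineering repos, keep AI ones, order by stars gained."""
    kept = []
    for repo in repos:
        words = tokens(repo)
        if words & EXCLUDE_KEYWORDS:   # step 2: tutorials, datasets, etc.
            continue
        if not words & AI_KEYWORDS:    # step 3: keyword/topic matching
            continue
        kept.append(repo)
    kept.sort(key=lambda r: r.stars_gained, reverse=True)  # step 4
    return kept[:limit]


repos = [
    Repo("acme/llm-gateway", "Unified API proxy for LLM providers", 420, ["llm"]),
    Repo("acme/rag-tutorial", "A tutorial on RAG", 900, ["rag"]),
    Repo("acme/fast-json", "Fast JSON parser", 300),
    Repo("acme/agent-kit", "Agent framework", 650, ["agent"]),
]
print([r.name for r in rank(repos)])
# → ['acme/agent-kit', 'acme/llm-gateway']
```

Matching on whole word tokens rather than substrings avoids false positives such as "rag" matching inside "storage". It also misses plurals like "tutorials", which a real classifier would need to handle.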

Development

pip install -e ".[dev]"
pytest tests/ -v

License

MIT
