Analyze GitHub repos into structured JSON. No AI, fully deterministic.
Project description
RepoCrunch
Analyze any public GitHub repository into structured JSON. No AI, no LLMs — fully deterministic.
Give it a repo, get back tech stack, dependencies, architecture, health metrics, and security signals in clean, consistent JSON. Use it as a Python library, CLI tool, REST API, or MCP server.
Quick Start
Requires Python 3.11+ and uv.
# Install
git clone https://github.com/kimwwk/repocrunch.git
cd repocrunch
uv venv && uv pip install -e ".[all]"
# Analyze a repo
repocrunch analyze astral-sh/uv --pretty
Or install just what you need:
uv pip install -e "." # Library only (httpx + pydantic)
uv pip install -e ".[cli]" # + CLI
uv pip install -e ".[api]" # + REST API
uv pip install -e ".[mcp]" # + MCP server
uv pip install -e ".[all]" # Everything
Set a GitHub Token (optional)
Without a token you get 60 API calls/hour. With one, 5,000/hour.
export GITHUB_TOKEN=ghp_...
Usage
CLI
repocrunch analyze fastapi/fastapi --pretty # Full analysis, pretty JSON
repocrunch analyze facebook/react -f tech_stack # Single field
repocrunch analyze https://github.com/gin-gonic/gin # Full URL works too
repocrunch serve # Start REST API on :8000
repocrunch mcp # Start MCP server (STDIO)
Python Library
from repocrunch import analyze, analyze_sync
# Async
result = await analyze("fastapi/fastapi")
# Sync
result = analyze_sync("pallets/flask")
print(result.summary.stars)
print(result.tech_stack.framework)
print(result.model_dump_json(indent=2))
REST API
repocrunch serve
# Then:
curl "http://localhost:8000/analyze?repo=fastapi/fastapi" | python -m json.tool
curl "http://localhost:8000/health"
curl "http://localhost:8000/docs" # OpenAPI docs
MCP Server (for Claude, Cursor, etc.)
repocrunch mcp # Starts STDIO transport
Sample Output
$ repocrunch analyze pallets/flask --pretty
{
"schema_version": "1",
"repo": "pallets/flask",
"url": "https://github.com/pallets/flask",
"analyzed_at": "2026-02-08T19:07:31Z",
"summary": {
"stars": 71143,
"forks": 16697,
"watchers": 2092,
"last_commit": "2026-02-06T21:23:01Z",
"age_days": 5787,
"license": "BSD-3-Clause",
"primary_language": "Python",
"languages": { "Python": 99.9, "HTML": 0.1 }
},
"tech_stack": {
"runtime": "Python",
"framework": null,
"package_manager": "pip",
"dependencies": { "direct": 6, "dev": 0 },
"key_deps": ["blinker", "click", "itsdangerous", "jinja2", "markupsafe", "werkzeug"]
},
"architecture": {
"monorepo": false,
"docker": false,
"ci_cd": ["GitHub Actions"],
"test_framework": "pytest",
"has_tests": true
},
"health": {
"open_issues": 2,
"open_prs": 0,
"contributors": 862,
"commit_frequency": "daily",
"maintenance_status": "actively_maintained"
},
"security": {
"has_env_file": false,
"dependabot_enabled": false,
"branch_protection": false,
"security_policy": false
},
"warnings": [
"Branch protection status unknown (requires admin access or authenticated request)"
]
}
Supported Ecosystems
| Language | Manifest Files | Package Manager Detection |
|---|---|---|
| JavaScript / TypeScript | package.json |
npm, yarn, pnpm, bun (from lockfiles) |
| Python | pyproject.toml, requirements.txt |
pip, poetry, uv, pdm, pipenv |
| Rust | Cargo.toml |
cargo |
| Go | go.mod |
go |
| Java / Kotlin | pom.xml, build.gradle, build.gradle.kts |
maven, gradle |
| Ruby | Gemfile |
bundler |
| C / C++ | CMakeLists.txt |
cmake |
Framework detection covers 40+ frameworks across all supported ecosystems (FastAPI, Django, React, Next.js, Spring Boot, Rails, Gin, Actix, and many more).
What It Detects
| Category | Signals |
|---|---|
| Summary | Stars, forks, watchers, age, license, languages |
| Tech Stack | Runtime, framework, package manager, direct/dev dependency count, key deps |
| Architecture | Monorepo, Docker, CI/CD platform, test framework |
| Health | Commit frequency (daily/weekly/monthly/sporadic/inactive), maintenance status, contributors, open issues |
| Security | .env file committed, Dependabot enabled, branch protection, SECURITY.md present |
Roadmap
Not yet implemented, but planned:
- Secrets regex scanning — detect leaked API keys, tokens, passwords in the file tree
- Architecture type classification — library vs. application vs. framework
- API rate limiting — per-key throttling for the REST API
- Private repo support — authenticated analysis of private repositories
- npm/npx package —
npx repocrunch analyze owner/repo - Vulnerability scanning — known CVE detection in dependencies
- Comparison mode — side-by-side analysis of multiple repos
- Historical tracking — track how a repo's health changes over time
- PyPI / npm publishing —
pip install repocrunch/npm install repocrunch - Platform deployments — Apify Store, Smithery, mcp.so, RapidAPI
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file repocrunch-0.1.0.tar.gz.
File metadata
- Download URL: repocrunch-0.1.0.tar.gz
- Upload date:
- Size: 113.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5eddae33201b99970cd350273b6eaf3aae403662688a045475b047e7751cff0d
|
|
| MD5 |
efbe13de55b34c36d676026617679db6
|
|
| BLAKE2b-256 |
ccdbdc07dd3e4ccf68381fa629ca50468e742b92463a07d3fb10321984852d13
|
Provenance
The following attestation bundles were made for repocrunch-0.1.0.tar.gz:
Publisher:
publish.yml on kimwwk/repocrunch
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
repocrunch-0.1.0.tar.gz -
Subject digest:
5eddae33201b99970cd350273b6eaf3aae403662688a045475b047e7751cff0d - Sigstore transparency entry: 929011002
- Sigstore integration time:
-
Permalink:
kimwwk/repocrunch@b6f2ddc127d101e2eb382a274d0d4fc1ba67e823 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/kimwwk
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b6f2ddc127d101e2eb382a274d0d4fc1ba67e823 -
Trigger Event:
release
-
Statement type:
File details
Details for the file repocrunch-0.1.0-py3-none-any.whl.
File metadata
- Download URL: repocrunch-0.1.0-py3-none-any.whl
- Upload date:
- Size: 26.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cbf4f02d6137e55279cf023a71915803867ae17ce7edfac05233b01c1184a62b
|
|
| MD5 |
714c003d38a2aa5fa8237a296a097048
|
|
| BLAKE2b-256 |
ac0d2860cd9af1f368968404af4ebcba9da71b3f0116e03d88a7493b9262fb25
|
Provenance
The following attestation bundles were made for repocrunch-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on kimwwk/repocrunch
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
repocrunch-0.1.0-py3-none-any.whl -
Subject digest:
cbf4f02d6137e55279cf023a71915803867ae17ce7edfac05233b01c1184a62b - Sigstore transparency entry: 929011007
- Sigstore integration time:
-
Permalink:
kimwwk/repocrunch@b6f2ddc127d101e2eb382a274d0d4fc1ba67e823 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/kimwwk
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b6f2ddc127d101e2eb382a274d0d4fc1ba67e823 -
Trigger Event:
release
-
Statement type: