Persistent structural context and ultra-fast repeated analysis for AI coding agents

These details have not been verified by PyPI

Project description

sourcecode

Persistent structural context and ultra-fast repeated analysis for AI coding agents.

Version Python

The problem

Every time an AI coding agent starts a new session, it has to re-parse the repository from scratch. For a large Java or TypeScript monolith, that means 5–15 seconds per invocation. Multiply by dozens of agent turns per hour, and repo context acquisition becomes a real bottleneck — not just latency, but tokens, compute, and iteration velocity.

sourcecode solves this with a persistent structural cache keyed on file content hashes. After the first scan, every subsequent invocation returns pre-built context in milliseconds. The repo doesn't change? The cache doesn't expire.

The cache is not a performance optimization. It is what makes sourcecode usable as infrastructure rather than a one-off tool.

Cache performance — measured on real repos

Repo	Size	Cold scan	Cache hit	Speedup
Keycloak	7,885 Java files	10.5s	0.6s	~17x
BroadleafCommerce	2,985 Java files	2.7s	0.3s	~9x

Cache keyed on content hashes — invalidated only when source changes. On repeated agent sessions against the same codebase, nearly every invocation is a cache hit.

Token output (measured):

Mode	BroadleafCommerce	Keycloak
`--compact`	~2,900	~4,000
`--agent`	~4,800	~5,500
`onboard`	~2,600	n/a
`fix-bug` (trimmed)	~27,000	~4,600

What changes at 0.3s vs 2.7s

At 2.7s per call, you use sourcecode to occasionally inspect a repo.

At 0.3s per call, you use sourcecode as constant infrastructure inside agent loops:

agent loop iteration:
  1. sourcecode . --compact          # 0.3s — instant structural context
  2. sourcecode impact PaymentService . --depth 1   # 0.4s — blast radius check
  3. agent makes targeted change
  4. repeat

Sub-second context retrieval changes the cost model for agent workflows. You can call sourcecode before every edit, before every PR review, before every test run — without batching or caching calls manually.

Installation

Homebrew (macOS / Linux)

brew tap haroundominique/sourcecode
brew install sourcecode

pip / pipx

pip install sourcecode
# or with isolation:
pipx install sourcecode

Verify

sourcecode version
# sourcecode 1.53.0

Quickstart

# High-signal summary — warm cache: ~0.3s, cold: 2–10s depending on repo size
sourcecode --compact

# Add git hotspots and uncommitted file count
sourcecode --compact --git-context

# Structured output for AI agents — bounded, noise-free, ready to inject
sourcecode --agent

# Blast radius: what breaks if this class changes?
sourcecode impact OrderService /path/to/repo

# Spring Boot 2→3 migration readiness: javax→jakarta blockers, removed APIs
sourcecode migrate-check /path/to/repo

# Spring semantic audit: TX anomalies + security surface (free)
sourcecode spring-audit /path/to/repo

# Impact chain: systemic blast radius with TX/SEC enrichment (free)
sourcecode impact-chain OrderService /path/to/repo

# Event topology: publisher → event → consumer graph (free)
sourcecode impact-chain OrderPlacedEvent /path/to/repo --type events

# REST endpoint surface
sourcecode endpoints /path/to/repo

# Request-body validation per endpoint: constraints + custom validators (free)
sourcecode validation /path/to/repo

# Onboard to an unfamiliar codebase
sourcecode onboard /path/to/repo

# PR review: risk, test gaps, changed modules
sourcecode review-pr /path/to/repo --since main

# Bug triage: risk-ranked files by symptom
sourcecode fix-bug /path/to/repo --symptom "NullPointerException in checkout"

Cache system

sourcecode maintains a persistent cache at .sourcecode-cache/ inside each repository. Two layers:

L1 (core): analysis result keyed by (git_sha, analysis_flags). Survives format changes — you can regenerate --compact vs --agent views from the same core.
L2 (view): rendered output keyed by (core_hash, view_flags). Exact output match — no recomputation.

Lookup order: L2 exact hit → L1 hit + view rebuild → full cold scan

Cache invalidation: Keyed on git commit SHA. Any commit invalidates the core cache for that repo. Uncommitted changes are not cached.

# Inspect cache state
sourcecode cache status

# Warm the cache ahead of an agent session
sourcecode cache warm

# Clear cache
sourcecode cache clear

# Check RIS freshness relative to current git HEAD
sourcecode cache freshness

--no-cache bypasses both layers and forces a fresh scan. Use in CI or when you need to verify a fresh result.

Visibility: Cache hits are silent. Use sourcecode cache status to see cache size, hit keys, and last-warmed timestamp.

Agent workflow patterns

Start of session — structural grounding

# Inject as first message to agent (bounded, deterministic)
sourcecode /repo --compact              # ~2,500–4,000 tokens
sourcecode /repo --agent               # ~4,500–5,500 tokens — more detail
sourcecode onboard /repo               # task-structured: entry points, key files, gaps

Before every change — blast radius + TX/SEC check

# Always target the INTERFACE in Spring projects, not the implementation:
sourcecode impact OrderService /repo           # ✓ 30 callers, 11 endpoints
sourcecode impact OrderServiceImpl /repo       # ✗ 0 callers (Spring DI blindness)

# Impact chain: blast radius enriched with TX boundary and security surfaces
sourcecode impact-chain OrderService /repo

# Event topology: who publishes/consumes this event, and in what TX phase?
sourcecode impact-chain OrderPlacedEvent /repo --type events

# Spring audit: catch TX anomalies before they hit production
sourcecode spring-audit /repo --scope tx

Continuous agent loop — delta context

# Only changed files + their transitive importers — minimal token cost:
sourcecode prepare-context delta /repo --since HEAD~1
sourcecode . --changed-only --git-context

PR review — structured risk signal

# JSON for programmatic use:
sourcecode review-pr /repo --since main --output review.json
jq '.ci_decision' review.json    # "analysis_success" | "git_ref_error"

# Markdown for GitHub comment:
sourcecode review-pr /repo --since main --format github-comment

Bug triage — symptom-driven

# Specific symptoms produce the best signal:
sourcecode fix-bug /repo --symptom "OIDC token refresh fails after realm update"
sourcecode fix-bug /repo --symptom "NullPointerException in OrderService during checkout"

# Generic symptoms produce noisy output — be specific.
sourcecode fix-bug /repo --symptom "payment timeout" --output triage.json

In CI — cached, deterministic, fast

# Content-hash cached — safe to run on every commit; cold only when code changes
sourcecode /repo --compact --output context.json

# PR gate
sourcecode review-pr /repo --since $BASE_REF --output review.json
DECISION=$(jq -r '.ci_decision' review.json)
if [ "$DECISION" != "analysis_success" ]; then echo "Review failed: $DECISION"; fi

What sourcecode does (and doesn't)

sourcecode reduces exploration cost. It accelerates context acquisition and minimizes repeated repo parsing. It does not replace reading code — it reduces how often an agent needs to.

Specifically:

Extracts structural signals: entry points, Spring roles, REST surfaces, dependency graphs, transactional boundaries
Builds and caches these on first scan; serves from cache on subsequent calls
Produces bounded, noise-free JSON designed for direct injection into agent context windows
Computes blast radius (impact graph) from a class or interface, traversing reverse dependencies

What it does NOT do:

No runtime analysis — all signals are static (annotation, import graph, file structure)
No semantic code understanding — reads structure, not logic
No replacement for reading code — reduces how often that's needed, not whether
Architecture pattern detection best for Spring MVC layered apps; SPI/plugin architectures (e.g. Quarkus extension model) may be misclassified
Endpoint recall for JAX-RS subresource locator pattern is ~65%
impact on implementation classes (e.g. OrderServiceImpl) returns 0 callers in Spring Boot — callers inject the interface via @Autowired. Always target the interface. When direct_callers: [] with confidence_level: high for a @Service class, re-query the interface.
no_security_signal on endpoints means no recognized method-level annotation found — does not mean the endpoint is unsecured. Projects using Spring Security filter chains show 100% no_security_signal even when fully secured. Projects using a custom authorization annotation can teach the scanner via sourcecode.config.json.
spring-audit and impact-chain are Java/Spring only — non-Java repos return spring_detected: false
Event topology via --type events does not resolve Kafka/RabbitMQ/Redis message routes — only Spring ApplicationEvent and @EventListener chains
Self-invocation TX bypass (calling @Transactional method from the same class without going through the proxy) is not detected

Pricing

🎉 Early-adoption: Pro is currently unlocked for everyone. During this phase every install runs with full Pro entitlements — no size gate, no key required. The tiers below describe the model the paywall will return to later.

Two tiers. Gating is by repo size and automation — never by command. Every command runs at full power on Free for small and mid-size repos. You upgrade when the work gets bigger or automated.

	Free — €0	Pro — €19/mo · €190/yr per dev
Repo size	≤ 500 Java source files	> 500 Java files (enterprise monoliths)
Commands	All of them, full output	Same commands, unlocked at scale
`impact` / `fix-bug` / `review-pr` / `modernize`	✅ full on small repos	✅ full on large repos (Free gets a capped preview)
`--full`, git-churn ranking, uncapped graph/semantic	✅ on small repos	✅ on large repos
`prepare-context delta`	30 free runs/repo	unlimited — CI/CD automation
`prepare-context generate-tests`	small repos	large repos
MCP local server (`mcp serve`)	✅	✅
Offline, no data egress, no account	✅	✅

Non-Java repos are free at any size — the size limit counts Java source files only, by design. sourcecode monetises enterprise Java monoliths.

sourcecode activate <key>      # activate a license key

Full breakdown: docs/PRODUCT_TIERS.md.

Command reference

`--compact` and `--agent`

Core flags. Feed directly to AI agents as first-message context.

Flag	Output	Tokens
`--compact`	High-signal summary: stacks, entry points, dependencies, confidence, gaps	~2,500–4,000
`--agent`	Structured JSON: identity, entry points, architecture, event flows	~4,500–5,500

`impact` — blast-radius analysis [free ≤500 Java files · Pro above]

sourcecode impact ClassName /path/to/repo
sourcecode impact org.example.OrderService /path/to/repo   # FQN also accepted
sourcecode impact OrderService . --depth 2                 # limit BFS depth

Field	Description
`direct_callers`	Classes that directly import or inject the target
`indirect_callers`	Transitive callers up to `--depth` (default: 4)
`endpoints_affected`	HTTP endpoints whose call chain includes the target
`transactional_boundaries_touched`	`@Transactional` classes in the blast cone
`mappers_affected`	`@Repository` / `@Mapper` / DAO classes in the blast cone
`security_surface_affected`	Security policies on affected endpoints
`cross_module_impact`	Subsystems touched, ordered by affected symbol count
`risk_score`	0–100 quantified change risk
`confidence_score`	0–1 confidence in the analysis
`explanation`	Human-readable risk summary
`candidates`	On partial match: up to 10 FQNs ranked by relevance

Best practices:

Target interfaces, not implementations: impact OrderService > impact OrderServiceImpl
Use --depth 1 when target has 200+ callers — direct endpoints are already the most actionable signal
Second impact run on the same repo is significantly faster (cache applies to underlying IR scan)

`endpoints` — REST API surface

sourcecode endpoints /path/to/repo
sourcecode endpoints /path/to/repo --output endpoints.json
sourcecode endpoints /path/to/repo --by-controller

Extracts all Spring MVC (@GetMapping, @PostMapping, @RequestMapping, etc.) and JAX-RS (@GET, @POST, @Path) endpoint methods. Returns HTTP method, path, controller class, and handler method. Each endpoint also carries its return_type. --by-controller groups the surface per controller ({by_controller, controller_count, total}) for an API-surface view.

Functional / WebFlux routing (honest limitation). Routes registered via the functional DSL — route().GET("/path", handler) / RouterFunction / CustomEndpoint, common in reactive Spring apps — are not modeled (their real paths depend on nest()/group-version prefixes that can't be resolved statically). Rather than emit partial paths that would mislead, the output reports a functional_routing block (files, route_registrations, modeled: false) plus a warning. When the annotation surface is empty but functional routes exist, the warning explicitly tells you not to read it as "no endpoints". Annotation-based (MVC/JAX-RS) repos are unaffected.

Custom security annotations. Enterprise repos often guard endpoints with a bespoke annotation instead of @PreAuthorize/@Secured. Drop a sourcecode.config.json at the repo root to teach the scanner about it — otherwise those endpoints report policy: "none_detected":

{
  "customSecurityAnnotations": [
    {
      "fullyQualifiedName": "com.example.security.M3FiltroSeguridad",
      "shortName": "M3FiltroSeguridad",
      "resourceParam": "nombreRecurso",
      "levelParam": "nivelRequerido"
    }
  ]
}

Matching endpoints then report policy: "custom" with annotation, resourceName, and requiredLevel, and are no longer counted in no_security_signal. Repos without the config behave exactly as before.

`export` — architecture views for downstream tooling

sourcecode export /path/to/repo --by-directory      # code map, path:line refs
sourcecode export /path/to/repo --module-graph       # module→module dependencies
sourcecode export /path/to/repo --integrations       # outbound HTTP/LDAP/JMS clients
sourcecode export /path/to/repo --c4                  # unified architecture + manifest

Emits structured, tool-agnostic codebase views as plain JSON/YAML — the kind of input an architecture-doc generator, diagram renderer, or code-search agent can consume directly instead of walking the tree file by file. Section labels map to the open C4 model (an open architecture notation, not a product); the schema is vendor-neutral.

Flag	Output
`--by-directory`	One group per source directory, each symbol with a `source_file:line` reference.
`--module-graph`	`{nodes, edges, summary}` — directories as modules, inter-module dependencies rolled up from class-level relation edges with hit counts + edge types.
`--integrations`	Outbound integrations (`RestTemplate`, `WebClient`, `@FeignClient`, `LdapTemplate`, `JmsTemplate`, ActiveMQ) with `file:line` evidence and a literal `target` URL/name when present.
`--c4`	Unified document: `c4.{context, containers, components, code}` + `api_surface` + a `manifest` with per-directory content hashes for incremental consumers (skip directories whose hash is unchanged). `components.module_roots` rolls leaf source dirs up to architectural module roots and classifies each `layered` (DDD: ≥2 of `domain`/`application`/`infrastructure`) vs `flat` (legacy/flat package), with a verifiable `module_count` — so a consumer enumerates real modules instead of inferring boundaries from leaf directories.

The section flags compose (pass several for one multi-section document); --c4 assembles the full export on its own. URLs assembled at runtime yield target: null (honest absence, never a guess); containers are derived from build files (Maven/Gradle) and reported as a limitation when none are found.

`spring-audit` — Spring semantic audit [free]

sourcecode spring-audit /path/to/repo
sourcecode spring-audit /path/to/repo --scope tx           # TX anomalies only
sourcecode spring-audit /path/to/repo --scope security     # security surface only
sourcecode spring-audit /path/to/repo --min-severity high

# CI/CD gate: exit 1 on any finding
sourcecode spring-audit . --ci
sourcecode spring-audit . --ci --min-severity high         # exit 1 only on high/critical
sourcecode spring-audit . --ci --format github-comment     # Markdown output + exit 1

Detects structural Spring anomalies that survive code review and tests, but cause production failures:

Pattern	Description
`TX-001`	`@Transactional` on private/final method — CGLIB proxy bypass, TX silently ignored
`TX-002`	`REQUIRES_NEW` nested inside `REQUIRED` call chain — unexpected transaction nesting
`TX-003`	`readOnly=true` boundary propagating to write operation
`TX-004`	`NOT_SUPPORTED`/`NEVER` called within active TX chain
`TX-005`	Exception swallowing inside `@Transactional` — silent TX rollback suppression
`SEC-001`	Unsecured endpoint in annotation-based security model
`SEC-002`	CVE-2025-41248: `@PreAuthorize` on inherited method from generic supertype
`SEC-003`	`@Transactional` on `@Controller`/`@RestController` — TX in wrong layer

Returns structured findings with severity, confidence, symbol, source_file, evidence, explanation, and fix_hint. JAVA/SPRING ONLY.

Endpoints guarded by a project-specific authorization annotation are treated as secured (not flagged SEC-001) once declared in sourcecode.config.json.

`impact-chain` — systemic blast radius with TX/SEC enrichment [free]

sourcecode impact-chain OrderService /path/to/repo
sourcecode impact-chain com.example.OrderService#placeOrder /path/to/repo
sourcecode impact-chain PaymentService . --depth 6

Unlike impact (which traces the caller graph), impact-chain builds on the SpringSemanticModel to enrich every step of the blast cone with transaction and security context:

Field	Description
`direct_callers`	Symbols that directly call the target
`indirect_callers`	Transitive callers (BFS up to `--depth` hops, default: 4)
`endpoints_affected`	HTTP endpoints reachable through the call chain
`transaction_boundary`	`@Transactional` semantics on the target: propagation, isolation, readOnly
`security_surfaces`	Per-endpoint security policy + SEC finding IDs
`impact_findings`	TX-001..005 and SEC-001..003 findings that touch the call chain
`risk_level`	`critical` \| `high` \| `medium` \| `low`
`confidence`	`high` \| `medium` \| `low` — `low` on a detected blind spot, `medium` on partial resolution or capped traversal. Informational interface↔impl expansion notices do not lower it, so a clean resolved query stays `high`.
`metadata.blind_spots`	`framework_di` and/or `value_type` when an empty result is unmodeled-edge driven, not real dead code (CH-007 drops `framework_di` once it recovers the wiring callers)
`metadata.external_iface_callers_recovered` / `external_iface_binding_ambiguous`	CH-007 — count of in-repo wiring callers recovered through an external interface, and whether the impl→bean binding is ambiguous (multiple in-repo implementors)

Framework/DI blind spot (CH-005). An empty blast radius is ambiguous: genuinely unused, or invoked through an edge the static graph does not model. When the target class implements/extends an external framework type (e.g. Spring Security's RedirectStrategy, a servlet Filter) it is typically wired by framework DI/config and invoked polymorphically — no in-repo edge names its methods, so direct_callers is 0. Rather than report that as risk:low at high confidence (a dangerous false negative that reads as "safe to change"), impact-chain detects the external supertype, drops confidence to low, lists it in metadata.external_supertypes, and emits a CH-005 warning telling you to search the DI/security/config wiring for the supertype. Inert markers (Serializable, Cloneable) and concrete JDK base classes extended for reuse (ArrayList, InputStream, …) are excluded.

External-interface DI caller recovery (CH-007). When the target is wired through an external interface, the consumers that inject that interface never name the target, so the static caller graph misses them — but the wiring sites are still in-repo. impact-chain reads the dependency edges to recover the in-repo classes that inject/use the external supertype and attributes them as callers (so their endpoints map too). If exactly one in-repo class implements the interface the binding is unambiguous (confidence:medium); if several do, metadata.external_iface_binding_ambiguous is true and confidence stays low. metadata.external_iface_callers_recovered reports the count. Recovered callers reach the target only if it is the bean configured for that interface (which may also have framework/third-party implementations) — the warning says so. Validated on BroadleafCommerce: querying a UserDetailsService implementation recovers the security-config and login-service classes that wire it.

Caller precision (CH-006). implements/extends are structural type declarations, not calls — so they are excluded from the caller graph. Querying a class that implements a high-fanout interface (e.g. a 40-implementor CustomEndpoint or a shared Mapper<E,D> base) does not report its sibling implementors as callers; only real injects/calls edges count. This prevents a leaf class from being inflated to a large false blast radius.

Event topology — query the publisher/consumer graph for a Spring event class:

sourcecode impact-chain OrderPlacedEvent /path/to/repo --type events

Field	Description
`publishers`	FQNs that publish this event class
`consumers`	Listeners with TX phase metadata (`AFTER_COMMIT`, `BEFORE_COMMIT`, etc.)
`event_graph`	Publisher → event → consumer edges (BFS ≤ 2 hops)
`transaction_context`	`AFTER_COMMIT` consumers, `BEFORE_COMMIT` risks
`risk_level`	Derived from TX phase and consumer count

Limitations of event topology:

Resolves Spring ApplicationEvent / @EventListener chains only
Does not trace Kafka, RabbitMQ, Redis, or other message brokers
Does not detect self-invocation proxy bypass
Conditional beans (@ConditionalOnProperty) are not evaluated at analysis time

`cold-start` — RIS bootstrap context

sourcecode cold-start /path/to/repo
sourcecode cold-start /path/to/repo --compact   # ~10K token subset

Returns the Repository Intelligence Snapshot (RIS) instantly — zero re-analysis. The RIS is built by a prior warm cache pass and includes stacks, entry points, endpoint surface, and Spring semantic signals. Status field: cold_start_ready | cold_start_stale | no_ris.

Use --compact to get a ~10K token subset safe for direct LLM injection. Full snapshot ranges from ~100K–200K tokens on medium repos — use --output FILE for local search tooling.

`repo-ir` — symbol-level IR

sourcecode repo-ir /path/to/repo --summary-only                  # ~20K tokens
sourcecode repo-ir /path/to/repo --since HEAD~1                   # symbol-level diff
sourcecode repo-ir /path/to/repo --files src/.../OrderService.java
sourcecode repo-ir /path/to/repo --max-nodes 200 --max-edges 500  # limit graph size
sourcecode repo-ir /path/to/repo --output ir.json.gz --gzip       # compressed output (~70-80% smaller)
sourcecode repo-ir /path/to/repo --include-tests                   # include test files

Builds a deterministic symbol graph: classes, methods, import/injection edges, Spring roles, subsystems.

Size control flags:

Flag	Description
`--summary-only`	Omit full graph nodes/edges; keep analysis summary, impact, and change_set (<300KB typical)
`--max-nodes N`	Keep top N nodes by impact score
`--max-edges N`	Keep top N edges (priority: edges between kept nodes)
`--gzip`	Compress output with gzip. Requires `--output`. ~70–80% smaller.
`--force`	Bypass the 50K-token size guard and emit output anyway
`--include-tests`	Include test source files (excluded by default)

Size warning: Without --summary-only, output can exceed 1MB for mid-size repos. Always use --summary-only unless you need the full graph for downstream tooling.

`explain` — architectural summary for a class

sourcecode explain UserService
sourcecode explain OrderController /path/to/repo
sourcecode explain UserService --format json

Human-readable architectural summary derived entirely from static analysis: Spring stereotype, public methods, incoming callers, outgoing dependencies, events published/consumed, @Transactional boundaries, security constraints, and related REST endpoints. JAVA/SPRING ONLY.

`pr-impact` — PR blast-radius report

sourcecode pr-impact --files changed_files.txt
sourcecode pr-impact /path/to/repo --files diff.txt --format json

Takes a file listing changed Java files (one path per line) and produces a consolidated report: modified classes, affected REST endpoints reachable through the call chain, direct callers of each changed class, event publishers/consumers triggered, @Transactional methods in changed classes, and a consolidated risk level (CRITICAL / HIGH / MEDIUM / LOW). JAVA/SPRING ONLY.

# Typical CI usage: pipe git diff to a file, then run
git diff --name-only main | grep '\.java$' > changed.txt
sourcecode pr-impact . --files changed.txt --format json

`onboard` — codebase orientation

sourcecode onboard /path/to/repo

Entry points, architecture summary, key files, confidence level, and gaps. Designed to be injected as agent context at the start of a session.

`review-pr` — PR review context [free ≤500 Java files · Pro above]

sourcecode review-pr /path/to/repo --since main
sourcecode review-pr /path/to/repo --since HEAD~3

Changed files, risk ranking, test coverage gaps, affected modules, and blast radius of changed classes. Returns a ci_decision field for CI/CD integration.

`fix-bug` — Bug triage context [free ≤500 Java files · Pro above]

sourcecode fix-bug /path/to/repo --symptom "NullPointerException in checkout"

Risk-ranked file list correlated to the symptom: keyword extraction, path matching, content matching, git commit correlation.

`modernize` — Modernization planning [free ≤500 Java files · Pro above]

sourcecode modernize /path/to/repo

High-coupling nodes (high fan-in = risky to change), dead zone candidates (isolated symbols), subsystem tangles.

`migrate-check` — Spring Boot 2→3 migration readiness

sourcecode migrate-check /path/to/repo
sourcecode migrate-check . --min-severity high
sourcecode migrate-check . --format text
sourcecode migrate-check . --output migration.json

Detects migration blockers across Java source files, Spring XML config files, and Maven/Gradle build files. 27 rules organized by target:

Jakarta namespace (MIG-001..009) — javax→jakarta

Rule	Severity	Pattern
`MIG-001`	critical	`javax.persistence` import — JPA will not compile
`MIG-002`	high	`javax.servlet` import — Servlet API changed
`MIG-003`	high	`javax.validation` import — Bean Validation changed
`MIG-004`	high	`javax.transaction` import — TX API changed
`MIG-006`	medium	`javax.annotation` import — CDI annotations changed
`MIG-007`	medium	`javax.inject` import — DI annotations changed
`MIG-008`	medium	`javax.ws.rs` import — JAX-RS changed
`MIG-009`	medium	`javax.jms` import — JMS API changed

Spring Security 6 (MIG-005, MIG-019, MIG-020)

Rule	Severity	Pattern
`MIG-005`	high	`extends WebSecurityConfigurerAdapter` — removed in Spring Security 6
`MIG-019`	high	SpringFox / `@EnableSwagger2` — incompatible with Spring Boot 3
`MIG-020`	high	`antMatchers()` / `authorizeRequests()` — replaced in Spring Security 6

Java version compatibility (MIG-010..025)

Rule	Severity	Pattern
`MIG-010`	critical	`SecurityManager` / `AccessController` — removed in Java 17 (JEP 411)
`MIG-011`	high	`sun.` / `com.sun.net.` internal API imports — strong encapsulation since Java 9
`MIG-012`	high	Nashorn `ScriptEngine` — removed in Java 15
`MIG-013`	high	`sun.misc.Unsafe` — requires `--add-opens` on Java 9+
`MIG-014`	medium	`setAccessible(true)` — may throw `InaccessibleObjectException` on Java 17+
`MIG-015`	medium	`finalize()` override — deprecated for removal since Java 18
`MIG-016`	low	`java.util.Date` / `Calendar` / `SimpleDateFormat` — use `java.time`
`MIG-021`	high	`javax.xml.bind` (JAXB) — removed from JDK in Java 11
`MIG-022`	high	`javax.xml.ws` (JAX-WS) — removed from JDK in Java 11
`MIG-023`	critical	`org.omg.*` / CORBA APIs — removed from JDK in Java 11
`MIG-024`	medium	`Thread.stop()` / `Thread.suspend()` / `Thread.resume()` — deprecated for removal
`MIG-025`	medium	`ReflectionFactory` / `MethodHandles.privateLookupIn` — JPMS deep-reflection risk

Spring XML config (MIG-030..032)

Rule	Severity	Pattern
`MIG-030`	high	`javax.*` class reference in Spring XML bean definitions
`MIG-031`	high	`<http auto-config>` or versioned spring-security ≤5 schema in XML
`MIG-032`	high	`web.xml` with Servlet ≤4 namespace — must migrate to `jakarta.ee`

Build file dependencies (MIG-040..043)

Rule	Severity	Pattern
`MIG-040`	high	`io.springfox` dependency — incompatible with Spring Boot 3
`MIG-041`	high	Hibernate 5.x explicitly pinned — Spring Boot 3 requires Hibernate 6
`MIG-042`	medium	ByteBuddy < 1.12.x — may not support Java 17+ strong encapsulation
`MIG-043`	high	EhCache 2.x (`net.sf.ehcache`) — incompatible with Spring Boot 3

Each finding includes severity, title, source_file, first_line, explanation, fix_hint, migration_target, and openrewrite_recipe (when an automated recipe exists).

`rename-class` — Java class rename

sourcecode rename-class . --from ServiceA --to ServiceB
sourcecode rename-class /path/to/repo --from OrderManager --to OrderService
sourcecode rename-class . --from OldName --to NewName --dry-run
sourcecode rename-class . --from OldName --to NewName --no-tests   # src/main only

Renames a Java class safely throughout the repository: declaration, constructor, all import statements, type references (fields, params, return types), extends/implements, generics, casts, and Spring @Qualifier names. Renames the physical .java file. Emits a structured change audit trail (file, before_lines, after_lines, intent, diff).

Use --dry-run to preview changes without writing to disk.

`chunk-file` — split large Java files for agent consumption

sourcecode chunk-file BigService.java
sourcecode chunk-file BigService.java --max-lines 300
sourcecode chunk-file BigService.java --chunk 5          # read chunk 5 only
sourcecode chunk-file BigService.java --metadata-only    # boundaries only, no content

Splits a large Java file at method/class boundaries so AI agents can read files with 10K–25K+ lines in context-sized pieces. Each chunk includes chunk_id, start_line, end_line, chunk_type, symbol name, a context_header (package + class + imports summary), and content. A size_warning flag marks methods that exceed --max-lines and cannot be split further.

`prepare-context` — task-specific context

Low-level access to all tasks with full options:

sourcecode prepare-context TASK [PATH] [OPTIONS]

Task	What it surfaces
`explain`	Architecture, entry points, key dependencies
`onboard`	Full structural context for new agents/developers
`fix-bug`	Files ranked by symptom correlation, risk, annotations
`refactor`	Structural issues, improvement opportunities
`generate-tests`	Source files without test pairs, coverage gap analysis
`review-pr`	PR diff with risk ranking, test gaps, module impact
`delta`	Incremental context: git-changed files + transitive import graph

Flags reference

Flag	Alias	Default	Description
`--compact`		off	High-signal summary (typically 2,500–4,000 tokens for mid-to-large Java repos): stacks, entry points, dependencies, confidence, gaps.
`--agent`		off	Structured JSON for AI agents: project identity, entry points, architecture, dependencies, confidence. ~4,500–5,500 tokens.
`--full`		off	Remove truncation limits on `transactional_boundaries`, `mybatis.dto_mappers`, and other capped lists.
`--git-context`	`-g`	off	Include git activity: recent commits, change hotspots, and uncommitted file count.
`--changed-only`		off	Limit output to git-modified files (staged, unstaged, untracked).
`--depth`		`4`	File tree traversal depth (1–20). Java/Maven projects auto-adjust to 12.
`--format`	`-f`	`json`	Output format: `json` or `yaml`.
`--output`	`-o`	stdout	Write output to a file instead of stdout.
`--no-cache`		off	Bypass scan cache and force a fresh analysis.
`--copy`	`-c`	off	Copy output to clipboard after a successful run.
`--no-redact`		off	Disable automatic secret redaction.
`--version`	`-v`	—	Show version and exit.

Output schema

All outputs include:

schema_version: output format version
confidence_summary: overall, stack, entry_points confidence levels (high/medium/low)
analysis_gaps: list of what could not be analyzed and why

Java/Spring-specific fields (when detected)

Field	Description
`language_version`	Java version from `maven.compiler.source` or equivalent
`deployment.spring_boot_version`	Spring Boot version
`deployment.packaging`	`jar` or `war`
`mybatis`	Mapper interface / XML file pairing summary
`transactional_boundaries`	Classes annotated with `@Transactional`
`deployment_risks`	Static risk flags: `spring-boot-2.x-eol`, `legacy-java-runtime`

Telemetry

Anonymous, opt-in. Collects: version, OS, commands, flags, duration, repo size range, errors. No source code, paths, secrets, or output content.

sourcecode telemetry status
sourcecode telemetry enable
sourcecode telemetry disable

Or: export SOURCECODE_TELEMETRY=0

Configuration

sourcecode config    # show version, config file path, telemetry status

`sourcecode.config.json` (repo root)

Optional, per-repo. Loaded from the root of the repo being analyzed. Absent or malformed config is ignored — the tool behaves exactly as without it.

Custom security annotations. Teach endpoints, spring-audit, and explain about project-specific authorization annotations (otherwise reported as policy: "none_detected"):

{
  "customSecurityAnnotations": [
    {
      "fullyQualifiedName": "com.example.security.M3FiltroSeguridad",
      "shortName": "M3FiltroSeguridad",
      "resourceParam": "nombreRecurso",
      "levelParam": "nivelRequerido"
    }
  ]
}

resourceParam / levelParam are optional and name the annotation attributes to surface as resourceName / requiredLevel. Matching endpoints report policy: "custom" and drop out of the no_security_signal count.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.59.0

Jun 19, 2026

1.58.0

Jun 19, 2026

This version

1.57.0

Jun 19, 2026

1.56.0

Jun 19, 2026

1.55.0

Jun 19, 2026

1.54.0

Jun 19, 2026

1.53.0

Jun 17, 2026

1.52.0

Jun 17, 2026

1.51.0

Jun 17, 2026

1.50.0

Jun 17, 2026

1.46.0

Jun 16, 2026

1.45.0

Jun 16, 2026

1.44.0

Jun 16, 2026

1.42.0

Jun 16, 2026

1.41.0

Jun 16, 2026

1.39.0

Jun 15, 2026

1.38.0

Jun 15, 2026

1.36.5

Jun 15, 2026

1.36.4

Jun 15, 2026

1.36.3

Jun 15, 2026

1.36.2

Jun 15, 2026

1.36.1

Jun 14, 2026

1.36.0

Jun 14, 2026

1.35.36

Jun 14, 2026

1.35.35

Jun 14, 2026

1.35.34

Jun 12, 2026

1.35.33

Jun 11, 2026

1.35.32

Jun 10, 2026

1.35.31

Jun 10, 2026

1.35.30

Jun 10, 2026

1.35.29

Jun 10, 2026

1.35.28

Jun 10, 2026

1.35.27

Jun 10, 2026

1.35.26

Jun 9, 2026

1.35.25

Jun 8, 2026

1.35.24

Jun 8, 2026

1.35.23

Jun 8, 2026

1.35.22

Jun 8, 2026

1.35.20

Jun 8, 2026

1.35.19

Jun 8, 2026

1.35.18

Jun 8, 2026

1.35.17

Jun 7, 2026

1.35.16

Jun 7, 2026

1.35.15

Jun 5, 2026

1.35.14

Jun 5, 2026

1.35.13

Jun 5, 2026

1.35.12

Jun 5, 2026

1.35.11

Jun 5, 2026

1.35.10

Jun 4, 2026

1.35.9

Jun 4, 2026

1.35.8

Jun 4, 2026

1.35.7

Jun 4, 2026

1.35.6

Jun 4, 2026

1.35.5

Jun 4, 2026

1.35.4

Jun 4, 2026

1.35.3

Jun 3, 2026

1.35.2

Jun 3, 2026

1.35.1

Jun 3, 2026

1.35.0

Jun 2, 2026

1.33.25

Jun 2, 2026

1.33.24

Jun 2, 2026

1.33.23

Jun 2, 2026

1.33.22

Jun 2, 2026

1.33.21

Jun 2, 2026

1.33.20

Jun 2, 2026

1.33.19

Jun 2, 2026

1.33.18

Jun 2, 2026

1.33.17

Jun 1, 2026

1.33.16

Jun 1, 2026

1.33.15

May 30, 2026

1.33.14

May 30, 2026

1.33.13

May 30, 2026

1.33.12

May 30, 2026

1.33.11

May 30, 2026

1.33.10

May 30, 2026

1.33.9

May 30, 2026

1.33.8

May 29, 2026

1.33.7

May 29, 2026

1.33.6

May 29, 2026

1.33.5

May 29, 2026

1.33.4

May 29, 2026

1.33.3

May 29, 2026

1.33.2

May 29, 2026

1.33.1

May 29, 2026

1.33.0

May 29, 2026

1.32.7

May 29, 2026

1.32.6

May 29, 2026

1.32.5

May 29, 2026

1.32.4

May 29, 2026

1.32.3

May 28, 2026

1.32.2

May 28, 2026

1.32.1

May 28, 2026

1.32.0

May 28, 2026

1.31.32

May 28, 2026

1.31.31

May 28, 2026

1.31.30

May 26, 2026

1.31.29

May 26, 2026

1.31.28

May 26, 2026

1.31.27

May 26, 2026

1.31.26

May 26, 2026

1.31.25

May 25, 2026

1.31.24

May 25, 2026

1.31.23

May 25, 2026

1.31.22

May 25, 2026

1.31.21

May 25, 2026

1.31.20

May 25, 2026

1.31.18

May 24, 2026

1.31.17

May 24, 2026

1.31.16

May 24, 2026

1.31.15

May 24, 2026

1.31.14

May 24, 2026

1.31.13

May 24, 2026

1.31.12

May 22, 2026

1.31.11

May 22, 2026

1.31.10

May 22, 2026

1.31.9

May 21, 2026

1.31.8

May 21, 2026

1.31.7

May 21, 2026

1.31.6

May 21, 2026

1.31.5

May 21, 2026

1.31.4

May 21, 2026

1.31.3

May 21, 2026

1.31.2

May 21, 2026

1.31.1

May 21, 2026

1.31.0

May 20, 2026

1.30.30

May 20, 2026

1.30.29

May 19, 2026

1.30.28

May 19, 2026

1.30.27

May 18, 2026

1.30.26

May 18, 2026

1.30.25

May 18, 2026

1.30.24

May 18, 2026

1.30.23

May 18, 2026

1.30.22

May 18, 2026

1.30.21

May 18, 2026

1.30.20

May 18, 2026

1.30.19

May 18, 2026

1.30.18

May 18, 2026

1.30.17

May 18, 2026

1.30.16

May 18, 2026

1.30.15

May 17, 2026

1.30.14

May 17, 2026

1.30.13

May 16, 2026

1.30.12

May 16, 2026

1.30.11

May 16, 2026

1.30.10

May 16, 2026

1.30.9

May 16, 2026

1.30.8

May 16, 2026

1.30.7

May 16, 2026

1.30.6

May 16, 2026

1.30.5

May 16, 2026

1.30.4

May 16, 2026

1.30.3

May 16, 2026

1.30.2

May 16, 2026

1.30.1

May 16, 2026

1.30.0

May 16, 2026

1.29.0

May 16, 2026

1.28.0

May 16, 2026

1.27.0

May 16, 2026

1.26.0

May 13, 2026

1.24.0

May 13, 2026

1.23.0

May 13, 2026

1.22.0

May 13, 2026

1.21.0

May 13, 2026

1.20.0

May 13, 2026

1.19.0

May 13, 2026

1.18.0

May 13, 2026

1.17.0

May 13, 2026

1.16.0

May 13, 2026

1.15.1

May 13, 2026

1.15.0

May 13, 2026

1.14.0

May 12, 2026

1.13.0

May 12, 2026

1.12.0

May 11, 2026

1.11.0

May 11, 2026

1.10.0

May 9, 2026

1.9.0

May 9, 2026

1.8.0

May 8, 2026

1.7.0

May 8, 2026

1.6.0

May 8, 2026

1.5.0

May 8, 2026

1.4.0

May 8, 2026

1.3.0

May 8, 2026

1.2.0

May 8, 2026

1.1.0

May 8, 2026

1.0.0

May 5, 2026

0.49.0

May 5, 2026

0.48.0

May 5, 2026

0.47.0

May 5, 2026

0.46.0

May 5, 2026

0.45.0

May 5, 2026

0.44.0

May 4, 2026

0.43.0

May 4, 2026

0.42.0

May 4, 2026

0.41.0

May 3, 2026

0.39.0

May 3, 2026

0.38.0

May 3, 2026

0.37.0

May 3, 2026

0.36.0

May 3, 2026

0.35.0

May 3, 2026

0.34.0

May 3, 2026

0.33.0

May 2, 2026

0.32.0

May 1, 2026

0.31.0

May 1, 2026

0.30.0

May 1, 2026

0.29.0

May 1, 2026

0.28.0

Apr 30, 2026

0.27.0

Apr 29, 2026

0.26.0

Apr 29, 2026

0.25.0

Apr 29, 2026

0.24.0

Apr 29, 2026

0.23.0

Apr 26, 2026

0.22.0

Apr 25, 2026

0.21.0

Apr 25, 2026

0.20.0

Apr 24, 2026

0.19.0

Apr 24, 2026

0.18.0

Apr 24, 2026

0.17.0

Apr 24, 2026

0.15.1

Apr 23, 2026

0.15.0

Apr 23, 2026

0.14.0

Apr 23, 2026

0.13.0

Apr 18, 2026

0.12.0

Apr 18, 2026

0.11.0

Apr 15, 2026

0.10.0

Apr 14, 2026

0.9.0

Apr 11, 2026

0.8.0

Apr 10, 2026

0.7.0

Apr 10, 2026

0.6.0

Apr 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sourcecode-1.57.0.tar.gz (615.7 kB view details)

Uploaded Jun 19, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sourcecode-1.57.0-py3-none-any.whl (639.3 kB view details)

Uploaded Jun 19, 2026 Python 3

File details

Details for the file sourcecode-1.57.0.tar.gz.

File metadata

Download URL: sourcecode-1.57.0.tar.gz
Upload date: Jun 19, 2026
Size: 615.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for sourcecode-1.57.0.tar.gz
Algorithm	Hash digest
SHA256	`32e2b607e592a8b5a516082ccaa1f622619b54cf07ee11de0d846a47e475e91e`
MD5	`47faecb73c036518f49b9c2a07ef0080`
BLAKE2b-256	`7918f50b95ebf68585b95491d17d7fd465d88f87647432de841677db3e6a711c`

See more details on using hashes here.

File details

Details for the file sourcecode-1.57.0-py3-none-any.whl.

File metadata

Download URL: sourcecode-1.57.0-py3-none-any.whl
Upload date: Jun 19, 2026
Size: 639.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for sourcecode-1.57.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4ec2e858e55af9efeb680d71d59b507da9708ecc704661793c9ab9087ca7617e`
MD5	`8e27a7e56b809594e14e9eb2020c3307`
BLAKE2b-256	`24b95af36fa0347c2744286db8213d6556eb585fe5e15997e5f873143b16ff4e`

See more details on using hashes here.

sourcecode 1.57.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

sourcecode

The problem

Cache performance — measured on real repos

What changes at 0.3s vs 2.7s

Installation

Homebrew (macOS / Linux)

pip / pipx

Verify

Quickstart

Cache system

Agent workflow patterns

Start of session — structural grounding

Before every change — blast radius + TX/SEC check

Continuous agent loop — delta context

PR review — structured risk signal

Bug triage — symptom-driven

In CI — cached, deterministic, fast

What sourcecode does (and doesn't)

Pricing

Command reference

--compact and --agent

impact — blast-radius analysis [free ≤500 Java files · Pro above]

endpoints — REST API surface

export — architecture views for downstream tooling

spring-audit — Spring semantic audit [free]

impact-chain — systemic blast radius with TX/SEC enrichment [free]

cold-start — RIS bootstrap context

repo-ir — symbol-level IR

explain — architectural summary for a class

pr-impact — PR blast-radius report

onboard — codebase orientation

review-pr — PR review context [free ≤500 Java files · Pro above]

fix-bug — Bug triage context [free ≤500 Java files · Pro above]

modernize — Modernization planning [free ≤500 Java files · Pro above]

migrate-check — Spring Boot 2→3 migration readiness

rename-class — Java class rename

chunk-file — split large Java files for agent consumption

prepare-context — task-specific context

Flags reference

Output schema

Java/Spring-specific fields (when detected)

Telemetry

Configuration

sourcecode.config.json (repo root)

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`--compact` and `--agent`

`impact` — blast-radius analysis [free ≤500 Java files · Pro above]

`endpoints` — REST API surface

`export` — architecture views for downstream tooling

`spring-audit` — Spring semantic audit [free]

`impact-chain` — systemic blast radius with TX/SEC enrichment [free]

`cold-start` — RIS bootstrap context

`repo-ir` — symbol-level IR

`explain` — architectural summary for a class

`pr-impact` — PR blast-radius report

`onboard` — codebase orientation

`review-pr` — PR review context [free ≤500 Java files · Pro above]

`fix-bug` — Bug triage context [free ≤500 Java files · Pro above]

`modernize` — Modernization planning [free ≤500 Java files · Pro above]

`migrate-check` — Spring Boot 2→3 migration readiness

`rename-class` — Java class rename

`chunk-file` — split large Java files for agent consumption

`prepare-context` — task-specific context

`sourcecode.config.json` (repo root)