Flagship NMTC application intelligence platform — pipeline analysis, eligibility validation, readiness scoring, and visualization for CDEs
Project description
NMTC Application Builder
The open-source intelligence platform for competitive CDFI Fund applications.
Documentation · Streamlit Demo · Examples · PyPI
CDEs spend months preparing NMTC allocation applications without knowing how their pipeline compares to historical winners. This library changes that — scoring your pipeline against five years of CDFI Fund award data in seconds, generating competition-ready document drafts automatically, and telling you exactly what to fix.
app = Application(cde=CDEProfile.sample(), requested_allocation=65_000_000)
app.add_pipeline(Pipeline.from_csv("pipeline.csv"))
score = app.score_win_probability()
print(f"Alignment: {score.composite_score:.0f}/100 [{score.competitive_tier}]")
# → Alignment: 66/100 [competitive]
paths = app.generate("./drafts/")
# → Word, Excel, PDF, and Markdown application package ready in ./drafts/
The Problem
CDE teams preparing NMTC allocation applications work blind. They spend weeks manually assembling pipeline data in Excel, draft narrative sections without knowing how their distress concentration or geographic diversity compares to past winners, and submit applications with no objective measure of competitiveness. The CDFI Fund receives 280–340 applications per round with a ~35% acceptance rate — yet most CDEs have no systematic way to benchmark their position before the deadline.
The Solution
nmtc-application-builder gives CDEs a programmatic intelligence layer built on five years of CDFI Fund public award data. Load your pipeline from CSV, run analyze(), and immediately see where you stand on every dimension the CDFI Fund scores: distress concentration, geographic diversity, sector mix, impact intensity, and pipeline quality. Get specific, numbered recommendations. Optimize your project subset automatically. Generate the Word, Excel, PDF, and Markdown drafts that go directly into your application package.
Quickstart
pip install nmtc-application-builder[output,viz]
from nmtcapp import Application, CDEProfile, Pipeline
from nmtcapp.optimizer import OptimizationConstraints
# 1. Define your CDE
cde = CDEProfile.sample() # or CDEProfile.from_yaml("cde.yaml")
# 2. Load your pipeline
pipeline = Pipeline.from_csv("pipeline.csv") # or Pipeline.sample(n=20) for demo
# 3. Analyze
app = Application(cde=cde, requested_allocation=65_000_000)
app.add_pipeline(pipeline)
analysis = app.analyze()
analysis.summary()
# 4. Score alignment with historical winners
score = app.score_win_probability() # alignment score, not win probability
print(f"{score.composite_score:.0f}/100 [{score.competitive_tier}]")
# 5. Get quantified recommendations
recs = app.recommendations()
print(recs.summary())
# 6. Optimize your pipeline subset
result = app.optimize_pipeline(
constraints=OptimizationConstraints(max_total_qei=65_000_000, min_states=5)
)
print(f"Score: {result.alignment_score_before*100:.0f} → {result.alignment_score_after*100:.0f}")
# 7. Generate the full application package
paths = app.generate("./drafts/")
Or bootstrap a starter project in 60 seconds:
nmtcapp init my-application/
cd my-application/
jupyter notebook analysis.ipynb
Pipeline Template v1.1
Download templates/pipeline_template.xlsx for the recommended way to provide pipeline and CDE data to the Streamlit analyzer.
Template structure
| Sheet | Purpose |
|---|---|
| CDE Profile | One row — 30 CDE-level scoring inputs (Business Strategy, Community Outcomes, Priority Points, Phase 2 flags) |
| Pipeline | One row per project — 28 columns including all new v1.1 per-project flags |
| Instructions | Field-by-field documentation, scoring formula summary, graceful-degradation notes |
| Valid Values | Dropdown source lists (do not edit) |
New per-project flags (Pipeline sheet)
These Y/N flags in the Pipeline sheet automatically compute CDE-level scoring inputs — you don't need to manually calculate percentages:
| Column | Drives | Sub-score |
|---|---|---|
Native Area (Y/N) |
pct_native_area |
CO Special Targeting |
High Migration Rural (Y/N) |
pct_high_migration_rural |
CO Special Targeting |
US Territory (Y/N) |
pct_us_territories |
CO Special Targeting |
Persistent Poverty (Y/N) |
pct_persistent_poverty |
CO Special Targeting |
Below-Market Rate (Y/N) |
products_below_market_pct |
BS Product Flexibility |
Unrelated Entity (Y/N) |
unrelated_entities_pct |
PP Unrelated Entities |
If you also supply the CDE-level percentage in the CDE Profile sheet, it takes precedence over the computed value.
Default behaviour when flags are absent
All six flags are optional. When a column is missing from the file, or a cell is blank, the flag defaults to None, which the scoring engine treats identically to N — the project contributes zero QEI to the relevant percentage. This is a conservative default: you will not be penalised for leaving a flag blank, but you also will not receive credit for that targeting category.
Version requirement: The Streamlit analyzer and
Pipeline.from_csv()both accept v1.0 files (without the flag columns) and will score them correctly — but the Special Targeting sub-score, pipeline-derived Product Flexibility, and pipeline-derived Unrelated Entities will all default to 0 without warning. Use the v1.1 xlsx template or add the flag columns to your CSV to get accurate scores for those sub-criteria.
Graceful degradation
When CDE Profile fields are missing, the Streamlit analyzer displays which sub-scores will use defaults and what those defaults are — so you can see exactly what data gaps are costing you points.
What It Does
- Pipeline ingestion — Load from CSV or v1.1 xlsx template; validates all required fields
- NMTC eligibility enrichment — Census tract lookup, distress level classification (deep / severe / LIC), opportunity zone and native area flags
- Distress concentration analysis — Deep/severe QEI percentage vs. CDFI Fund competitive thresholds (target: ≥75%)
- Geographic diversity scoring — State count, HHI concentration index, urban/rural split
- Sector mix analysis — Shannon entropy, dominant sector, high-priority sector alignment
- Impact projection — Jobs per $MM QEI benchmarked against historical winner distributions
- Win alignment scoring — 5-dimensional score (0–100) against CY2020–2024 winner patterns
- Quantified recommendations — Specific, numbered improvement actions per dimension with estimated score impact
- Pipeline optimizer — Greedy + local-search selects the best project subset for your target budget
- Output generation — Word, Excel, PDF, and Markdown application drafts in one call
- Geographic visualizations — Publication-quality pipeline maps, radar charts, and benchmark plots at 300 DPI
- CLI —
nmtcapp init/nmtcapp analyzefor quick command-line workflows
Methodology note: Alignment scores measure similarity to historical winner patterns — they are not win probabilities. The CDFI Fund does not publish rejected application data, so a true probability model cannot be built from public information alone.
Sample Output Gallery
Generated outputs are in examples/sample_output/ — Word, Excel, PDF, and Markdown for a realistic sample CDE application.
The three example notebooks tell a complete story:
| Notebook | What it demonstrates |
|---|---|
| 01_quickstart.ipynb | End-to-end workflow in 10 minutes |
| 02_full_application_walkthrough.ipynb | Complete document generation |
| 03_intelligence_and_optimization.ipynb | 5.5 → 65.9 → 79.1/100 — weak→competitive pipeline transformation |
Architecture
nmtc-application-builder/
├── nmtcapp/
│ ├── core/ Application · CDEProfile · Pipeline
│ ├── intelligence/ PipelineAnalyzer · WinProbabilityModel · RecommendationEngine
│ ├── optimizer/ PipelineOptimizer · CandidatePool · Objectives
│ ├── validation/ EligibilityCheck · CompletenessCheck · ReadinessScore
│ ├── integrations/ nmtc-mapper · nmtc-calc · cdfidata · impact-ledger
│ ├── visualization/ pipeline maps · distress heatmap · radar · alignment charts
│ ├── renderers/ Word · Excel · PDF · Markdown builders
│ ├── data/ historical awards · benchmark thresholds · schema
│ └── cli.py nmtcapp init / analyze / version
├── examples/ 3 executed Jupyter notebooks + sample output
├── streamlit_app/ Interactive web demo (4 pages)
├── templates/ pipeline_template.xlsx (v1.1) · pipeline_template.csv · cde_profile_template.yaml
└── docs/ MkDocs documentation site
Built on the Open-Source CDFI Analytics Stack
This library integrates six companion libraries built for the CDFI space:
| Library | Role in this project |
|---|---|
nmtc-mapper |
Census tract geocoding and eligibility classification (deep / severe / LIC) |
nmtc-calc |
NMTC leveraged deal economics (QEI → NMTCs → investor equity) |
cdfidata |
CDFI Fund TLR/CLR/Awards ETL and dataset loader |
impact-ledger |
Portfolio-level impact tracking by sector |
hmda-analyzer |
HMDA CRA assessment data integration |
cra-scraper |
Community Reinvestment Act data extraction |
Use Cases
CDE application teams — Run analyze() on your pipeline weekly during application season. Watch your readiness score improve as you add projects and address recommendations. Generate the first draft of every section automatically.
CDFI consultants — Drop a client's pipeline CSV in and produce a competitive benchmark report in minutes. Show exactly where they stand vs. historical winners before committing to a full engagement.
Researchers and policy analysts — Query the embedded CY2020–2024 CDFI Fund award statistics. Study what differentiates winning applications across distress concentration, geographic reach, and impact intensity.
CDEs evaluating pipeline strategy — Use the optimizer to understand what subset of your project pipeline maximizes competitive alignment given a target allocation amount and diversity constraints.
Limitations & Honest Disclosures
- Not a win probability model. Alignment score ≠ probability of receiving an allocation. The CDFI Fund does not publish rejected application data, so a calibrated probability model cannot be built from public information alone.
- Historical patterns, not current NOFA. Benchmarks derive from CY2020–2024 award data. CDFI Fund priorities shift — always check the current NOFA for updated criteria.
- Approximate geographic data. Pipeline maps use state centroids, not actual project addresses. Eligibility enrichment uses
nmtc-mapperand falls back to embedded sample data when offline. - Not a substitute for expert review. Always have a qualified CDFI practitioner or attorney review application materials before submission.
- No investor or underwriting analysis. This library covers competitive positioning, not deal structuring, investor sourcing, or legal compliance.
Documentation
Full documentation at jaypatel1511.github.io/nmtc-application-builder
- Installation guide
- 60-second quickstart
- Pipeline analysis workflow
- Win alignment scoring & methodology
- Full API reference
- Honest limitations
Contributing
Contributions welcome — bug fixes, additional data sources, visualization improvements, and documentation all help.
git clone https://github.com/Jaypatel1511/nmtc-application-builder.git
cd nmtc-application-builder
pip install -e ".[dev]"
PYTHONPATH=. pytest tests/ -v # 544 tests, should all pass
See CONTRIBUTING.md for guidelines on pull requests, code style, and issue reporting.
License
MIT License — see LICENSE for details.
Built by Jay Patel as part of an open-source CDFI analytics portfolio. Not affiliated with the CDFI Fund or the US Treasury.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nmtc_application_builder-1.1.0.tar.gz.
File metadata
- Download URL: nmtc_application_builder-1.1.0.tar.gz
- Upload date:
- Size: 119.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d6acf140bfd6034207e25c09c3659eb0725a685ae4d1725dfb9a04a4da1ebabb
|
|
| MD5 |
632fad6b135d39f339ce95b0503e4990
|
|
| BLAKE2b-256 |
b60e2dbf354284d12b903fbe363b6de476fad5214b6d168201d1da32686d4659
|
File details
Details for the file nmtc_application_builder-1.1.0-py3-none-any.whl.
File metadata
- Download URL: nmtc_application_builder-1.1.0-py3-none-any.whl
- Upload date:
- Size: 138.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8c6f1cef277e92d96bdd99e6328e77c60e93c9da4a61f3e2095cbd8cb66a5a2d
|
|
| MD5 |
299d64926e575fc0630b6f4510b17cea
|
|
| BLAKE2b-256 |
2f18c902f13420e2ab1c04950b06a749eb658b7435844abe7fb9fec35c9e0b25
|