Presidio security-hardened deal-flow triage & due-diligence toolkit for early-stage AngelList syndicate deals

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

presidio-hardened-angellist

Presidio security-hardened deal-flow triage & due-diligence toolkit for early-stage (pre-seed / seed) startups sourced via AngelList syndicates.

Why not an API client? The legacy AngelList Startup/Funding Data API (api.angel.co) has been shut down — AngelList today is fund/SPV infrastructure, not an open data API. So this toolkit triages the deal flow you actually receive — forwarded syndicate deal emails — instead of calling a dead endpoint. The original Presidio hardening layer is retained and reused for every outbound enrichment call.

How it works

forwarded .eml ─▶ intake ─▶ extraction ─▶ enrichment ─▶ triage rubric ─▶ memo
                  (parse)   (regex first,  (hardened     (deterministic   (Claude or
                            LLM fallback)   HTTP fetch)    scorecard)       template)

Intake — parse a forwarded .eml (or pasted text) into a structured Deal.
Extraction — deterministic regex/heuristics first; Claude fallback only when the parse is too thin (is_complete() is False).
Enrichment (opt-in) — fetch the company website through the hardened session to backfill a one-liner.
Triage — score against a deterministic pre-seed/seed rubric → composite + tier (Pass / Track / Dig deeper / Strong lead).
Memo (opt-in) — Claude-assisted investment memo, with a templated fallback so --memo still works with no API key.

The deterministic path needs no API key. The LLM steps activate only when ANTHROPIC_API_KEY is set and the [llm] extra is installed.

Installation

pip install presidio-hardened-angellist            # deterministic core
pip install 'presidio-hardened-angellist[llm]'     # + Claude extraction/memo

For development:

git clone https://github.com/presidio-v/presidio-hardened-angellist.git
cd presidio-hardened-angellist
uv venv && source .venv/bin/activate
uv pip install -e ".[dev,llm]"

CLI usage

angeltriage deal.eml                 # scorecard for one deal
angeltriage deal.eml --memo          # + investment memo
angeltriage deal.eml --enrich        # fetch the company site for more signal
angeltriage deal.eml --json          # machine-readable output (pipe-friendly)
cat deal.txt | angeltriage -         # read a pasted email from stdin
angeltriage *.eml                    # batch, ranked by composite score
angeltriage deals.csv                # batch-triage a CSV of deals (one row each)
angeltriage --imap                   # pull deal emails over IMAP (see below)
angeltriage --watch --interval 300   # poll IMAP every 5 min, auto-triage new deals
angeltriage deal.eml --no-llm        # force the deterministic-only path
angeltriage deal.eml --weights w.json  # tune dimension weights (see below)
angeltriage deal.eml --rubric r.json   # full rubric config (see below)
angeltriage deal.eml --save          # persist to the deal queue (see below)
angeltriage --queue                  # show the ranked, saved deal queue
angeltriage --set-status 4 passed    # update a saved deal's workflow status

.eml/text inputs are parsed as emails; .csv inputs are triaged a row at a time. You can mix files in one batch — everything is ranked together by score.

Example output:

Nimbus Robotics  [Strong lead · 83.0/100]
  pre-seed · SAFE · $10,000,000 cap · lead: Jane Okafor
  Warehouse-automation robots for SMB 3PLs.

  Scorecard:
    Team       4.5/5   2 founders; credential signals: ex-, former, mit
    Market     3.5/5   clear one-liner present
    Traction   4.5/5   signals: customers, month-over-month, mrr, paying
    Terms      4.0/5   cap $10,000,000; SAFE
    Syndicate  4.0/5   lead: Jane Okafor; allocation $250,000

Library usage

from presidio_angellist import triage_email

result = triage_email("deal.eml", memo=True)
print(result.scorecard.tier, result.scorecard.composite)  # Strong lead 83.0
print(result.deal.valuation_cap)  # 10000000.0
print(result.memo)

Tune the rubric weights:

from presidio_angellist import score_deal, parse_email

deal = parse_email("deal.eml")
sc = score_deal(
    deal, weights={"team": 0.4, "market": 0.2, "traction": 0.2, "terms": 0.1, "syndicate": 0.1}
)

Triage rubric (pre-seed / seed)

Dimension	What it weighs
Team	Founder count, technical co-founder, credential signals (ex-FAANG, YC, etc.)
Market	Crispness of the one-liner / sector framing
Traction	Revenue, users, LOIs, growth — any early signal
Terms	Valuation cap sanity for the stage, instrument (SAFE/priced)
Syndicate	Named lead, allocation, social proof

Risk flags (solo founder, missing cap, cap too high for stage, no traction, no website) are surfaced separately.

Out-of-scope (growth-stage) detection

The rubric targets pre-seed / seed. When a deal looks later-stage — an explicit Series A/B/C, ARR/revenue ≥ $5M, or a priced/venture round with a large round size — it's tagged Out of scope with a note (e.g. "Likely growth-stage (~$40M ARR; venture round $20M) — outside pre-seed/seed scope; score is indicative only") instead of being given a misleading tier. The composite is still computed, but flagged as indicative. Exposed as detect_stage_scope(deal) and Scorecard.scope_note.

Tuning the weights

Weights live in DEFAULT_WEIGHTS and are overridable per call, or from a JSON config file via --weights:

{
  "team": 0.5,
  "traction": 0.3
}

angeltriage deal.eml --weights weights.json

Dimensions you omit keep their default weight (so partial overrides are fine), weights need not sum to one (the composite normalizes by total weight), and at least one must be positive. Valid dimensions: team, market, traction, terms, syndicate. From the library:

from presidio_angellist import load_weights, triage_email

result = triage_email("deal.eml", weights=load_weights("weights.json"))

Full rubric config (`--rubric`)

For more than weights, pass a --rubric file. All sections are optional and merge over the defaults:

{
  "weights": { "team": 0.4, "traction": 0.25 },
  "tier_thresholds": { "Strong lead": 90, "Dig deeper": 75 },
  "cap_ceilings": { "pre-seed": 8000000, "seed": 25000000 },
  "risk_penalty": 5.0
}

tier_thresholds — minimum composite (0–100) for each tier label. The Pass floor at 0 is always retained.
cap_ceilings — per-stage valuation-cap ceiling (USD); caps above it raise a risk flag and dock the Terms score.
risk_penalty — composite points deducted per risk flag (default 0).

angeltriage deal.eml --rubric rubric.json   # mutually exclusive with --weights

from presidio_angellist import load_rubric_config, triage_email

result = triage_email("deal.eml", config=load_rubric_config("rubric.json"))

Validation fails closed — unknown keys/dimensions, out-of-range thresholds, negative penalties, or malformed JSON raise WeightsConfigError.

CSV batch import

angeltriage deals.csv triages one Deal per row. Headers are matched case-insensitively against common aliases:

Field	Accepted headers (any of)
company	`company`, `name`, `startup`
valuation_cap	`valuation_cap`, `cap`, `valuation`
round_size	`round_size`, `raising`, `round`, `target`
website	`website`, `url`, `site`
founders	`founders`, `founder`, `team` (split on `;` / `,`)
…	`one_liner`, `sector`, `stage`, `instrument`, `allocation`, `lead`, `deadline`, `location`, `traction`, `links`

Money cells accept $1.2M, 1,200,000, or 500k. Rows without a company are skipped.

from presidio_angellist import triage_csv

for result in triage_csv("deals.csv"):
    print(result.deal.company, result.scorecard.tier)

IMAP intake

--imap pulls deal emails straight from a mailbox (file syndicate emails into a folder, then poll it). It runs wherever you run it — your laptop or a server, not a phone. Credentials come from the environment only (never the command line) — use an app-specific password (iCloud, Gmail with 2FA):

export IMAP_HOST=imap.mail.me.com      # iCloud; Gmail: imap.gmail.com
export IMAP_USER=you@icloud.com
export IMAP_PASSWORD=abcd-efgh-ijkl-mnop   # app-specific password
export IMAP_FOLDER=Deals               # optional; defaults to INBOX

angeltriage --imap --save              # fetch UNSEEN, triage, save to the queue
angeltriage --imap --imap-all --imap-limit 20    # most recent 20, read or not
angeltriage --imap --imap-from deals@syndicate.com

Env var	Purpose
`IMAP_HOST` / `IMAP_USER` / `IMAP_PASSWORD`	Required connection + app-specific password
`IMAP_PORT`	Optional, default `993`
`IMAP_FOLDER`	Optional, default `INBOX` (or use `--imap-folder`)
`IMAP_SSL`	Optional, default on (`0`/`false` to disable)

Flags: --imap-folder, --imap-all (not just UNSEEN), --imap-from ADDR, --imap-limit N. The mailbox is opened read-only, so messages aren't marked read — re-polling re-fetches them and the deal queue dedups by deal identity.

from presidio_angellist import imap_config_from_env, triage_imap

cfg = imap_config_from_env(folder="Deals", limit=20)  # reads IMAP_* env vars
for result in triage_imap(cfg):
    print(result.deal.company, result.scorecard.tier)

Watch mode (continuous polling)

--watch polls the mailbox on an interval and auto-triages new deals into the queue — a hands-off inbox-to-queue pipeline:

angeltriage --watch --interval 300        # poll every 5 min until Ctrl-C, saving new deals
angeltriage --watch --max-cycles 12       # poll 12 times then stop

Within a session, messages are deduped by Message-ID so the same unread email isn't re-triaged every poll; across restarts the deal queue dedups by deal identity. The first poll fails fast on a bad config/credentials; later polls tolerate transient network errors and keep going. Each poll prints a one-line summary plus any newly-saved deals. For a cron-style setup, use --max-cycles 1 on a schedule instead of a long-running process.

from presidio_angellist import DealStore, imap_config_from_env, watch

with DealStore() as store:
    watch(imap_config_from_env(folder="Deals"), store, interval=300)

⚠️ Don't put your mail password in a shared/remote shell. Keep it in a local .env / your shell profile, scoped to where you run the tool.

Local / self-hosted LLM

By default the LLM layer uses Anthropic (ANTHROPIC_API_KEY). To run against a local or self-hosted OpenAI-compatible model instead (mlx_lm.server, Ollama, vLLM, LM Studio…), set a base URL — that alone switches backends:

export ANGELTRIAGE_LLM_BASE_URL=http://127.0.0.1:8080/v1
export ANGELTRIAGE_LLM_MODEL=my-local-model
export ANGELTRIAGE_LLM_API_KEY=not-needed     # optional; many local servers ignore it
export ANGELTRIAGE_LLM_TIMEOUT=120            # optional, seconds

Env var	Purpose
`ANGELTRIAGE_LLM_BASE_URL`	OpenAI-compatible base (e.g. `…/v1`); set this to use a local model
`ANGELTRIAGE_LLM_MODEL`	Model id the server expects
`ANGELTRIAGE_LLM_API_KEY`	Optional bearer token (default `not-needed`)
`ANGELTRIAGE_LLM_PROVIDER`	Optional, `openai`/`anthropic` to force a backend
`ANGELTRIAGE_LLM_EXTRA_BODY`	Optional JSON merged into the request body for server-specific params

For a reasoning model (e.g. Qwen3), disable its thinking so it returns the final answer directly (otherwise it may emit only reasoning tokens and no content):

export ANGELTRIAGE_LLM_EXTRA_BODY='{"chat_template_kwargs":{"enable_thinking":false}}'

Local endpoints are usually loopback, which the enrichment SSRF guard refuses — so LLM calls deliberately bypass that guard. Point it only at a server you control (SECURITY.md). If the model is unreachable, triage degrades to deterministic scoring + a templated memo rather than failing.

Email notifications (`--notify`)

--notify emails each deal new to the store to a recipient list over SMTP — ideal for a daily unattended run. Config is environment-only:

export ANGELTRIAGE_SMTP_HOST=smtp.example.com
export ANGELTRIAGE_SMTP_PORT=465              # 465 = implicit TLS; else STARTTLS
export ANGELTRIAGE_SMTP_USER=you@example.com
export ANGELTRIAGE_SMTP_PASSWORD=…            # app-specific password
export ANGELTRIAGE_SMTP_FROM=you@example.com  # optional, defaults to USER
export ANGELTRIAGE_NOTIFY_TO="a@example.com, b@example.com"

# daily one-shot: poll the al folder once, triage, save, email new deals
angeltriage --watch --max-cycles 1 --imap-folder al --notify

--notify requires a store (implicit under --watch; pass --save otherwise) and only emails genuinely new deals — so a read-only mailbox re-fetch won't re-send. With --watch --max-cycles 1, a persisted processed_messages table ensures each message is triaged exactly once across daily runs. Send failures are loud (non-zero exit), so a deal is never silently dropped.

Deal queue (persistence)

--save persists triaged deals to a local SQLite store so triage becomes a workflow you work over time, instead of one-shot:

angeltriage inbox/*.eml --save           # triage + save the batch
angeltriage --queue                      # ranked list of everything saved
angeltriage --queue --status new         # filter by workflow status
angeltriage --set-status 4 tracking      # new -> tracking -> passed -> committed

  #  tier         score  status     seen  company
  1  Strong lead   83.0  tracking      2  Nimbus Robotics
  3  Track         49.5  new           1  Solo Stealth

Dedup across runs — deals are keyed by website domain (or normalized company name when there's no site), so the same deal forwarded by two syndicates collapses to one row. seen counts how many times it arrived.
Status is preserved on re-save — re-triaging a passed deal won't reset it to new; only the scorecard/score refresh.
Store location — ~/.angeltriage/deals.db by default; override with --db FILE or the ANGELTRIAGE_DB env var. The DB is local; nothing leaves your machine.

from presidio_angellist import DealStore, triage_email

with DealStore() as store:  # default path, or DealStore("deals.db")
    saved, is_new = store.save(triage_email("deal.eml"))
    for row in store.list(status="new"):
        print(row.id, row.company, row.tier, row.composite)
    store.set_status(saved.id, "tracking")

Security hardening (retained, reused for enrichment)

Feature	What it does
Strict TLS 1.2+ enforcement	Rejects TLS 1.0/1.1; ephemeral-EC ciphers only; `verify=True` always
HTTP → HTTPS auto-upgrade	Insecure `http://` URLs are silently upgraded; non-HTTP(S) schemes refused
SSRF guard	Refuses targets resolving to loopback/private/link-local (incl. `169.254.169.254`)/reserved addresses, on every redirect hop, plus ambiguous numeric notations; redirect depth capped at 5
API key / secret redaction	`RedactingFilter` on the `presidio_angellist` logger scrubs Bearer tokens, `sk_live_` / `sk-ant-` keys, and `access_token=`/`api_key=` from every log record
Retry with backoff	Exponential backoff on connection errors / 429 / 5xx, honouring `Retry-After`
Per-host rate limiting	Token-bucket limiter; prevents accidental DoS of enrichment hosts
Security event logging	Structured logs for every hardening action (`presidio_angellist` logger)

Every outbound enrichment request goes through HardenedSession. Untrusted deal text sent to the optional LLM layer is fenced and the system prompt treats it as data, not instructions (prompt-injection defense); plaintext IMAP is refused unless explicitly opted in. See SECURITY.md for the full trust-boundary model.

Roadmap

Next 12 months

Now / in flight — OpenSSF hardening: governance and assurance documentation, Scorecard workflow, Bandit SAST, SBOM generation, an Atheris fuzz harness over the intake path, and the OpenSSF Best Practices badge (passing, then silver).
Next — 0.8.0: pluggable enrichment providers (Crunchbase / Harmonic) behind a stable interface, and queue export / digest output. First signed release tag and GitHub Release carrying the SBOM and build-provenance attestation.
Later, under evaluation — a migration mechanism for the SQLite deal store (needed before the schema changes; see SEMVER.md), broader founder/traction signal extraction, and an independent third-party security review.

Release history

Version	Highlights
0.2.0	Pivot to deal-flow triage: email intake, deterministic rubric, `--weights` config, LLM extraction fallback + memo, `angeltriage` CLI
0.3.0	CSV/batch import, full rubric config (`--rubric`: tiers, cap ceilings, per-flag penalty), HTML-email robustness, og/title enrichment fallbacks
0.4.0	SQLite deal queue: `--save` / `--queue` / `--set-status`, dedup across runs, workflow statuses
0.5.0	IMAP intake (`--imap`, key-gated)
0.5.1	IMAP watch mode (`--watch`: interval polling, in-session dedup, auto-save)
0.5.2	Better company/one-liner extraction (body cues); growth-stage out-of-scope detection
0.6.0	Security-hardening release: SSRF guard, sink-enforced log redaction, LLM prompt-injection defense, restored retry/backoff, plaintext-IMAP refusal, CVE-floored deps + `pip-audit` in CI
0.7.0	Local/self-hosted LLM backend (OpenAI-compatible), SMTP deal notifications (`--notify`), exactly-once polling (`processed_messages`)
0.8.0 (planned)	Pluggable enrichment providers (Crunchbase/Harmonic), queue export/digest

Running tests

pytest -v --cov=presidio_angellist --cov-report=term-missing

Project structure

presidio-hardened-angellist/
├── src/presidio_angellist/
│   ├── __init__.py          # public API
│   ├── hardening.py         # TLS / redaction / rate-limit primitives
│   ├── models.py            # Deal, Scorecard, TriageResult
│   ├── intake/email.py      # forwarded .eml / text -> Deal (deterministic)
│   ├── intake/csv.py        # CSV of deals -> list[Deal]
│   ├── intake/imap.py       # pull deal emails over IMAP (key-gated)
│   ├── watch.py             # --watch: poll IMAP on an interval, auto-triage
│   ├── enrich/web.py        # hardened website enrichment
│   ├── rubric_config.py     # RubricConfig + defaults (weights/tiers/ceilings)
│   ├── triage/rubric.py     # deterministic pre-seed/seed scorecard
│   ├── triage/memo.py       # LLM memo + templated fallback
│   ├── store.py             # SQLite-backed persistent deal queue
│   ├── config.py            # --weights / --rubric config loaders
│   ├── llm.py               # optional Claude extraction/memo (key-gated)
│   ├── pipeline.py          # end-to-end triage_email()
│   └── cli.py               # angeltriage entrypoint
├── tests/
├── pyproject.toml
├── LICENSE                  # MIT
├── README.md
└── SECURITY.md

Governance, Architecture, Security

Governance — roles, decision process, and how to become a maintainer.
Architecture — components, trust boundaries, and the core processing path.
Assurance case — the security claims and the evidence backing each one.
Security policy — supported versions and how to report a vulnerability.
Contributing — the bar a change has to clear, including the test policy.
Stability guarantees — the public API and the invariants downstream can rely on.
Changelog — what changed in each release.

License

MIT — see LICENSE.

Security

See SECURITY.md for our vulnerability disclosure policy.

SDLC

This repository is developed under the Presidio hardened-family SDLC: https://github.com/presidio-v/presidio-hardened-docs/blob/main/sdlc/sdlc-report.md.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

vstantch

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.7.3

Jul 29, 2026

0.7.2

Jul 29, 2026

0.7.1

Jun 6, 2026

0.7.0

Jun 6, 2026

0.6.0

Jun 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

presidio_hardened_angellist-0.7.3.tar.gz (155.8 kB view details)

Uploaded Jul 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

presidio_hardened_angellist-0.7.3-py3-none-any.whl (63.7 kB view details)

Uploaded Jul 29, 2026 Python 3

File details

Details for the file presidio_hardened_angellist-0.7.3.tar.gz.

File metadata

Download URL: presidio_hardened_angellist-0.7.3.tar.gz
Upload date: Jul 29, 2026
Size: 155.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for presidio_hardened_angellist-0.7.3.tar.gz
Algorithm	Hash digest
SHA256	`d766cbcdfaaaa985303229881b6252ec677b069d37caccd89d1cac4ff25054be`
MD5	`af25abdb7d84f69c397e13bedec67d8c`
BLAKE2b-256	`148066c5631f852a47338e80d8a0dbb7005aeb1dc032fd804b15c544394a56e0`

See more details on using hashes here.

Provenance

The following attestation bundles were made for presidio_hardened_angellist-0.7.3.tar.gz:

Publisher: publish.yml on presidio-v/presidio-hardened-angellist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: presidio_hardened_angellist-0.7.3.tar.gz
- Subject digest: d766cbcdfaaaa985303229881b6252ec677b069d37caccd89d1cac4ff25054be
- Sigstore transparency entry: 2284193047
- Sigstore integration time: Jul 29, 2026
Source repository:
- Permalink: presidio-v/presidio-hardened-angellist@bee539031a457a97e3102417546588090b60e4e0
- Branch / Tag: refs/tags/v0.7.3
- Owner: https://github.com/presidio-v
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@bee539031a457a97e3102417546588090b60e4e0
- Trigger Event: push

File details

Details for the file presidio_hardened_angellist-0.7.3-py3-none-any.whl.

File metadata

Download URL: presidio_hardened_angellist-0.7.3-py3-none-any.whl
Upload date: Jul 29, 2026
Size: 63.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.14

File hashes

Hashes for presidio_hardened_angellist-0.7.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bfbcd05b6ee0a14f71e069a11735f4ce996f5ebcc39ec4acc8d1f5a09490eb44`
MD5	`aad594ab9b16b2826dacb35344bec69e`
BLAKE2b-256	`c94c0b6a5a923fde3b616055742f4fe33bc0d16587144176d5e933d6be700e30`

See more details on using hashes here.

Provenance

The following attestation bundles were made for presidio_hardened_angellist-0.7.3-py3-none-any.whl:

Publisher: publish.yml on presidio-v/presidio-hardened-angellist

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: presidio_hardened_angellist-0.7.3-py3-none-any.whl
- Subject digest: bfbcd05b6ee0a14f71e069a11735f4ce996f5ebcc39ec4acc8d1f5a09490eb44
- Sigstore transparency entry: 2284193419
- Sigstore integration time: Jul 29, 2026
Source repository:
- Permalink: presidio-v/presidio-hardened-angellist@bee539031a457a97e3102417546588090b60e4e0
- Branch / Tag: refs/tags/v0.7.3
- Owner: https://github.com/presidio-v
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@bee539031a457a97e3102417546588090b60e4e0
- Trigger Event: push

presidio-hardened-angellist 0.7.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

presidio-hardened-angellist

How it works

Installation

CLI usage

Library usage

Triage rubric (pre-seed / seed)

Out-of-scope (growth-stage) detection

Tuning the weights

Full rubric config (--rubric)

CSV batch import

IMAP intake

Watch mode (continuous polling)

Local / self-hosted LLM

Email notifications (--notify)

Deal queue (persistence)

Security hardening (retained, reused for enrichment)

Roadmap

Next 12 months

Release history

Running tests

Project structure

Governance, Architecture, Security

License

Security

SDLC

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Full rubric config (`--rubric`)

Email notifications (`--notify`)