Audit PEA-eligibility of ETF KID documents with a vision LLM. French PEA (Plan d'Épargne en Actions) rules built in.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

pea-audit

Audit French PEA (Plan d'Épargne en Actions) eligibility of ETFs by reading their KID (Key Information Document) with a vision LLM. Tells you whether a fund is actually eligible for a French PEA account — with verbatim citations from the document.

$ python audit_cli.py samples/amundi_pea_monde_kid.pdf
📄 Audit de : samples/amundi_pea_monde_kid.pdf

  ✅ ÉLIGIBLE PEA    (confiance : high)

  Émetteur     : Amundi
  ISIN         : FR001400U5Q4
  Indice       : MSCI World Index EUR
  Réplication  : synthetic_swap

  Le fonds est éligible au PEA car il utilise une réplication synthétique
  via swap (IFT) avec un panier d'actions européennes ≥75%.

  Preuves :
    p.1 — « Le Fonds est éligible au Plan d'Épargne en Actions français (PEA) ... »
    p.1 — « La performance sera échangée contre celle de l'Indice de Référence ... »

Why

PEA eligibility is opaque and changes silently — issuers re-domicile, swap counterparties, switch to ESG-screened variants, and rename funds (e.g. Amundi PEA Nasdaq-100 silently became "Amundi PEA US Tech Screened" under the same ticker). Brokers don't always flag this. pea-audit reads each fund's KID directly and tells you what the document actually says, with quotes you can verify.

Install

pip install pea-audit

Optional extras:

pip install 'pea-audit[observability]'  # adds Langfuse for LLM tracing
pip install 'pea-audit[evals]'           # adds pyyaml for the eval suite
pip install 'pea-audit[dev]'             # everything above + python-dotenv

Quickstart

from pathlib import Path
from pea_audit import audit_pdf, VerdictCache
from pea_audit.llm import OllamaCloudClient

# Default backend: Ollama Cloud running Gemma 4 31b
llm = OllamaCloudClient(api_key="sk-...")  # from https://ollama.com/settings/keys

# Cache is opt-in. Library never writes to disk unless you supply one.
cache = VerdictCache(Path("./cache"))

verdict = audit_pdf("path/to/kid.pdf", llm=llm, cache=cache)

print(verdict.eligible)        # "yes" | "no" | "uncertain"
print(verdict.replication)     # "physical" | "synthetic_swap" | "unknown"
print(verdict.isin)            # deterministic — extracted from PDF text + Luhn-validated
for c in verdict.evidence:
    print(f"  p.{c.page}: « {c.quote} »")

Audit by ticker (built-in URL registry)

from pea_audit import audit_ticker, VerdictCache
from pea_audit.llm import OllamaCloudClient

llm = OllamaCloudClient(api_key="sk-...")
cache = VerdictCache(Path("./cache"))

result = audit_ticker("EWLD.PA", llm=llm, kid_dir=Path("./kids"), cache=cache)
print(result.verdict.eligible)  # "yes"

Built-ins ship for the most common French ETFs (Amundi PEA range, BNP Paribas Easy). Add more:

from pea_audit.sources import register_source, KIDSource

register_source(KIDSource(
    ticker="LYX.PA",
    isin="FR0010411884",
    url="https://www.lyxoretf.fr/.../kid.pdf",
    issuer="Lyxor",
))

Architecture

Two protocols make this library extensible without forking:

`VisionLLM` — swap the model

from typing import Any, Protocol

class VisionLLM(Protocol):
    def analyze_images(
        self,
        images: list[bytes],
        prompt: str,
        schema: dict[str, Any],
        system: str | None = None,
    ) -> dict[str, Any]: ...

The default OllamaCloudClient wraps Gemma 4 via Ollama Cloud with tenacity retries on transient errors and optional Langfuse tracing. Anyone can implement this protocol to plug in Claude vision, GPT-4o, Gemini, a local Ollama instance, etc.

`KIDSource` — add issuers

from pea_audit.sources import register_source, KIDSource, get_source, all_sources

A registry of ticker → KID URL mappings. Ships builtins for Amundi (URL pattern), BNP Paribas (per-fund UUIDs); URL helpers for BlackRock/iShares + Vanguard are importable but don't auto-register (most of their funds are PEA-ineligible — they're for testing the negative path).

Eval baseline

The repo ships 13 regression cases under evals/cases/*.yaml — 7 PEA-eligible synthetic-swap, 6 ineligible physical non-EEA — covering Amundi, BNP, BlackRock/iShares, Vanguard. Current baseline on Gemma 4 31b-cloud: 13/13 (100%). Run before any prompt or model change:

python evals/run.py

Production niceties

Retries on transient errors — tenacity with exponential backoff (1s → 4s → 16s), only on network/timeout/5xx (not on 4xx or schema errors that won't self-resolve)
Optional observability — Langfuse traces per LLM call (model, input/output, tokens, latency). Activates when LANGFUSE_PUBLIC_KEY/LANGFUSE_SECRET_KEY are set, silent no-op otherwise
Deterministic ISINs — vision misreads of the 12-char ISIN string are corrected by regex-extracting candidates from the PDF text layer and validating with the Luhn check digit
Versioned prompts — pea_audit/prompts/audit_v{N}.md files, selected via prompt_version= parameter; rollback is a config change, not a code edit
Hard vs soft fields in diffs — compare_verdicts() defaults to comparing only categorical fields (eligible, replication, isin) so monthly re-audit doesn't false-fire on LLM rephrasing of free-text issuer/index names

Reference app: ETFTracker

The repo also ships a personal-tool app that consumes the library: a French ETF portfolio tracker with a Streamlit dashboard, monthly re-audit cron, FastAPI service, and Docker compose deployment. See ETFTracker.md (French) for that side.

To run it: cp positions.csv.example positions.csv, edit with your own holdings, cp .env.example .env with your Ollama key, then docker compose up -d web or streamlit run dashboard.py.

Publishing checklist (maintainer)

PyPI publication uses trusted publishers (OIDC) — no API token secret needed in CI.

One-time setup:

Create the project on https://pypi.org (or first on https://test.pypi.org for a dry-run)
Add a Trusted Publisher pointing to release.yml in this repo, environment pypi
In GitHub repo settings, create the pypi environment (no secrets needed)

Per-release:

# 1. Update version in pyproject.toml + add entry to CHANGELOG.md
# 2. Verify it builds + tests pass
python -m build
pytest tests/

# 3. Tag and push — CI takes over
git tag v0.1.0
git push origin v0.1.0

The release.yml workflow builds the wheel + sdist and publishes to PyPI automatically on tag push.

Contributing

See CONTRIBUTING.md.

License

MIT.

Disclaimer

This is a personal-finance tool. The LLM-judged eligibility verdict is informational, not regulatory advice — always cross-check against the actual DIC/KID before buying.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

andrelaurel

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.1

May 24, 2026

0.2.0

May 24, 2026

This version

0.1.0

May 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pea_audit-0.1.0.tar.gz (17.4 kB view details)

Uploaded May 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pea_audit-0.1.0-py3-none-any.whl (28.6 kB view details)

Uploaded May 24, 2026 Python 3

File details

Details for the file pea_audit-0.1.0.tar.gz.

File metadata

Download URL: pea_audit-0.1.0.tar.gz
Upload date: May 24, 2026
Size: 17.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pea_audit-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`e377de4d4a0b1f498f8ba5f5e4b6056451183a086b8398116a523ad2416138bd`
MD5	`c1cd7c6d0b245ba44b7c2dbb272f9289`
BLAKE2b-256	`9598a4c580ead137d48f46a7a7374b3f02ed91925b83a1fd8c720926479024d3`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pea_audit-0.1.0.tar.gz:

Publisher: release.yml on AndreLiar/pea-audit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pea_audit-0.1.0.tar.gz
- Subject digest: e377de4d4a0b1f498f8ba5f5e4b6056451183a086b8398116a523ad2416138bd
- Sigstore transparency entry: 1624729959
- Sigstore integration time: May 24, 2026
Source repository:
- Permalink: AndreLiar/pea-audit@40bc6eba357adf02e9361ecf5b17e2d6b35ee9a8
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/AndreLiar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@40bc6eba357adf02e9361ecf5b17e2d6b35ee9a8
- Trigger Event: push

File details

Details for the file pea_audit-0.1.0-py3-none-any.whl.

File metadata

Download URL: pea_audit-0.1.0-py3-none-any.whl
Upload date: May 24, 2026
Size: 28.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pea_audit-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`06902ba905e27a6f0426c7e36595f9990f9a0ff6ed46117aeabe10653076081e`
MD5	`d9a7fe9484d3bbe0d7b73f53a75de7e2`
BLAKE2b-256	`3a70036f579d1c5d90dbf19548599d9ae28b1c8187fafc1ff85fd335cd7aa41a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pea_audit-0.1.0-py3-none-any.whl:

Publisher: release.yml on AndreLiar/pea-audit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pea_audit-0.1.0-py3-none-any.whl
- Subject digest: 06902ba905e27a6f0426c7e36595f9990f9a0ff6ed46117aeabe10653076081e
- Sigstore transparency entry: 1624729962
- Sigstore integration time: May 24, 2026
Source repository:
- Permalink: AndreLiar/pea-audit@40bc6eba357adf02e9361ecf5b17e2d6b35ee9a8
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/AndreLiar
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@40bc6eba357adf02e9361ecf5b17e2d6b35ee9a8
- Trigger Event: push

pea-audit 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

pea-audit

Why

Install

Quickstart

Audit by ticker (built-in URL registry)

Architecture

VisionLLM — swap the model

KIDSource — add issuers

Eval baseline

Production niceties

Reference app: ETFTracker

Publishing checklist (maintainer)

Contributing

License

Disclaimer

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`VisionLLM` — swap the model

`KIDSource` — add issuers