
Speedup fork of cafaeval (CAFA-evaluator-PK) with a bit-exact parity harness


cafaeval-protea

This is a modified fork. See CHANGES.md for the full list of modifications and their dates (required by GPLv3 §5.a).

cafaeval-protea is a fork of CAFA-evaluator-PK by Clara De Paolis, which is itself a fork of CAFA-evaluator by the BioComputing UP group at the University of Padua (Piovesan et al., 2024).

The fork exists to provide a faster evaluator for iterative work, without changing any scoring semantics. Fmax, Smin, their weighted variants, Partial-Knowledge (PK) evaluation, and information-accretion weighting are all preserved and validated against upstream output before any optimization lands.

The Python import path remains cafaeval (identical API) so that existing downstream code using from cafaeval.evaluation import cafa_eval keeps working unchanged. Only the PyPI distribution name differs:

pip install cafaeval-protea          # installs the fork
python -c "from cafaeval.evaluation import cafa_eval"  # same import

Attribution

Upstream authors (primary — always cite)

The original evaluator and all of its scoring logic are the work of:

CAFA-evaluator: A Python Tool for Benchmarking Ontological Classification Methods. D. Piovesan, D. Zago, P. Joshi, M. C. De Paolis Kaluza, M. Mehdiabadi, R. Ramola, A. M. Monzon, W. Reade, I. Friedberg, P. Radivojac, S. C. E. Tosatto. Bioinformatics Advances, 2024. DOI: 10.1093/bioadv/vbae043

Upstream repository: BioComputingUP/CAFA-evaluator. Copyright © 2022 Damiano Piovesan. Licensed under GPLv3.

Direct parent (PK variant)

The Partial-Knowledge evaluation extensions were contributed by Clara De Paolis in claradepaolis/CAFA-evaluator-PK. This fork is branched directly from that repository and inherits its semantics for -known annotations and terms-of-interest filtering.

Speedup ideas and cherry-picked commits

The Phase A algorithmic speedups in this fork (weighted-only fast path, cached per-term children lists, fill-mode restricted to zero rows, incremental non-zero counter in the prediction parser, shared-memory parallel DAG propagation, fork-pool initializer pattern for the threshold sweep) come from Antonina Dolgorukova (T0chka) and her public fork T0chka/CAFA-evaluator-PK-speedup, which she shared in the CAFA 6 Kaggle discussion "Speeding up cafaeval" (post #664359).

The five substantive commits from her speedup-local branch were cherry-picked into this fork with authorship preserved (see git log). On top of that, we added: dead-code removal, structured cafaeval.* logging, extension of the fork-pool initializer pattern from the NK/LK branch to the PK (gt_exclude) branch of compute_metrics, and the parity harness under bench/ and tests/diff/.

If you use the speedups in published work, please acknowledge the upstream paper above, Clara De Paolis' PK fork, and Antonina Dolgorukova's speedup work.


Scope of modifications

This fork modifies the following parts of the upstream:

| Area | Upstream module | Change | Status | Validation |
|---|---|---|---|---|
| Parser | src/cafaeval/parser.py | (A) incremental non-zero counter, buffered reads, single dict lookup per term; (B3) PyArrow-backed vectorised parser with dictionary-encoded pid/tid and sort-based per-namespace group-max | done | bit-exact on real corpora |
| Propagation | src/cafaeval/graph.py | (A) cached per-term children lists, fill-mode restricted to zero rows, shared-memory spawn worker; (B4) sparse push-up kernel with flat ancestor CSR and np.maximum.reduceat group-max over input non-zeros | done | bit-exact in A, rtol=1e-6 in B |
| NK/LK metric | src/cafaeval/evaluation.py | (A) weighted-only fast path, fork-pool initializer for threshold sweep; (B1) sparse confusion-matrix kernel via np.bincount scatter + right-to-left cumsum | done | bit-exact |
| PK metric | src/cafaeval/evaluation.py | (A) fork-pool initializer pattern extended to the gt_exclude branch; (B2) sparse PK kernel with boolean-mask filter (pred != 0) & toi_mask & ~excluded_mask | done | rtol=1e-6 in B (ULP reorder) |
| Logging | (new) | Structured stdlib logging at module granularity (see Logging below) | done | n/a |
| Orchestrator | src/cafaeval/__main__.py | Thin reshuffling only; no semantic change | done | bit-exact |

Detailed per-commit diff against the upstream is maintained in CHANGES.md.
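To make the propagation row concrete, here is a minimal sketch of the sort-based reduceat group-max idea behind the B4 kernel. The function name, the per-protein framing, and the flat-CSR layout (anc_indptr/anc_indices listing each term's ancestors, including itself) are assumptions for illustration, not the fork's actual API:

```python
import numpy as np

def propagate_max(scores, anc_indptr, anc_indices, n_terms):
    """Push each non-zero term score up to all of its ancestors;
    an ancestor keeps the max of everything pushed into it."""
    out = np.zeros(n_terms)
    nz = np.flatnonzero(scores)          # only input non-zeros contribute
    if nz.size == 0:
        return out
    counts = anc_indptr[nz + 1] - anc_indptr[nz]
    # Gather (ancestor, value) pairs for every non-zero term.
    anc = np.concatenate([anc_indices[anc_indptr[t]:anc_indptr[t + 1]] for t in nz])
    vals = np.repeat(scores[nz], counts)
    # Sort by ancestor id so equal ancestors form contiguous groups.
    order = np.argsort(anc, kind="stable")
    anc, vals = anc[order], vals[order]
    starts = np.flatnonzero(np.r_[True, anc[1:] != anc[:-1]])
    # One reduceat call computes the max of every group at once.
    out[anc[starts]] = np.maximum.reduceat(vals, starts)
    return out
```

The cost scales with the number of input non-zeros times the average ancestor count, not with the dense term matrix.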

Validation policy

No optimization lands in this fork without a passing parity test against a frozen upstream oracle. The oracle is built in bench/ by running the unmodified upstream against a set of deterministic synthetic corpora (tiny / medium / large) and serializing the full output — Fmax, Smin, weighted Fmax, weighted Smin, precision-recall curves, optimal thresholds — into bench/oracle/*.pkl. The diff tests under tests/diff/ reload that oracle and compare the fork's output:

  • Phase A (parser cherry-picks, cached children, weighted-only, zero-row propagation, NK sparse kernel, sparse propagate): atol=0, rtol=0. A single-bit divergence is a bug.
  • Phase B (sparse PK confusion matrix): rtol=1e-6, atol=1e-9. The PK sparse kernel reorders per-protein inner sums, so a ULP-level (~4e-16) divergence in pr is expected and tolerated.

The active phase is controlled by the CAFAEVAL_PARITY_PHASE env var (A or B). The default flipped from A to B when the Phase B2 PK kernel landed. On a 4.45M-row real corpus the fork agrees with unmodified upstream to 2.1e-14 in PK and 1.9e-14 in NK, well inside the Phase B tolerance.

No result from this fork is trusted until the relevant corpus passes its diff test.
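A diff test in this spirit might look like the sketch below; the oracle keys, the tolerance wiring, and the function name are assumptions for illustration (the real tests live under tests/diff/):

```python
import os
import numpy as np

# Phase-dependent tolerances, as described in the validation policy.
PHASE_TOL = {"A": dict(rtol=0.0, atol=0.0), "B": dict(rtol=1e-6, atol=1e-9)}

def check_parity(fork_out, oracle_out, phase=None):
    """Compare the fork's output dict against a frozen oracle dict,
    raising AssertionError on any divergence beyond the phase tolerance."""
    phase = phase or os.environ.get("CAFAEVAL_PARITY_PHASE", "B")
    tol = PHASE_TOL[phase]
    for key, expected in oracle_out.items():
        np.testing.assert_allclose(fork_out[key], expected, **tol,
                                   err_msg=f"divergence in {key} (phase {phase})")
```

With phase A, rtol=atol=0 means assert_allclose only passes on bit-identical values, matching the "a single-bit divergence is a bug" rule.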


Performance

End-to-end wall time of a full cafa_eval(...) call (OBO load → ground truth parse → prediction parse + propagate → confusion matrix sweep → metric assembly → best-row aggregation), measured on a single workstation at n_cpu=1 and the CAFA-default th_step=0.01 (99 thresholds). "Upstream" is unmodified claradepaolis/CAFA-evaluator-PK at commit 16a6a6d. Corpus: real CAFA 6 PROTEA artifacts (8 712 BP / 4 992 MF / 5 125 CC ground-truth proteins, ~700 k-row prediction file, known_terms.tsv exclude set for PK).

| Mode | Upstream | Fork (B1–B7) | Speedup |
|---|---|---|---|
| NK | 92.96 s | 4.08 s | 22.8× |
| PK | 418.53 s | 10.33 s | 40.5× |

The sparse confusion-matrix kernels (Phase B1/B2) are approximately flat in n_tau — moving from th_step=0.05 (20 thresholds) to the CAFA default th_step=0.01 (99 thresholds) costs the fork virtually nothing, while upstream's per-threshold scan scales linearly. Hence the speedup ratio grows with n_tau.
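Why a bucket-and-cumsum kernel is flat in n_tau can be shown in a few lines; this is an illustrative reimplementation of the idea, not the fork's B1 code:

```python
import numpy as np

def tp_per_threshold(scores, labels, taus):
    """TP(tau) for all thresholds at once (taus sorted ascending,
    prediction counted positive when score >= tau)."""
    pos = scores[labels]  # scores that would be true positives
    # Index of the largest tau each score still clears (-1 = none).
    bins = np.searchsorted(taus, pos, side="right") - 1
    counts = np.bincount(bins[bins >= 0], minlength=len(taus))
    # A score clearing bin k also clears every lower tau -> suffix sum.
    return np.cumsum(counts[::-1])[::-1]
```

Each prediction is binned exactly once, and a single right-to-left cumsum yields the count at every threshold; an extra threshold adds one bin, not another pass over the score matrix.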

Where the fork's remaining time goes on PK end-to-end (10 s):

| Phase | Time |
|---|---|
| OBO parse | 1.9 s |
| Ground truth parse + propagate | 3.8 s |
| Prediction parse + propagate (PyArrow) | 2.4 s |
| compute_metrics × 3 namespaces (sparse PK) | 1.8 s |
| Eval/normalise/aggregate plumbing | 0.4 s |

Runtime knobs

All optimizations are gated by environment variables so the legacy path is always available for A/B comparison or debugging.

| Var | Default | Effect |
|---|---|---|
| CAFAEVAL_SPARSE | 1 | Sparse NK + PK confusion-matrix kernels and sparse push-up propagation. Set to 0 to fall back to the dense/pool path. |
| CAFAEVAL_FAST_PARSER | 1 | PyArrow-backed vectorised pred_parser. Set to 0 to force the legacy per-line loop. Also falls back automatically when max_terms is set or the fast path raises. |
| CAFAEVAL_PARITY_PHASE | B | Tolerance used by tests/diff/test_oracle_parity.py: A for bit-exact, B for rtol=1e-6, atol=1e-9. |
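For example, an A/B comparison of the fast and legacy paths needs only the environment variables. The positional CLI arguments below are placeholders; see README_upstream.md for the actual invocation:

```shell
# Defaults: sparse kernels + PyArrow parser
cafaeval OBO_FILE PRED_DIR GT_FILE

# Legacy dense/pool path and per-line parser, for comparison or debugging
CAFAEVAL_SPARSE=0 CAFAEVAL_FAST_PARSER=0 cafaeval OBO_FILE PRED_DIR GT_FILE
```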

Install

pip install cafaeval-protea               # core install (numpy + pandas + matplotlib)
pip install "cafaeval-protea[fast]"       # enables the PyArrow parser fast path

The hard dependency set is kept at numpy + pandas + matplotlib; pyarrow>=12 is an optional [fast] extra. Without it, pred_parser automatically falls back to the legacy loop.
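The optional-dependency fallback can be sketched like this; the names here are illustrative, not the fork's actual pred_parser wiring:

```python
# Probe for the optional [fast] extra once at import time.
try:
    import pyarrow.csv  # noqa: F401
    HAVE_PYARROW = True
except ImportError:
    HAVE_PYARROW = False

def pick_parser(force_legacy=False):
    """Choose the parser path: legacy per-line loop unless PyArrow is
    available and the fast path is not explicitly disabled."""
    if force_legacy or not HAVE_PYARROW:
        return "legacy"
    return "pyarrow"
```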


Logging

The upstream evaluator is silent at the library boundary, which makes long-running calls inside other pipelines opaque. This fork adds structured logging using the stdlib logging module (no new dependencies), organized as a proper logger hierarchy:

cafaeval                    # root logger
├── cafaeval.parser         # parsing predictions / ground truth
├── cafaeval.propagate      # DAG propagation
├── cafaeval.metrics        # compute_metrics
└── cafaeval.eval           # orchestrator

Conventions:

  • INFO: high-level events with timing, e.g. "parser: parsed 12345 proteins in 3.21s".
  • DEBUG: per-namespace, per-threshold detail, matrix shapes.
  • WARNING: non-fatal anomalies (terms missing from the ontology, proteins without ground truth).
  • No print() calls, no basicConfig() inside the library. Handler configuration is always the consumer's responsibility.
  • Structured fields are passed via logger.info(..., extra={...}) so that downstream consumers can extract machine-readable payloads without parsing log strings.
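On the library side, the convention looks roughly like the sketch below; the message text and the extra-field names are illustrative, not the fork's exact log records:

```python
import logging
import time

logger = logging.getLogger("cafaeval.parser")

def parse(path):
    t0 = time.perf_counter()
    n_proteins = 12345  # ... actual parsing work would happen here ...
    # Human-readable message plus a machine-readable field via extra=;
    # consumers can read record.n_proteins without parsing the string.
    logger.info("parser: parsed %d proteins in %.2fs",
                n_proteins, time.perf_counter() - t0,
                extra={"n_proteins": n_proteins})
    return n_proteins
```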

A downstream project that wants to capture these logs only needs:

import logging

logger = logging.getLogger("cafaeval")
logger.setLevel(logging.INFO)
logger.addHandler(logging.StreamHandler())  # or attach any handler you prefer

Usage

The CLI and library interfaces are unchanged from the upstream. See README_upstream.md for the full input-file formats, command-line flags, and output layout. The cafaeval console script and the cafa_eval(...) Python entry point continue to work exactly as documented there.


License

GNU General Public License v3 (GPLv3), inherited unchanged from the upstream. See LICENCE.md.

Per GPLv3 §5.a, modifications introduced by this fork are documented in CHANGES.md with their dates. The original upstream copyright notice (© 2022 Damiano Piovesan) is preserved verbatim in LICENCE.md and is not superseded by the existence of this fork.
