Skip to main content

A collection of different evaluators for dna nucleodtide level labeling

Project description

DNA segmentation benchmark

This benchmark provides easy metrics for segmentation tasks beyond the common scores like f1, precision and recall. The main motivation for this benchmark is that computing the segmentation performance of a model through micro averaging over individual nucleotides can lead to very wrong conclusions about the actual quality of a model. Hence, this package provides a range of additional metrics.

Insertion / Deletion / Excision / Incision metric

Looking at the kind of error models make when segmenting can reveal systematic biases and issues. Further more this package allows to also look at the lengths of the different errors.

Error counts

image

Error lengths

image

Whole section correctness metric

Instead of measuring how many errors a model makes, these metrics look at if consecutive sections (e.g. Exons or Introns) were labeled correctly entirely

Correctly predicted sections

image

All sections of are a sequence are correct

This metric has to be used carefully. If using this on exons it only makes sense to use this if it certain that alternate splicing events are not occurring. image

Frameshift metrics

Again, this metric can be incredibly insightful, but you have to be careful how you use it. Unless you are sure that all exons are part of the final transcript for all the benchmarked sequences DON'T USE IT. Your results will be skewed and hold no value. image

Traditional Metrics

image

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dna_segmentation_benchmark-0.0.2.tar.gz (32.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dna_segmentation_benchmark-0.0.2-py3-none-any.whl (33.0 kB view details)

Uploaded Python 3

File details

Details for the file dna_segmentation_benchmark-0.0.2.tar.gz.

File metadata

File hashes

Hashes for dna_segmentation_benchmark-0.0.2.tar.gz
Algorithm Hash digest
SHA256 f8c904621db5fd755bd3f79c3605eeed7780e74cfbb54068863eb3ad13d1b7db
MD5 523ec3e3fb0722e56bb3cbaa2ebb1832
BLAKE2b-256 02731993623a8a8c91a6224010069001d8e6d9773a3c0241340019153cac2f67

See more details on using hashes here.

File details

Details for the file dna_segmentation_benchmark-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for dna_segmentation_benchmark-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a57b1ede1e092eedc4aa02e5ac226a8c21c445f1e8e909b24955cf781ae2afdf
MD5 5b57f3625cd5f70433e0f678a4d1b0d2
BLAKE2b-256 461073bc98a80063062478a477faa551fb48713fab0f651f299e439d87522024

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page