A collection of evaluators for DNA nucleotide-level labeling

DNA segmentation benchmark

This benchmark provides easy-to-use metrics for segmentation tasks that go beyond common scores such as F1, precision, and recall. The main motivation is that measuring segmentation performance by micro-averaging over individual nucleotides can give a very misleading picture of a model's actual quality. This package therefore provides a range of additional metrics.
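To make the motivation concrete, here is a minimal sketch in plain Python. It does not use this package's API: the helper `micro_f1` and the label encoding (1 = exon, 0 = intron) are assumptions made for illustration only. A single mislabeled nucleotide inside an exon barely moves the per-nucleotide scores, even though the predicted exon is now split in two.

```python
def micro_f1(truth, pred, positive=1):
    """Per-nucleotide precision, recall and F1 for one label (hypothetical helper)."""
    pairs = list(zip(truth, pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


# Assumed encoding: 1 = exon, 0 = intron.
truth = [0] * 10 + [1] * 90 + [0] * 10
pred = list(truth)
pred[50] = 0  # one nucleotide in the middle of the exon is labeled as intron

precision, recall, f1 = micro_f1(truth, pred)
print(f"precision={precision:.4f} recall={recall:.4f} f1={f1:.4f}")
# F1 stays above 0.99 although the exon was not recovered as one section.
```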

Insertion / Deletion / Excision / Incision metric

Looking at the kinds of errors a model makes when segmenting can reveal systematic biases and issues. Furthermore, this package also lets you look at the lengths of the different errors.
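The exact definitions of the four error classes are not spelled out here, so the following sketch does not try to reproduce them. It only shows the general idea of collecting contiguous error runs and their lengths. It assumes binary labels (1 = exon, 0 = intron), and the function `error_runs` and the run names `false_positive` / `false_negative` are hypothetical, not part of this package.

```python
from itertools import groupby


def error_runs(truth, pred, positive=1):
    """Return (kind, start, length) for each contiguous run of mislabeled positions.

    Assumes binary labels; `kind` is "false_positive" where the positive label
    was predicted but not true, and "false_negative" for the opposite case.
    """
    def kind(t, p):
        if t == p:
            return None
        return "false_positive" if p == positive else "false_negative"

    runs = []
    pos = 0
    for k, group in groupby(zip(truth, pred), key=lambda tp: kind(*tp)):
        length = sum(1 for _ in group)
        if k is not None:
            runs.append((k, pos, length))
        pos += length
    return runs


truth = [0, 0, 1, 1, 1, 1, 0, 0, 0, 0]
pred = [0, 1, 1, 1, 0, 0, 0, 0, 1, 1]
runs = error_runs(truth, pred)
print(runs)
```

Counting the runs per kind gives error counts, and the third element of each tuple gives the error lengths.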

Error counts

[Figure: error counts]

Error lengths

[Figure: error lengths]

Whole section correctness metric

Instead of measuring how many errors a model makes, these metrics check whether entire consecutive sections (e.g. exons or introns) were labeled correctly.
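As a rough illustration of the idea, the sketch below counts a ground-truth section as correct only if the prediction contains a section with exactly the same start and end. This is an assumed definition, not necessarily the one the package uses, and the helpers `sections` and `section_correctness` are hypothetical names.

```python
from itertools import groupby


def sections(labels, target=1):
    """Return half-open (start, end) intervals of consecutive `target` labels."""
    out, pos = [], 0
    for value, group in groupby(labels):
        length = sum(1 for _ in group)
        if value == target:
            out.append((pos, pos + length))
        pos += length
    return out


def section_correctness(truth, pred, target=1):
    """Return (number of exactly matched sections, number of true sections)."""
    true_secs = sections(truth, target)
    pred_secs = set(sections(pred, target))
    n_correct = sum(1 for s in true_secs if s in pred_secs)
    return n_correct, len(true_secs)


# Assumed encoding: 1 = exon, 0 = intron.
truth = [0, 1, 1, 1, 0, 0, 1, 1, 0]
pred = [0, 1, 1, 1, 0, 1, 1, 1, 0]  # second exon starts one position too early

n_correct, n_total = section_correctness(truth, pred)
all_correct = n_correct == n_total
print(n_correct, n_total, all_correct)
```

The same counts can be turned into a per-sequence flag (`all_correct` above) for the stricter case where every section of a sequence must match.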

Correctly predicted sections

[Figure: correctly predicted sections]

All sections of a sequence are correct

This metric has to be used carefully. On exons, it only makes sense if it is certain that no alternative splicing events occur.

[Figure: all sections of a sequence are correct]

Frameshift metrics

Again, this metric can be very insightful, but you have to be careful how you use it. Unless you are sure that, for every benchmarked sequence, all exons are part of the final transcript, DON'T USE IT. Your results will be skewed and hold no value.

[Figure: frameshift metrics]
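How the package computes frameshifts is not described here. One coarse way to think about it, under the assumption stated above that every exon ends up in the transcript, is to compare the total predicted exon length with the true one: if the difference is not a multiple of three, the downstream reading frame is shifted. The function `has_frameshift` and the label encoding (1 = exon) are hypothetical.

```python
def has_frameshift(truth, pred, exon=1):
    """Coarse check: does the predicted coding length differ from the true
    one by a number of nucleotides that is not a multiple of three?

    Assumes every exon-labeled nucleotide is part of the final transcript.
    """
    true_len = sum(1 for t in truth if t == exon)
    pred_len = sum(1 for p in pred if p == exon)
    return (pred_len - true_len) % 3 != 0


truth = [0, 0] + [1] * 9 + [0, 0]       # 9 exon nucleotides
pred_shifted = [0, 0, 0] + [1] * 8 + [0, 0]   # 8 exon nucleotides: frame is shifted
pred_in_frame = [0] * 5 + [1] * 6 + [0, 0]    # 6 exon nucleotides: frame is kept

print(has_frameshift(truth, pred_shifted))
print(has_frameshift(truth, pred_in_frame))
```

Note that an in-frame prediction can still be wrong (three nucleotides are missing in the second case), which is why this kind of check complements the other metrics rather than replacing them.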

Traditional Metrics

[Figure: traditional metrics]
