Skip to main content

A Python3 annotation program to select the best gene model in each locus

Project description

Mikado is a lightweight Python3 pipeline whose purpose is to facilitate the identification of expressed loci from RNA-Seq data * and to select the best models in each locus.

The logic of the pipeline is as follows:

  1. In a first step, the annotation (provided in GTF/GFF3 format) is parsed to locate superloci of overlapping features on the same strand.

  2. The superloci are divided into different subloci, each of which is defined as follows:

    • For multiexonic transcripts, to belong to the same sublocus they must share at least a splicing junction (i.e. an intron)

    • For monoexonic transcripts, they must overlap for at least one base pair

    • All subloci must contain either only multiexonic or only monoexonic transcripts

  3. In each sublocus, the pipeline selects the best transcript according to a user-defined prioritization scheme.

  4. The resulting monosubloci are merged together, if applicable, into monosubloci_holders

  5. The best non-overlapping transcripts are selected, in order to define the loci contained inside the superlocus.

    • At this stage, monoexonic and multiexonic transcript are checked for overlaps

    • Moreover, two multiexonic transcripts are considered to belong to the same locus if they share a splice site (not junction)

  6. Once the loci have been defined, the program backtracks and looks for transcripts which can be assigned unambiguously to a single locus and constitute valid alternative splicing isoforms of the main transcripts.

The criteria used to select the “best” transcript are left to the user’s discretion, using specific configuration files.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Mikado-2.2.3.tar.gz (25.2 MB view details)

Uploaded Source

File details

Details for the file Mikado-2.2.3.tar.gz.

File metadata

  • Download URL: Mikado-2.2.3.tar.gz
  • Upload date:
  • Size: 25.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.9.2

File hashes

Hashes for Mikado-2.2.3.tar.gz
Algorithm Hash digest
SHA256 bdbdbdeb56bab4a4d633a0be36aea1aac3c025763a5d6721c98982ac1bd17bf9
MD5 a94a38830f99791cb9c7afe1aa850d26
BLAKE2b-256 28dddec1ae4c7519971dc80a780dde045fce221adffba00535e3ff26de7c29ba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page