Map profile HMMs of transposon termini to genomic sequences for annotation of cryptic transposon variants.

These details have not been verified by PyPI

Project links

Intended Audience
- Science/Research
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Bio-Informatics

Project description

TIRmite

Autonomous examples of transposons, belonging to many distinct super-families, share two common properties: A gene or genes encoding the mode of transposition; and terminal sequence features that are recognised by these gene products as the element boundaries.

Proper classification of transposons and grouping into families relies on both phylogeny of conserved sequences and conservation of transposition mechanism.

However, not all TE instances are created equal — inhabiting the nulear soup of their host genome, where your brother's transposase is as good as your own, non-autonomous variants (lacking their own functional hardware) proliferate.

MITEs are a classic example of this - derived from autonomous DNA elements with Terminal Inverted Repeats, they are Miniature Inverted-repeat Transposable Elements, sometimes little more than a pair of TIRs.

When non-autonomous structural variants of a TE vastly outnumber their parent element, and include forms that capture novel genes (or other full transposons!), it becomes difficult to correctly cluster related elements based on the limited signal present in terminal sequences (TIRs, LTRs, etc).

TIRmite employs profile Hidden Markov Models (HMMs) to model natural variation in transposon termini and recover divergent and degraded hits that are often missed by sequence-based aligners like BLAST.

An iterative pairing algorithm is then used to annotate cryptic transposon variants with variable internal sequence compositions.

The elements extracted by TIRmite generally represent structuaral variants derived from an autonomous ancestor and may be further clustered into families.

About TIRmite
Options and usage
Algorithm overview
Contributing
Issues
License

About TIRmite

TIRmite will use profile-HMM models of Transposon Terminal Repeats for genome-wide annotation of transposon families. You can search for TE families with symmetrical termini (i.e. TIRs or LTRs) or asymmetrical elements with different conserved features at either end (i.e. Helitrons, Helentrons, and Starship elements).

Three classes of output are produced:

All significant termini hit sequences are written to fasta (per query HMM).
Candidate elements comprised of paired termini are written to fasta (per query HMM).
Genomic annotations of candidate elements and, optionally, HMM hits (paired and unpaired) are written as a single GFF3 file.

Options and usage

Installing TIRmite

TIRmite requires Python >= v3.9

Dependencies:

HMMER3
mafft
BLAST+ (Optional)

You can create a Conda environment with these dependencies using the environment.yml file in this repo.

conda env create -f environment.yml

conda activate tirmite

Installation options:

pip install the latest development version directly from this repo.

pip install git+https://github.com/Adamtaranto/TIRmite.git

Install latest release from PyPi.

pip install tirmite

Install latest release (with dependencies) from Bioconda.

conda install -c bioconda tirmite

Test installation.

# Print version number and exit.
% tirmite --version
tirmite 1.3.0

# Get usage information
% tirmite --help

Example usage

See the online tutorials for detailed walkthroughs of each module:

Building HMMs — build a profile HMM from a seed TIR/LTR sequence using tirmite seed.
Using tirmite search — ensemble BLAST/nhmmer search and hit merging.
Using tirmite pair — pairing terminus hits, flank extraction and target site reconstruction.
Reconstructing and validating target sites — validate reconstructed target sites with tirmite validate.

Quick start

GENOME="genome.fa"
HMMFILE="MY_TIR.hmm"
NHMMERFILE="MY_TIR_nhmmer_hits.tab"

# 1. Search genome for terminus hits
nhmmer --dna --cpu 8 --tblout $NHMMERFILE $HMMFILE $GENOME

# 2. Pair hits and write elements + GFF3
tirmite pair \
  --genome $GENOME \
  --nhmmer-file $NHMMERFILE \
  --hmm-file $HMMFILE \
  --orientation F,R \
  --mincov 0.4 \
  --maxdist 20000 \
  --gff-report all \
  --gff \
  --outdir MY_TIR_OUTPUT

To also reconstruct target sites (see tutorial):

tirmite pair \
  --genome $GENOME \
  --nhmmer-file $NHMMERFILE \
  --hmm-file $HMMFILE \
  --orientation F,R \
  --mincov 0.6 \
  --maxdist 20000 \
  --flank-len 30 \
  --tsd-length 2 \
  --outdir MY_TIR_OUTPUT \
  --gff

Standard options

Run tirmite --help to view available subcommands:

tirmite --help
usage: tirmite [-h] [--version] COMMAND ...

TIRmite: Transposon Terminal Repeat detection suite

positional arguments:
  COMMAND     Available subcommands
    legacy    Original TIRmite workflow (HMM search + pairing)
    seed      Build HMM models from seed sequences
    pair      Pair precomputed nhmmer hits
    search    Ensemble search: merge hits from clustered features
    validate  Validate reconstructed target sites

options:
  -h, --help  show this help message and exit
  --version   show program's version number and exit

Algorithm overview

Use nhmmer (or BLAST) to query genome with termini models/sequences.
Import all hits under --maxeval threshold.
For each significant terminus match, identify candidate partners, where: - Hit is on the same sequence. - Hit is in correct relative orientation. - Distance is <= --maxdist. - Hit length is >= (model/query length * --mincov prop)
Rank candidate partners by distance downstream of positive-strand hits, and upstream of negative-strand hits.
Pair reciprocal top candidate hits.
For unpaired hits, find nearest unpaired candidate partner and check for reciprocity.
If the first unpaired candidate is non-reciprocal, check for 2nd-order reciprocity (is outbound top-candidate of current candidate reciprocal.)
Iterate steps 6-7 until all termini hits are paired OR number of iterations without new pairing exceeds --stable-reps.

Ensemble search with `tirmite search` (optional pre-processing step)

For complex scenarios with multiple sub-type HMMs or asymmetric element families, use tirmite search to merge and filter hits before pairing:

Run BLAST and/or nhmmer with multiple query models simultaneously.
Optionally merge overlapping hits from clustered component features (cluster map).
When a pairing map is provided:
- Step 0: Exclude hits from models not listed in the pairing map.
- Step 1: Remove nested weak hits within each direct left/right pair.
- Step 2: Remove lower-quality cross-model hits at shared genomic loci.
- Emit a structured filter summary report covering all three steps (per-model exclusion counts, nesting relationships, and per-pair cross-model removal counts).
Output a filtered, merged hit table ready for tirmite pair.
Optionally write separate left/right hit files (--split-paired-output) for asymmetric elements.

Contributing

If you would like to add a new feature or fix a bug, please see our contribution guidelines.

Open an issue
Fork the repo
Follow the dev env setup instructions below

# Clone this repo (or your own fork)
git clone https://github.com/Adamtaranto/TIRmite.git && cd TIRmite
# Install custom conda env
conda env create -f environment.yml
# Activate conda env
conda activate tirmite
# Install an editable copy of the package
pip install -e '.[dev]'
# Enable pre-commit checks
pre-commit install

Issues

Submit feedback to the Issue Tracker

License

Software provided under GPL-3 license.

Star History

Project details

These details have not been verified by PyPI

Project links

Intended Audience
- Science/Research
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Bio-Informatics

Release history Release notifications | RSS feed

This version

1.4.0

Apr 20, 2026

1.3.0

Jan 14, 2026

1.2.0

Apr 7, 2025

1.1.6

Oct 31, 2024

1.1.5

Oct 28, 2024

1.1.4

Dec 25, 2019

1.1.3

Nov 29, 2018

1.1.1

Nov 5, 2018

1.1.0

Oct 11, 2017

1.0.0

Oct 2, 2017

0.1.0

Sep 1, 2017

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tirmite-1.4.0.tar.gz (392.2 kB view details)

Uploaded Apr 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tirmite-1.4.0-py3-none-any.whl (147.4 kB view details)

Uploaded Apr 20, 2026 Python 3

File details

Details for the file tirmite-1.4.0.tar.gz.

File metadata

Download URL: tirmite-1.4.0.tar.gz
Upload date: Apr 20, 2026
Size: 392.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: Hatch/1.16.5 cpython/3.12.13 HTTPX/0.28.1

File hashes

Hashes for tirmite-1.4.0.tar.gz
Algorithm	Hash digest
SHA256	`62b135fa8b42f8a45cd649018b941a68e7437663266d59ce62b263cb1b2c6ec9`
MD5	`2d3d901bb5ab38b42f16b1833ab1566e`
BLAKE2b-256	`10b5f28cb4005793f7cdb8e8c376539b9708d91c4ebb4479033513cec68c40c3`

See more details on using hashes here.

File details

Details for the file tirmite-1.4.0-py3-none-any.whl.

File metadata

Download URL: tirmite-1.4.0-py3-none-any.whl
Upload date: Apr 20, 2026
Size: 147.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: Hatch/1.16.5 cpython/3.12.13 HTTPX/0.28.1

File hashes

Hashes for tirmite-1.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3871a670fdb53de68d32982661decf3750a222d6ab5a51baf887d95c9f1ce309`
MD5	`d09c6f874a8dfe883c6a29c7d99fcc68`
BLAKE2b-256	`66637d60c70ed8212df7219029278e108082ac2b194d28f1783aea7975b33cb6`

See more details on using hashes here.

tirmite 1.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TIRmite

Table of contents

About TIRmite

Options and usage

Installing TIRmite

Example usage

Quick start

Standard options

Algorithm overview

Ensemble search with `tirmite search` (optional pre-processing step)

Contributing

Issues

License

Star History

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

tirmite 1.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TIRmite

Table of contents

About TIRmite

Options and usage

Installing TIRmite

Example usage

Quick start

Standard options

Algorithm overview

Ensemble search with tirmite search (optional pre-processing step)

Contributing

Issues

License

Star History

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Ensemble search with `tirmite search` (optional pre-processing step)