For manipulating taf and maf format alignments
Project description
Taffy
This is a MIT license C, Python and CLI library for manipulating/reading/writing TAF (described below) and MAF format multiple sequence alignments. It allows conversion between the formats and manipulation of the alignments with a number of useful utilities for preparing them for different use cases. The Python library is built on top of the C library and is therefore quite fast.
###Taf Format Specification
See the Taf format page for a specification of the taf format and example.
###Installation
See C/CLI Install for how to build and install this source for using the C library and CLI utilities.
See Python install for how to install the Python library.
CLI Utilities
See taffy utilities for a description of the many useful taffy utilities, including:
- view - MAF / TAF conversion and region extraction
- norm - normalize TAF blocks
- add-gap-bases - add sequences from HAL or FASTA files into TAF gaps
- index - create a .tai index (required for region extraction)
- sort - sort the rows of a TAF file to a desired order
- stats - print statistics of a TAF file
- coverage - print coverage statistics of a given genome in a TAF file
Using the Python API
See using the Python API for how to work with MAF/TAF alignments using a convenient Python API designed to complement the CLI.
See the example notebook for a quick worked example of using the Python API for machine learning with PyTorch.
Using the C Library
There is also a simple C library for working with taf/maf files. See taf.h in the inc directory.
Comparing MAF and TAF file sizes
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for taffy-0.0.3-cp39-cp39-macosx_13_0_arm64.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ce0a97f19fc69338cd85a8e87f5b6aad402fa1b734a65a1a964050f168426956 |
|
MD5 | a9a8d76c596275105a75883ca2a36dbd |
|
BLAKE2b-256 | c11ab6e0bbc5ec14bf954a577fe2fb616b85ce94581f153bb25d6d9f88f23b35 |