Skip to main content

sequence analysis tools for flu research

Project description

build status PyPI

flutile

Installation

pip install flutile

Commands

aadiff

aadiff takes a multiple-sequence alignment as an input and creates a character difference table. This command is designed for preparing amino acid difference tables. Below is an example of a comparison of 4 H1 sequences.

flutile aadiff --subtype=H1 mufile.faa

site A02479030 SD0246 SD0272 SD0136
-3 A T
-2 N S
-1 A T
1 D
154 K
154+1 - X
154+2 - X
155 D S X
156 D G N X

The --subtype=H1 argument tells flutile to align the inputs against an H1 reference (A/United Kingdom/1/1933). The reference is used to determine relative indices (the sites column). The index reference is used only for indexing and does not appear in the final table. The first three rows (sites -3, -2, -1) align to the three residues at the end of the signal peptide. Site 1 is the first residue in the mature peptide. Any gaps in the reference alignment are indexed as <ref_id>+<offset>, for example 154+1 and 154+2 are positions 1 and 2 residues after the reference position 154.

flutile uses the references from (Burke 2014):

--- | ------------------------------------------------- | ------------------- | ---------------------- | H1 | A/United Kingdom/1/1933 | MKARLLVLLCALAATDA | DTICIGYHANNS | H2 | A/Singapore/1/1957 | MAIIYLILLFTAVRG | DQICIGYHANNS | H3 | A/Aichi/2/1968 | MKTIIALSYIFCLPLG | QDLPGNDNSTATLCLGHHAVPN | H4 | A/swine/Ontario/01911–2/1999 | MLSIAILFLLIAEGSS | QNYTGNPVICLGHHAVSN | H5 | A/Vietnam/1203/2004 | MEKIVLLFAIVSLVKS | DQICIGYHANNS | H6 | A/chicken/Taiwan/0705/1999 | MIAIIVIATLAAAGKS | DKICIGYHANNS | H7 | A/Netherlands/219/2003 | MNTQILVFALVASIPTNA | DKICLGHHAVSN | H8 | A/turkey/Ontario/6118/1968 | MEKFIAIAMLLASTNA | YDRICIGYQSNNS | H9 | A/swine/Hong Kong/9/1998 | MEAASLITILLVVTASNA | DKICIGYQSTNS | H10 | A/mallard/bavaria/3/2006 | MYKIVVIIALLGAVKG | LDKICLGHHAVAN | H11 | A/duck/England/1/1956 | MEKTLLFAAIFLCVKA | DEICIGYLSNNS | H12 | A/duck/Alberta/60/1976 | MEKFIILSTVLAASFA | YDKICIGYQTNNS | H13 | A/gull/Maryland/704/1977 | MALNVIATLTLISVCVHA | DRICVGYLSTNS | H14 | A/mallard/Astrakhan/263/1982 | MIALILVALALSHTAYS | QITNGTTGNPIICLGHHAVEN | H15 | A/duck/Australia/341/1983 | MNTQIIVILVLGLSMVRS | DKICLGHHAVAN | H16 | A/black-headed-gull/Turkmenistan/13/1976 | MMIKVLYFLIIVLGRYSKA | DKICIGYLSNNS | H17 | A/little-yellow-shouldered bat/Guatemala/060/2010 | MELIILLILLNPYTFVLG | DRICIGYQANQN | H18 | A/flat-faced bat/Peru/033/2010 | MITILILVLPIVVG | DQICIGYHSNNS |

  • The H3 signal peptide appears to actually be MKTIIALSYIFCLALG

represent

represent takes a multiple-sequence alignment as input and removes entries that are similar in sequence and time. The function requires that headers have a date term (with format year/month/day). For example:

>A|1990-01-02
GATACA
>B|1990-02-02
CATATA

There may be gaps in the alignment. Sequences in the alignment that are separated by less than or equal to --max-day-sep and that are a sequence identity of greather than or equal to --min-pident-sep will be clustered together. A single representative is sampled from each cluster (the latest one with ties resolved by order).

trim

Extract the HA1 regions from (currently) H1 and H3 HA proteins.

References

  1. Burke, Smith (2014) A Recommended Numbering Scheme for Influenza A HA Subtypes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flutile-1.0.1.tar.gz (66.6 kB view details)

Uploaded Source

Built Distribution

flutile-1.0.1-py3-none-any.whl (68.2 kB view details)

Uploaded Python 3

File details

Details for the file flutile-1.0.1.tar.gz.

File metadata

  • Download URL: flutile-1.0.1.tar.gz
  • Upload date:
  • Size: 66.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.8.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for flutile-1.0.1.tar.gz
Algorithm Hash digest
SHA256 75824a9dbc1623878d20307f121dbe46f3784bf453a072b5efcf0872197e7c05
MD5 da9434d4f66f1fffbf22898cb34ecfb2
BLAKE2b-256 0c269538d66816f8d9415331fc97b5b261717ae879a978dc6acfb359ffab127e

See more details on using hashes here.

File details

Details for the file flutile-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: flutile-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 68.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/32.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.8 tqdm/4.62.3 importlib-metadata/4.8.1 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.10.2

File hashes

Hashes for flutile-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f83ca882961d17af4e8121d0f80667869c3fdab6627b8a9a63c4ae963cff2c35
MD5 c7ce28c5f49ec00dbb1d9263d771da1d
BLAKE2b-256 07068985325c72d4e3b70ffe2f3d2ddd697bbd7726c62a7c71e87afbc0d1e923

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page