Skip to main content

sequence analysis tools for flu research

Project description

build status PyPI

flutile

Installation

pip install flutile

Commands

aadiff

aadiff takes a multiple-sequence alignment as an input and creates a character difference table. This command is designed for preparing amino acid difference tables. Below is an example of a comparison of 4 H1 sequences.

flutile aadiff --subtype=H1 mufile.faa

site A02479030 SD0246 SD0272 SD0136
-3 A T
-2 N S
-1 A T
1 D
154 K
154+1 - X
154+2 - X
155 D S X
156 D G N X

The --subtype=H1 argument tells flutile to align the inputs against an H1 reference (A/United Kingdom/1/1933). The reference is used to determine relative indices (the sites column). The index reference is used only for indexing and does not appear in the final table. The first three rows (sites -3, -2, -1) align to the three residues at the end of the signal peptide. Site 1 is the first residue in the mature peptide. Any gaps in the reference alignment are indexed as <ref_id>+<offset>, for example 154+1 and 154+2 are positions 1 and 2 residues after the reference position 154.

flutile uses the references from (Burke 2014):

--- | ------------------------------------------------- | ------------------- | ---------------------- | H1 | A/United Kingdom/1/1933 | MKARLLVLLCALAATDA | DTICIGYHANNS | H2 | A/Singapore/1/1957 | MAIIYLILLFTAVRG | DQICIGYHANNS | H3 | A/Aichi/2/1968 | MKTIIALSYIFCLPLG | QDLPGNDNSTATLCLGHHAVPN | H4 | A/swine/Ontario/01911–2/1999 | MLSIAILFLLIAEGSS | QNYTGNPVICLGHHAVSN | H5 | A/Vietnam/1203/2004 | MEKIVLLFAIVSLVKS | DQICIGYHANNS | H6 | A/chicken/Taiwan/0705/1999 | MIAIIVIATLAAAGKS | DKICIGYHANNS | H7 | A/Netherlands/219/2003 | MNTQILVFALVASIPTNA | DKICLGHHAVSN | H8 | A/turkey/Ontario/6118/1968 | MEKFIAIAMLLASTNA | YDRICIGYQSNNS | H9 | A/swine/Hong Kong/9/1998 | MEAASLITILLVVTASNA | DKICIGYQSTNS | H10 | A/mallard/bavaria/3/2006 | MYKIVVIIALLGAVKG | LDKICLGHHAVAN | H11 | A/duck/England/1/1956 | MEKTLLFAAIFLCVKA | DEICIGYLSNNS | H12 | A/duck/Alberta/60/1976 | MEKFIILSTVLAASFA | YDKICIGYQTNNS | H13 | A/gull/Maryland/704/1977 | MALNVIATLTLISVCVHA | DRICVGYLSTNS | H14 | A/mallard/Astrakhan/263/1982 | MIALILVALALSHTAYS | QITNGTTGNPIICLGHHAVEN | H15 | A/duck/Australia/341/1983 | MNTQIIVILVLGLSMVRS | DKICLGHHAVAN | H16 | A/black-headed-gull/Turkmenistan/13/1976 | MMIKVLYFLIIVLGRYSKA | DKICIGYLSNNS | H17 | A/little-yellow-shouldered bat/Guatemala/060/2010 | MELIILLILLNPYTFVLG | DRICIGYQANQN | H18 | A/flat-faced bat/Peru/033/2010 | MITILILVLPIVVG | DQICIGYHSNNS |

  • The H3 signal peptide appears to actually be MKTIIALSYIFCLALG

represent

represent takes a multiple-sequence alignment as input and removes entries that are similar in sequence and time. The function requires that headers have a date term (with format year/month/day). For example:

>A|1990-01-02
GATACA
>B|1990-02-02
CATATA

There may be gaps in the alignment. Sequences in the alignment that are separated by less than or equal to --max-day-sep and that are a sequence identity of greather than or equal to --min-pident-sep will be clustered together. A single representative is sampled from each cluster (the latest one with ties resolved by order).

trim

Extract the HA1 regions from (currently) H1 and H3 HA proteins.

References

  1. Burke, Smith (2014) A Recommended Numbering Scheme for Influenza A HA Subtypes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

flutile-0.13.3.tar.gz (65.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

flutile-0.13.3-py3-none-any.whl (67.1 kB view details)

Uploaded Python 3

File details

Details for the file flutile-0.13.3.tar.gz.

File metadata

  • Download URL: flutile-0.13.3.tar.gz
  • Upload date:
  • Size: 65.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.7

File hashes

Hashes for flutile-0.13.3.tar.gz
Algorithm Hash digest
SHA256 fb9bb09cf23cbe9368f5fbabf72fac9d1244f394832a72cb64f437890d4fd460
MD5 4ae039877bc877a72f8632dd34c0eff6
BLAKE2b-256 d0400a0bbb55c19c36018d3b3929de92ed7bd036ec4469379344a34f09c5587a

See more details on using hashes here.

File details

Details for the file flutile-0.13.3-py3-none-any.whl.

File metadata

  • Download URL: flutile-0.13.3-py3-none-any.whl
  • Upload date:
  • Size: 67.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.25.1 setuptools/51.1.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.7

File hashes

Hashes for flutile-0.13.3-py3-none-any.whl
Algorithm Hash digest
SHA256 15758bbab7ff080a9c29f430a3e300e0900669362ea7444e0e16b7919828f93a
MD5 157ef6de0169486711303fd2007f69cd
BLAKE2b-256 156d56cd260ebbc799874d19b9f229d0d95b9d5088e0a52c6f87a351a939fa30

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page