Skip to main content

Sanitise protein FASTA files / data

Project description

tidyfasta

A python program to tidy and sanitise FASTA sequence files

Problems and fixes

Problem Fix
Sequence without ID ID name added
ID without sequence Exception raised
Multiline sequence One line per sequence
Non canonical AA Exception raise
Dangerous characters in ID Exception raise
Lowercase AA Converts to uppercase AA
Excessive Whitespace Removes excessive whitespace

Usage

tidyfasta.py --input file.FASTA
tidyfasta.py --input file.FASTA --single
tidyfasta.py --input file.FASTA --single --strict

Output

  1. Tidied version of original file

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tidyfasta-1.0.0.tar.gz (3.4 kB view hashes)

Uploaded Source

Built Distribution

tidyfasta-1.0.0-py3-none-any.whl (4.1 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page