Skip to main content

No project description provided

Project description

protfasta

Build Status

Release 0.1.12 (March 2023)

Overview

protfasta - a robust parser for protein-based FASTA files.

Documentation

For all documentation see https://protfasta.readthedocs.io/en/latest/.

For code see https://github.com/holehouse-lab/protfasta.

Installation

protfasta has been tested on Linux and macOS. It should also work on Windows but we haven't tested it there yet.

protfasta can be downloaded and installed directly from PyPI using pip:

pip install protfasta

If this has worked, the pfasta command-line tool should be available from the command-line

pfasta --help

And you're done. This also means you can now import and use protfasta in your Python workflow.

Simple example

import protfasta

# sequences is now a dictionary where keys are FASTA headers and values are sequences.
sequences = protfasta.read_fasta('inputfile.fasta')

Errors and help

For bug reports or errors please raise an issue on this github repository (see the Issues tab at the top).

Changelog

  • 0.1.12 (March 2023) - integrated in check_header_parser flag via pull request from the amazing Friedlab !

  • Added in append_to_fasta flag so you can append to an existing FASTA file (thanks Ryan!)

  • 0.1.11 (Sept 17th 2022) - re-wrote code for checking duplicate sequence to make it O(1) instead of O(n) for number of sequences (:-/) and added convert-remove option for invalid_sequences

  • 0.1.9 (Sept 12th 2021) - added in robustness for whitespace in sequence files, which, bizarrely, was not present (i.e. added as an invalid residue type but can now be converted).

Copyright

Copyright (c) 2020-2021, Alex Holehouse - Holehouse lab. protfasta is released under the MIT license. The codebase is well structured and relatively simple, lending it to feature addition. We welcome pull-requests assuming contributed code maintains an appropriate level of clarity and robustness.

Acknowledgements

Many of the software-engineering tools and approaches used in the development of protfasta come from resources developed by the Molecular Sciences Software Institute.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

protfasta-0.1.12.tar.gz (142.6 kB view details)

Uploaded Source

File details

Details for the file protfasta-0.1.12.tar.gz.

File metadata

  • Download URL: protfasta-0.1.12.tar.gz
  • Upload date:
  • Size: 142.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.12

File hashes

Hashes for protfasta-0.1.12.tar.gz
Algorithm Hash digest
SHA256 9b211b780ed1e64edd52ca7a68a518a6b40e0547988cedf1a01c5590065a1d36
MD5 53f99dab8f9f93da9e7384640d12f3b5
BLAKE2b-256 b9bc0e7012b9844b098b6b121ac49632f72ac415c2ebc19dbf309a045011d7b9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page