Skip to main content

Read sequences in (compressed) FASTX files or from stdin

Project description

streamseqs

pytest-badge

streamseqs is a tiny package to quickly read in sequences from stdin or from (compressed) FAST(A/Q) files from disk.

It only performs barebones error checking and parsing and expects its input to be reasonably sane. If you need more sophisticated parsing and error checking, you are probably better of using Biopython.

Installation

pip install streamseqs

Usage

from streamseqs import stream_records

# Stream from file
for record in stream_records("seqs.fasta"):
    process(record)

# Records are dicts:
# {"id": "sequence header", "seq": "ATGCT", "qual": "HHHHH"}
# For FASTA files, "qual" is `None`.

# Gzip compressed files are transparently handled
for record in stream_records("seqs.fastq.gz"):
    process(record)

# Can stream from stdin. Expects a stream, not a TTY!
import sys
for record in stream_records(sys.stdin):
    process(record)

# Format record for writing - as FASTQ if "qual" is present, as FASTA if not.
from streamseqs import format_record
format_record(record)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

streamseqs-0.1.0.tar.gz (4.1 kB view hashes)

Uploaded Source

Built Distribution

streamseqs-0.1.0-py3-none-any.whl (4.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page