Skip to main content

Separate masked and unmasked parts of sequences in FASTX files.

Project description

splitmasked

pytest-badge

splitmasked splits sequence records in FAST(A/Q) files based on their masking status. What constitutes masking can be defined with the --maskchar option (eg. N or lowercase). Both masked and unmasked parts can be retained and written to separate output files.

Installation

pip install splitmasked

Usage

splitmasked \
    --maskchar lowercase \
    --minlength_masked 100 \
    --minlength_unmasked 20 \
    --outfile_masked /dev/null \
    --outfile_unmasked unmasked.fastq \
    input.fastq

Examples

Input

@Seq1 comment1
aaaaaTTTTTTAAgatgatgatgAATGAA
+
AAAAAAAAAAAAAAAAAAAAAAAAAAAAA
@Seq2 comment2
ATGATAGAgagagtTTTATA
+
HHHHHHHHHHHHHHHHHHHH

Output

With --maskchar lowercase:

unmasked.fastq

@Seq1_part2 comment1
TTTTTTAA
+
AAAAAAAA
@Seq1_part4 comment1
AATGAA
+
AAAAAA
@Seq2_part1 comment2
ATGATAGA
+
HHHHHHHH
@Seq2_part3 comment2
TTTATA
+
HHHHHH

masked.fastq

@Seq1_part1 comment1
aaaaa
+
AAAAA
@Seq1_part3 comment1
gatgatgatg
+
AAAAAAAAAA
@Seq2_part2 comment2
gagagt
+
HHHHHH

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

splitmasked-0.1.1.tar.gz (5.0 kB view details)

Uploaded Source

Built Distribution

splitmasked-0.1.1-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file splitmasked-0.1.1.tar.gz.

File metadata

  • Download URL: splitmasked-0.1.1.tar.gz
  • Upload date:
  • Size: 5.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for splitmasked-0.1.1.tar.gz
Algorithm Hash digest
SHA256 8ac3b678c9bb28f1387e8a31b049c0d16153edf4acc7fd1f8c0258f75bb82cd2
MD5 948dce79774714b931cd717246a6713f
BLAKE2b-256 bef80c6fa602d3cf120f562db2d668e25d6c96b675992078ff799590bb17dbf3

See more details on using hashes here.

File details

Details for the file splitmasked-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: splitmasked-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 5.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for splitmasked-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 590ca763d2a3fe6b5568e0bbfaa088876baead7b0080887694ddca8075a83305
MD5 5058cfb1fe66b1da6f2026bb48e25a0b
BLAKE2b-256 05dd56ddee84f4b47539660671839b968baded3c092d76f714b23b1b3d2b11f8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page