Separate masked and unmasked parts of sequences in FASTX files.
splitmasked
splitmasked splits sequence records in FAST(A/Q) files based on their masking status. What constitutes masking can be defined with the --maskchar option (e.g. N or lowercase). Both masked and unmasked parts can be retained and written to separate output files.
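The splitting idea can be sketched in a few lines of Python. This is only an illustration of the concept, not splitmasked's actual implementation: the helper names `is_masked` and `split_runs` are hypothetical, and the handling of `maskchar` (either the literal string `lowercase` or a single character such as `N`) is an assumption based on the option description above.

```python
from itertools import groupby


def is_masked(base, maskchar="lowercase"):
    """Return True if a single base counts as masked (assumed semantics)."""
    if maskchar == "lowercase":
        return base.islower()
    return base == maskchar


def split_runs(seq, maskchar="lowercase"):
    """Split seq into consecutive runs, returned as (masked, start, end) tuples."""
    runs = []
    pos = 0
    # groupby yields one group per stretch of bases sharing the same masking status.
    for masked, group in groupby(seq, key=lambda b: is_masked(b, maskchar)):
        length = sum(1 for _ in group)
        runs.append((masked, pos, pos + length))
        pos += length
    return runs


if __name__ == "__main__":
    for masked, start, end in split_runs("ACGTNNNNAC", maskchar="N"):
        print("masked" if masked else "unmasked", start, end)
```

Working with coordinates rather than substrings makes it easy to slice a FASTQ quality string with the same `start` and `end` values.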
Installation
pip install splitmasked
Usage
splitmasked \
--maskchar lowercase \
--minlength_masked 100 \
--minlength_unmasked 20 \
--outfile_masked /dev/null \
--outfile_unmasked unmasked.fastq \
input.fastq
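The README does not spell out what the --minlength_masked and --minlength_unmasked options do. A plausible reading, and only an assumption here, is that parts shorter than the given length are dropped from the corresponding output. Under that assumption, the filter amounts to the following sketch (`filter_parts` is a hypothetical helper, not part of splitmasked):

```python
def filter_parts(parts, minlength):
    """Keep only the parts that are at least minlength bases long.

    Assumed behaviour of the --minlength_* options; not confirmed by the README.
    """
    return [part for part in parts if len(part) >= minlength]


if __name__ == "__main__":
    unmasked_parts = ["TTTTTTAA", "AATGAA"]
    # With a threshold of 7, only the 8-base part would survive.
    print(filter_parts(unmasked_parts, 7))
```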
Examples
Input
@Seq1 comment1
aaaaaTTTTTTAAgatgatgatgAATGAA
+
AAAAAAAAAAAAAAAAAAAAAAAAAAAAA
@Seq2 comment2
ATGATAGAgagagtTTTATA
+
HHHHHHHHHHHHHHHHHHHH
Output
With --maskchar lowercase:
unmasked.fastq
@Seq1_part2 comment1
TTTTTTAA
+
AAAAAAAA
@Seq1_part4 comment1
AATGAA
+
AAAAAA
@Seq2_part1 comment2
ATGATAGA
+
HHHHHHHH
@Seq2_part3 comment2
TTTATA
+
HHHHHH
masked.fastq
@Seq1_part1 comment1
aaaaa
+
AAAAA
@Seq1_part3 comment1
gatgatgatg
+
AAAAAAAAAA
@Seq2_part2 comment2
gagagt
+
HHHHHH
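The output above suggests that parts are numbered consecutively across masked and unmasked runs, that `_partN` is appended to the record name while the comment is kept, and that the quality string is sliced alongside the sequence. The following minimal sketch reproduces that behaviour for lowercase masking. The function `split_record` is hypothetical and written only to mirror the example; it is not taken from the splitmasked source.

```python
import re


def split_record(header, seq, qual):
    """Split one FASTQ record into (masked_parts, unmasked_parts).

    Each part is a (header, sequence, quality) tuple. The naming scheme
    is inferred from the example output, not from the tool's code.
    """
    name, _, comment = header.partition(" ")
    masked, unmasked = [], []
    # Alternate between runs of lowercase letters and runs of anything else.
    for number, match in enumerate(re.finditer(r"[a-z]+|[^a-z]+", seq), start=1):
        part_header = f"{name}_part{number} {comment}".rstrip()
        part = (part_header, match.group(), qual[match.start():match.end()])
        if match.group()[0].islower():
            masked.append(part)
        else:
            unmasked.append(part)
    return masked, unmasked


if __name__ == "__main__":
    masked, unmasked = split_record(
        "Seq2 comment2", "ATGATAGAgagagtTTTATA", "H" * 20
    )
    for part_header, part_seq, part_qual in unmasked:
        print(f"@{part_header}\n{part_seq}\n+\n{part_qual}")
```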