Separate masked and unmasked parts of sequences in FASTX files.
splitmasked
splitmasked splits sequence records in FAST(A/Q) files based on their masking
status. What counts as masked is defined with the --maskchar option
(e.g. N or lowercase). Both masked and unmasked parts can be retained and
written to separate output files.
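The splitting idea described above can be sketched in a few lines of Python. This is only an illustration of the concept, not splitmasked's actual implementation: the helper names `mask_predicate` and `split_runs` are hypothetical, and the handling of `lowercase` versus a single mask character such as `N` is an assumption based on the description of --maskchar.

```python
from itertools import groupby


def mask_predicate(maskchar):
    """Return a function that tells whether a single base is masked.

    Assumption: "lowercase" means any lowercase base is masked; any other
    value is treated as one literal mask character (for example "N").
    """
    if maskchar == "lowercase":
        return str.islower
    return lambda base: base == maskchar


def split_runs(sequence, maskchar="lowercase"):
    """Split a sequence into consecutive (is_masked, start, end) runs."""
    is_masked = mask_predicate(maskchar)
    runs = []
    position = 0
    # groupby collects adjacent bases that share the same masking status.
    for masked, group in groupby(sequence, key=is_masked):
        length = sum(1 for _ in group)
        runs.append((masked, position, position + length))
        position += length
    return runs


print(split_runs("aaaaaTTTTTTAAgatgatgatgAATGAA"))
print(split_runs("ACGTNNNNACGT", maskchar="N"))
```

Each run carries half-open coordinates, so the same `start:end` slice can be applied to both the sequence and, for FASTQ input, the quality string.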
Installation
pip install splitmasked
Usage
splitmasked \
    --maskchar lowercase \
    --minlength_masked 100 \
    --minlength_unmasked 20 \
    --outfile_masked /dev/null \
    --outfile_unmasked unmasked.fastq \
    input.fastq
Examples
Input
@Seq1 comment1
aaaaaTTTTTTAAgatgatgatgAATGAA
+
AAAAAAAAAAAAAAAAAAAAAAAAAAAAA
@Seq2 comment2
ATGATAGAgagagtTTTATA
+
HHHHHHHHHHHHHHHHHHHH
Output
With --maskchar lowercase:
unmasked.fastq
@Seq1_part2 comment1
TTTTTTAA
+
AAAAAAAA
@Seq1_part4 comment1
AATGAA
+
AAAAAA
@Seq2_part1 comment2
ATGATAGA
+
HHHHHHHH
@Seq2_part3 comment2
TTTATA
+
HHHHHH
masked.fastq
@Seq1_part1 comment1
aaaaa
+
AAAAA
@Seq1_part3 comment1
gatgatgatg
+
AAAAAAAAAA
@Seq2_part2 comment2
gagagt
+
HHHHHH
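The example output above follows a visible pattern: parts are numbered in order of appearance within each record (`_part1`, `_part2`, ...), the comment is carried over, and the quality string is cut at the same positions as the sequence. A minimal Python sketch that reproduces this pattern for lowercase masking is shown below. It is not the tool's own code; `split_record` and `format_fastq` are hypothetical helpers, and no minimum-length filtering is applied.

```python
import re


def split_record(name, comment, sequence, quality):
    """Yield (is_masked, header, subsequence, subquality) for one FASTQ record.

    Assumption: a run of lowercase letters is masked, everything else is not.
    """
    pattern = re.compile(r"[a-z]+|[^a-z]+")
    for index, match in enumerate(pattern.finditer(sequence), start=1):
        start, end = match.span()
        part = match.group()
        header = f"@{name}_part{index} {comment}"
        # The quality string is sliced with the same coordinates as the sequence.
        yield part.islower(), header, part, quality[start:end]


def format_fastq(header, sequence, quality):
    """Format one four-line FASTQ entry."""
    return f"{header}\n{sequence}\n+\n{quality}\n"


records = [
    ("Seq1", "comment1", "aaaaaTTTTTTAAgatgatgatgAATGAA", "A" * 29),
    ("Seq2", "comment2", "ATGATAGAgagagtTTTATA", "H" * 20),
]

masked_out, unmasked_out = [], []
for record in records:
    for is_masked, header, seq, qual in split_record(*record):
        target = masked_out if is_masked else unmasked_out
        target.append(format_fastq(header, seq, qual))

print("".join(unmasked_out), end="")
```

Printing `"".join(masked_out)` instead gives the entries listed under masked.fastq.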
File details
Details for the file splitmasked-0.1.1.tar.gz.
File metadata
- Download URL: splitmasked-0.1.1.tar.gz
- Upload date:
- Size: 5.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8ac3b678c9bb28f1387e8a31b049c0d16153edf4acc7fd1f8c0258f75bb82cd2 |
| MD5 | 948dce79774714b931cd717246a6713f |
| BLAKE2b-256 | bef80c6fa602d3cf120f562db2d668e25d6c96b675992078ff799590bb17dbf3 |
File details
Details for the file splitmasked-0.1.1-py3-none-any.whl.
File metadata
- Download URL: splitmasked-0.1.1-py3-none-any.whl
- Upload date:
- Size: 5.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.1
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 590ca763d2a3fe6b5568e0bbfaa088876baead7b0080887694ddca8075a83305 |
| MD5 | 5058cfb1fe66b1da6f2026bb48e25a0b |
| BLAKE2b-256 | 05dd56ddee84f4b47539660671839b968baded3c092d76f714b23b1b3d2b11f8 |