sbio

A collection of simple bioinformatics tools.

fastq_quality

Script usage:
fastq_quality.py -r1 *R1*.fastq.gz -r2 *R2*.fastq.gz
fastq_quality.py -h

Module usage:
from sbio.fastq_quality import FASTQ_Quality
fq = FASTQ_Quality(read1, read2, sampling_number)
fq.run()

Files must be gzip-compressed (.gz).
Paired-read file names must contain _R1 or _R2.
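
The two file requirements above can be checked before any analysis is started. The following is a minimal sketch only; check_fastq_name is a hypothetical helper written for this illustration and is not part of sbio.

```python
import re
from pathlib import Path


def check_fastq_name(path):
    """Return 'R1' or 'R2' if the file name meets both rules, else raise ValueError.

    Hypothetical helper for illustration; sbio may validate names differently.
    """
    name = Path(path).name
    # Rule 1: the file must be gzip-compressed, judged here by its extension only.
    if not name.endswith(".gz"):
        raise ValueError(f"{name}: file must be gzip-compressed (.gz)")
    # Rule 2: the name must contain _R1 or _R2 to identify the read of the pair.
    match = re.search(r"_(R[12])", name)
    if match is None:
        raise ValueError(f"{name}: file name must contain _R1 or _R2")
    return match.group(1)


print(check_fastq_name("sample_S25_L001_R1.fastq.gz"))
```

Note that this only inspects the name; it does not verify that the file content is actually gzip data.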

Along with the FASTQ file and sample name, six FASTQ attributes are obtained:

file_size --> FASTQ file size (human readable)
total_read_count --> Total read count within each FASTQ file
sampling_size --> Number of randomly sampled reads that were analyzed
length_mean --> Average read length
read_average --> Average read quality
reads_gt_q30 --> Number of reads with an average quality greater than 30
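
To make the attributes above concrete, here is a rough sketch of how values like these could be derived from a gzipped FASTQ file using only the standard library. It is not sbio's implementation: the function name fastq_stats, the seed parameter, and the Phred+33 quality encoding are all assumptions, the file size is reported in bytes rather than a human-readable string, and all quality strings are held in memory for simplicity.

```python
import gzip
import os
import random


def fastq_stats(path, sampling_number=10000, seed=None):
    """Summarize one gzipped FASTQ file (illustrative sketch, not sbio's code)."""
    # A FASTQ record is four lines: header, sequence, '+', quality.
    # Collect the quality line of every record; empty quality lines are skipped.
    quals = []
    with gzip.open(path, "rt") as handle:
        for i, line in enumerate(handle):
            if i % 4 == 3:
                q = line.rstrip("\n")
                if q:
                    quals.append(q)

    # Randomly sample reads; use every read if the file has fewer than requested.
    rng = random.Random(seed)
    sample = rng.sample(quals, min(sampling_number, len(quals)))

    # Assumption: Phred+33 encoding, so quality score = ord(char) - 33.
    read_means = [sum(ord(c) - 33 for c in q) / len(q) for q in sample]
    n = len(sample)
    return {
        "file_size": os.path.getsize(path),  # bytes, not human readable
        "total_read_count": len(quals),
        "sampling_size": n,
        "length_mean": sum(len(q) for q in sample) / n if n else 0.0,
        "read_average": sum(read_means) / n if n else 0.0,
        "reads_gt_q30": sum(1 for m in read_means if m > 30),
    }
```

For very large files, a streaming approach such as reservoir sampling would avoid keeping every quality string in memory.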

Note: to calculate the percentage of reads above Q30, divide reads_gt_q30 by sampling_size (not total_read_count), since only the sampled reads are analyzed.
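
The calculation in the note can be written as a small function. This is a sketch with made-up numbers; percent_gt_q30 is a hypothetical helper, not an sbio function.

```python
def percent_gt_q30(reads_gt_q30, sampling_size):
    """Percentage of sampled reads whose average quality is above Q30."""
    if sampling_size <= 0:
        raise ValueError("sampling_size must be positive")
    # The denominator is the number of sampled reads, not the total read count.
    return 100.0 * reads_gt_q30 / sampling_size


# Hypothetical values: 9,200 of 10,000 sampled reads average above Q30.
print(percent_gt_q30(9200, 10000))  # 92.0
```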

After run() is called, the object holds nested dot-notation attributes for each read, e.g. fq.read1.fastq --> 'sample_S25_L001_R1.fastq.gz'
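
The nested access pattern can be illustrated without running the tool. The object below is a hand-built stand-in made with types.SimpleNamespace, not a real FASTQ_Quality instance; only fq.read1.fastq comes from the description above, and fq.read2 is assumed by analogy.

```python
from types import SimpleNamespace

# Stand-in object built by hand; NOT produced by sbio.
fq = SimpleNamespace(
    read1=SimpleNamespace(fastq="sample_S25_L001_R1.fastq.gz"),
    # Assumption: read2 mirrors read1 for the second file of the pair.
    read2=SimpleNamespace(fastq="sample_S25_L001_R2.fastq.gz"),
)

# Walk both reads using ordinary attribute access.
for label in ("read1", "read2"):
    read = getattr(fq, label)
    print(label, read.fastq)
```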
