
Collection of simple bioinformatic tools.


Script usage: -r1 *R1*.fastq.gz -r2 *R2*.fastq.gz -h

Module usage:
from sbio.fastq_quality import FASTQ_Quality
fq = FASTQ_Quality(read1, read2, sampling_number)

Files must be gzip-compressed (.gz).
Paired read files must contain _R1 or _R2 in the file name.
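
The two naming rules above can be checked before the object is built. This is a minimal sketch, not part of the package: `check_fastq_pair` is a hypothetical helper, and it assumes the rules mean that each file name ends in `.gz`, that read 1 contains `_R1`, and that read 2 contains `_R2`.

```python
from pathlib import Path


def check_fastq_pair(read1, read2):
    """Hypothetical pre-flight check; not part of the package."""
    for path, tag in ((read1, "_R1"), (read2, "_R2")):
        name = Path(path).name
        # Rule 1: files must be .gz zipped
        if not name.endswith(".gz"):
            raise ValueError(f"{name}: file must be .gz zipped")
        # Rule 2: paired reads must carry _R1 or _R2 in the file name
        if tag not in name:
            raise ValueError(f"{name}: expected {tag} in file name")
    return True
```

Only the file names are inspected; the check does not open the files or confirm that they are valid gzip archives.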

Along with the FASTQ file and sample name, the following FASTQ attributes are obtained:

file_size --> FASTQ file size (human readable)
total_read_count --> Total read count within each FASTQ file
sampling_size --> Number of randomly sampled reads analyzed
length_mean --> Average read length
read_average --> Average read quality
reads_gt_q30 --> Number of reads with an average quality greater than 30
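
To make the attributes above concrete, here is a sketch of how such values could be derived from a gzipped FASTQ file. It is not the package's implementation. The functions `read_fastq` and `summarize` are hypothetical, and the sketch assumes four-line FASTQ records, Phred+33 quality encoding, and a per-read quality taken as the arithmetic mean of the per-base scores.

```python
import gzip
import random
from statistics import mean


def read_fastq(path):
    """Yield (sequence, quality) for each four-line record in a gzipped FASTQ."""
    with gzip.open(path, "rt") as handle:
        while True:
            header = handle.readline()
            if not header:
                break
            seq = handle.readline().strip()
            handle.readline()  # '+' separator line
            qual = handle.readline().strip()
            yield seq, qual


def summarize(path, sampling_number, seed=None):
    """Assumed computation: Phred+33, mean of per-base scores per read."""
    reads = list(read_fastq(path))
    rng = random.Random(seed)
    # Randomly pick up to sampling_number reads without replacement
    sample = rng.sample(reads, min(sampling_number, len(reads)))
    read_means = [mean(ord(c) - 33 for c in qual) for _, qual in sample]
    return {
        "total_read_count": len(reads),
        "sampling_size": len(sample),
        "length_mean": mean(len(seq) for seq, _ in sample),
        "read_average": mean(read_means),
        "reads_gt_q30": sum(m > 30 for m in read_means),
    }
```

The sketch loads every read into memory before sampling, which keeps it short but would be costly for large files. An empty file or an empty quality string raises `statistics.StatisticsError`.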

Note: when calculating the percent of reads above Q30, use reads_gt_q30/sampling_size.
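
The note above amounts to a one-line calculation. The helper below is a hypothetical illustration, not a function from the package. The denominator is sampling_size rather than total_read_count, presumably because reads_gt_q30 is counted only among the sampled reads.

```python
def percent_gt_q30(reads_gt_q30, sampling_size):
    """Percent of sampled reads whose average quality exceeds Q30."""
    if sampling_size == 0:
        raise ValueError("sampling_size must be greater than zero")
    return 100.0 * reads_gt_q30 / sampling_size


# Illustrative numbers only, not real results
print(percent_gt_q30(8500, 10000))  # 85.0
```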

After it has run, the object exposes the attributes for each read through nested dot notation, e.g. fq.read1.fastq --> 'sample_S25_L001_R1.fastq.gz'

Files for vsbio, version 0.1.0

Filename                        Size     File type   Python version
vsbio-0.1.0-py3-none-any.whl    5.1 kB   Wheel       py3
vsbio-0.1.0.tar.gz              3.6 kB   Source      None