Skip to main content

Collection of simple bioinformatic tools.

Project description

sbio

Collection of simple Bioinformatic tools.

fastq_quality

Script usage:
fastq_quality.py -r1 *R1*.fastq.gz -r2 *R2*.fastq.gz
fastq_quality.py -h

Module usage:
from sbio.fastq_quality import FASTQ_Quality
fq = FASTQ_Quality(read1, read2, sampling_number)
fq.run()

Files must be .gz zipped
Paired reads must contain _R1 or _R2 in file name

Along with the FASTQ file and sample name 5 FASTQ attributes are obtained:

file_size --> FASTQ file size (human readable) total_read_count --> Total read count within each FASTQ file
sampling_size --> Random analyzed read count
length_mean --> Average read length
read_average --> Average read quality
reads_gt_q30 --> Read counts with an average quality greater than 30

Note: when calculating percent of reads above Q30 use reads_gt_q30/sampling_size

After ran object will contain nested dot notation for each read, fq.read1.fastq --> 'sample_S25_L001_R1.fastq.gz'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for sbio, version 0.0.7
Filename, size File type Python version Upload date Hashes
Filename, size sbio-0.0.7-py3-none-any.whl (5.0 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size sbio-0.0.7.tar.gz (3.6 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page