EZQC: Easy Quality Control for FastQ Files
Introduction
EZQC is a streamlined, terminal-based alternative to FastQC. Instead of generating individual report files per analysis, EZQC displays the analysis results, reasons, and suggestions directly in the terminal, making it easier to quickly assess the quality of multiple files. Additionally, EZQC generates figures for each analysis, providing a visual aid to spot potential issues for further examination.
EZQC is capable of performing the following analyses:
- Per base sequence quality
- Per sequence quality scores
- Per base sequence content
- Per sequence GC content
- Per base N content
- Sequence Length Distribution
- Overrepresented sequences
- Adapter Content
Why Choose EZQC over FastQC?
- Fast Result Readout When Batch Processing: With EZQC, there's no need to click into each HTML report like you would with FastQC.
- Automatic Interpretation of Analysis Results: The results are color-coded and provided in plain English, complete with suggestions. This makes it easier for users to interpret the results quickly.
- Generate Detailed Figures for Advanced Users: For those who want a more in-depth analysis, EZQC is capable of generating detailed figures that aid in understanding the quality of your FastQ files.
Quick Start Guide
- Install EZQC by following the Installation guide.
- Run the tool on a toy example (a FastQ file from IGSR) using the command
ezqc tests/SRR020192.fastq
- The results will be displayed in the terminal, and figures as well as CSV tables will be saved to a directory named ezqc_output in your current working directory. Note that this file was chosen intentionally to fail multiple QC tests.
Installation
You can install EZQC using pip:
pip install ezqc
Alternatively, you can install the latest version of EZQC from source using the provided setup.py script, following these steps:
- Clone the repository:
git clone https://github.com/skysky2333/ezqc
- Navigate to the EZQC directory:
cd ezqc
- Install the package:
pip install .
Or
python setup.py install
EZQC requires Python 3.x and depends on the following packages, which will be installed automatically during setup:
- numpy
- matplotlib
- pandas
- scipy
- Bio
Usage
After installation, you can use EZQC from the command line as follows:
ezqc <fastq file(s)>
Replace <fastq file(s)> with the path(s) to your FastQ files. If you want to analyze multiple files, separate the file paths with spaces:
ezqc file1.fastq file2.fastq file3.fastq
Use -o or --output to set the output directory.
Use -h or --help to see help messages.
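For batch runs driven from a script, the command line above can be assembled programmatically. The following is a minimal sketch, not part of EZQC itself: the helper names build_ezqc_command and run_ezqc, the default *.fastq pattern, and the placement of -o after the file paths are all assumptions made for illustration.

```python
import subprocess
from pathlib import Path


def build_ezqc_command(fastq_dir, output_dir, pattern="*.fastq"):
    """Return an argument list that runs ezqc on every matching file.

    Hypothetical helper: it only assembles the documented
    `ezqc <fastq file(s)> -o <dir>` invocation.
    """
    files = sorted(str(p) for p in Path(fastq_dir).glob(pattern))
    if not files:
        raise FileNotFoundError(f"no files matching {pattern} in {fastq_dir}")
    return ["ezqc", *files, "-o", str(output_dir)]


def run_ezqc(fastq_dir, output_dir):
    """Run ezqc once over all FastQ files in a directory (assumes ezqc is on PATH)."""
    # check=True raises CalledProcessError if ezqc exits with a non-zero status.
    subprocess.run(build_ezqc_command(fastq_dir, output_dir), check=True)
```

Calling run_ezqc("data", "qc_results") would then be equivalent to typing ezqc data/a.fastq data/b.fastq -o qc_results by hand, where the directory and file names are placeholders.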
Analysis Methods
Here's a brief description of the analyses performed by EZQC:
- Per Base Sequence Quality: Checks the quality of each base call in a sequence read.
- Per Sequence Quality Scores: Provides a histogram of quality scores over all sequences.
- Per Base Sequence Content: Analyzes the proportion of each base (A, T, G, C) at each position across all sequences.
- Per Sequence GC Content: Calculates the GC content in each sequence.
- Per Base N Content: Measures the proportion of unknown (N) bases at each position across all sequences.
- Sequence Length Distribution: Provides a histogram showing the distribution of sequence lengths.
- Overrepresented Sequences: Identifies any sequences that occur more often than expected.
- Adapter Content: Detects the presence of adapter sequences in the reads.
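To make a few of these metrics concrete, here is a minimal sketch of how per-position mean quality, per-position N fraction, per-sequence GC content, and the length distribution could be computed in plain Python. It is an illustration only and does not reflect how EZQC implements them: the function names read_fastq and summarize are hypothetical, and the code assumes four-line FastQ records with Phred+33 quality encoding and quality strings as long as their sequences.

```python
from collections import Counter
from io import StringIO


def read_fastq(handle):
    """Yield (sequence, quality_string) pairs from a four-line-per-record FastQ stream."""
    while True:
        header = handle.readline()
        if not header:
            break
        seq = handle.readline().strip()
        handle.readline()  # '+' separator line
        qual = handle.readline().strip()
        yield seq, qual


def summarize(records, phred_offset=33):
    """Compute simple per-position and per-sequence statistics (illustrative only)."""
    qual_sums, n_counts, depth = [], [], []
    gc_fractions, lengths = [], Counter()
    for seq, qual in records:
        # Grow the per-position accumulators to fit the longest read seen so far.
        while len(depth) < len(seq):
            qual_sums.append(0)
            n_counts.append(0)
            depth.append(0)
        for i, (base, q) in enumerate(zip(seq, qual)):
            qual_sums[i] += ord(q) - phred_offset  # decode the Phred score
            n_counts[i] += base in "Nn"
            depth[i] += 1
        lengths[len(seq)] += 1
        if seq:
            gc = sum(base in "GCgc" for base in seq)
            gc_fractions.append(gc / len(seq))
    return {
        "mean_quality": [s / d for s, d in zip(qual_sums, depth)],
        "n_fraction": [n / d for n, d in zip(n_counts, depth)],
        "gc_fraction": gc_fractions,
        "length_counts": dict(lengths),
    }


# Two made-up reads: the second has an N with a low quality score at position 3.
example = StringIO("@r1\nACGT\n+\nIIII\n@r2\nGGNA\n+\nII#I\n")
stats = summarize(read_fastq(example))
print(stats["mean_quality"])  # [40.0, 40.0, 21.0, 40.0]
print(stats["n_fraction"])    # [0.0, 0.0, 0.5, 0.0]
```

In this toy input the third position stands out in both the quality and N-content profiles, which is the kind of pattern the per-base analyses are meant to surface.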
Contributing
We welcome contributions! Please see CONTRIBUTING.md for details on how to contribute.