Skip to main content

The Long-read RNA-seq quality control software.

Project description

LQC

LQC: Long-read RNA-seq Quality Control

The Long-read RNA-seq quality control software.

LQC is used to generate quality control summary report for mapped SAM/BAM files of long-read RNA-seq data (PacBio, Oxford Nanopore). LQC provides detailed information about the indels, mismatches and splicing sites in the BAM files, which provides a good reference for evaluation for the sequencing quality of the long-read sequencing data.

Please cite us:

Dependencies

The LQC software was developed with python3, which demands several python packages.

Bootstrap 5.1.3 is used by the final html report. Network access is required to load the css file.

Installation

It's advised to install the software into a virtual environment.

Create virtual environment:

conda create -n lqc
conda activate lqc

Or:

virtualenv ~/.env/lqc
source ~/.env/lqc/bin/activate

From github

Download from github:

git clone https://github.com/gxiaolab/LQC
cd LQC

Install the package:

python setup.py install

From pip

pip install lqc

Usage

LQC accepts SAM or BAM file with cs tag (generated with --cs options by minimap2), or MD tag. If the SAM/BAM file has only MD tag, a genome fasta reference file is required as well to get the splicing information. Since cs tag included the splicing site information, a genome fasta reference is not required by SAM/BAM with cs tag.

usage: lqc [-h] -b BAM_FILE [--genome-fasta GENOME_FASTA] [-o OUTPUT_DIR] [--output-cs]
           [--output-pickle] [-c [CONTIG ...]] [-t THREAD] [--log-level LOG_LEVEL] [--version]

The Long-read RNA-seq quality control software.

optional arguments:
  -h, --help            show this help message and exit
  -b BAM_FILE, --bam-file BAM_FILE
                        input bam file, with cs tags, sorted and indexed
  --genome-fasta GENOME_FASTA
                        path of genome fasta file
  -o OUTPUT_DIR, --output_dir OUTPUT_DIR
                        directory to store output files
  --output-cs           output processed cs tags
  --output-pickle       output pickle file of results
  -c [CONTIG ...], --contig [CONTIG ...]
                        contigs to be analyzed
  -t THREAD, --thread THREAD
                        threads to be used in calculation
  --log-level LOG_LEVEL
                        logging level (default INFO): [DEBUG, INFO]
  --version             show program's version number and exit

The output directory should be empty to allow the storage of output files.

Output

By default, four kinds of output files will be generated by LQC: summary table, figures, html report, and pickle file to restore python statistic objects. And with the --output-cs option, the processed cs tag information of the BAM file will also be outputed. Similarly, with the --output-pickle option, the result objects will be outputed into one pickle file, which can be used for further analysis.

Summary table will be stored in the table subdirectory of the output directory. Figures will be stored in the fig subdirectory of the output directory.

Processed cs tag file has six columns: read name, contig, low, high, cs mark, cs value.

The LQC will also generate a html report for checking of the BAM file qualities.

Screenshot of LQC report:

LQC report screenshot

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lqc-0.0.5.tar.gz (100.4 kB view details)

Uploaded Source

Built Distribution

lqc-0.0.5-py3.9.egg (164.4 kB view details)

Uploaded Source

File details

Details for the file lqc-0.0.5.tar.gz.

File metadata

  • Download URL: lqc-0.0.5.tar.gz
  • Upload date:
  • Size: 100.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.24.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.50.2 importlib-metadata/4.11.3 keyring/21.4.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.5

File hashes

Hashes for lqc-0.0.5.tar.gz
Algorithm Hash digest
SHA256 000c73360fe8d60eaaed864d1795352ba2710eeeb1f94fd7e2a6e7efbaff5034
MD5 764842f5d21ff7775d778c1cef7cb8d4
BLAKE2b-256 51c6c12361695b25945f77097f4355d694d54420974fa111ac7ed1de759f3f75

See more details on using hashes here.

File details

Details for the file lqc-0.0.5-py3.9.egg.

File metadata

  • Download URL: lqc-0.0.5-py3.9.egg
  • Upload date:
  • Size: 164.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.24.0 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.50.2 importlib-metadata/4.11.3 keyring/21.4.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.5

File hashes

Hashes for lqc-0.0.5-py3.9.egg
Algorithm Hash digest
SHA256 a5dbd535cca8223a8127db04dc39f2371782a04d69eb914d8508ada2e302efec
MD5 9114c62c9181513790bece5469b44d2a
BLAKE2b-256 53aad1753e9032a73849f8deaf198caea952bf3733c17aa0654f53da60ac07b0

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page