Skip to main content

QC on various type of pacbio data

Project description JOSS (journal of open source software) DOI Python 3.8 | 3.9 | 3.10 JOSS (journal of open source software) DOI


This is is the pacbio_qc pipeline from the Sequana projet


Quality control for pacbio datafiles (raw data or CCS files).


BAM files provided by Pacbio Sequencers.


HTML reports with various plots including taxonomic plot




This README file, the Wiki from the github repository (link above) and


Cokelaer et al, (2017), ‘Sequana’: a Set of Snakemake NGS pipelines, Journal of Open Source Software, 2(16), 352, JOSS DOI doi:10.21105/joss.00352


Just install this package:

pip install sequana_pacbio_qc

You will need samtools and kraken2 (optional) for the taxonomic analysis.


sequana_pacbio_qc --help
sequana_pacbio_qc --input-directory DATAPATH

If you want to filter out some BAM files, you may use the pattern in tab ‘input data’.

In the configuration tab, in the kraken section add as many databases as you wish. You may simply unset the first database to skip the taxonomy, which is experimental.

This creates a directory with the pipeline and configuration file. You will then need to execute the pipeline:

cd pacbio_qc
sh  # for a local run

This launch a snakemake pipeline. If you are familiar with snakemake, you can retrieve the pipeline itself and its configuration files and then execute the pipeline yourself with specific parameters:

snakemake -s pacbio_qc.rules -c config.yaml --cores 4 --stats stats.txt

Or use sequanix interface.


This pipelines requires the following executable(s):

  • sequana

  • samtools

  • kraken2

  • multiqc


This pipeline takes as inputs a set of BAM files from Pacbio sequencers. It computes a set of basic statistics related to the read lengths. It also shows some histograms related to the GC content, SNR of the diodes and the number of passes Finally, a quick taxonomy can be performed using Kraken. HTML reports are created for each sample as well as a multiqc summary page.

Kraken databases are not provided with the pipeline. This step is optional and not used by default.





fix missing import in the summary


Uses latest wrappers and graphviz apptainers


Release to use latests sequana_pipetools framework


Update to use latest tools from sequana framework


First release of sequana_pacbio_qc using latest sequana rules and modules (0.9.5)

Contribute & Code of Conduct

To contribute to this project, please take a look at the Contributing Guidelines first. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

Rules and configuration details

Here is the latest documented configuration file to be used with the pipeline. Each rule used in the pipeline may have a section in the configuration file.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sequana_pacbio_qc-1.0.1.tar.gz (30.7 kB view hashes)

Uploaded source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page