Skip to main content

Splits FASTQ files evenly.

Project description

fastqsplitter

A simple application to split FASTQ files.

Fastqsplitter splits a fastq file over the specified output files evenly. Fastqsplitter will read groups of a 100 fastq files. For example if 3 output files are specified record 1-100 will go to file 1, 101-200 to file 2, 201-300 to file 3 , 301-400 to file 1 again etc. This ensures the output fastq files are of equal size with no positional bias in the output files.

Fastqsplitter is fast because it assumes each record is 4 lines. As a consequence this application does NOT work with multiline fastq sequences. Also input fastq records are NOT checked for being proper fastq records. Since all downstream analysis tools (FastQC, cutadapt, BWA etc.) do check if the input is correct, another input check in fastqsplitter was deemed redundant.

Quickstart

install fastqsplitter: pip install fastqsplitter

If installation does not work because no C compiler is installed on your system try: NO_CYTHON=True pip install fastqsplitter

To split an input file input_fastq.gz into 3 different files. fastqsplitter -i input_fastq.gz -o split.1.fq.gz -o split.2.fq.gz -o split.3.fq.gz

fastqsplitter uses the excellent xopen library by @marcelm. Therefore, the input and output files compression is determined by the extension. Use .gz if output files should be gzip compressed.

Documentation

More information on fastqsplitter can be found on our readthedocs page.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastqsplitter-1.2.0.tar.gz (6.7 kB view hashes)

Uploaded source

Built Distributions

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page