kmer counting from sequencing fastq.gz file
Project description
kmer_counting_loop.py
Introduction
This tool is designed for bioinformatics analysis to count k-mer frequencies in sequencing data stored in FASTQ format. It is particularly useful when dealing with large datasets as it leverages Python's multiprocessing capabilities for parallel processing, thus enhancing performance and reducing computation time.
Features
Count specified k-mer sizes in FASTQ files (compressed with gzip).
Use input CSV files containing k-mer sequences to filter and count only relevant k-mers.
Handle large datasets efficiently with chunk-based parallel processing.
Utilize multiple CPU cores for faster computation.
Generate output CSV files containing the count of each k-mer.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file kmer_counting-0.1.0.tar.gz
.
File metadata
- Download URL: kmer_counting-0.1.0.tar.gz
- Upload date:
- Size: 4.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d284a4e148df4b0375844201e29258f4297c530efc49e290b8af49aeeff15979 |
|
MD5 | 56b8bd6fdb7b92a810ea0dd11d706f8f |
|
BLAKE2b-256 | b70cb0a4691539366beb67e5143a67be26eb1ff21fdfbe485ea127b603ecaa5b |
File details
Details for the file kmer_counting-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: kmer_counting-0.1.0-py3-none-any.whl
- Upload date:
- Size: 4.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.10.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 34dc49016d108d5b38c55a9977f2dda507ac574634754cb85f821ae57104d64b |
|
MD5 | 788d27e16c37ba0d65619ebb460b9275 |
|
BLAKE2b-256 | 79e7c91090c6379b1f4063bded90875e9e550f02db6d5c35dce6f464060e4120 |