Project description
Fastq Downloader (WIP)
Use this Snakemake script for a smoother experience.
This Python package lets you download FASTQ files from ENA. It can automatically merge and rename the FASTQ files based on the input file you provide.
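The merge step can be pictured with a minimal sketch. It is not the package's own code: `merge_fastq_gz` and its arguments are hypothetical names, and it assumes the parts are gzip-compressed FASTQ files that belong to the same sample and read direction. It relies on the fact that a gzip stream may hold several members, so a byte-level concatenation of `.gz` files decompresses to the concatenated contents.

```python
import shutil
from pathlib import Path


def merge_fastq_gz(parts, merged_path):
    """Concatenate several .fastq.gz files into one file, in the given order.

    Hypothetical helper for illustration; not part of fastq_download.
    """
    merged_path = Path(merged_path)
    with merged_path.open("wb") as out:
        for part in parts:
            with Path(part).open("rb") as src:
                # Byte-level copy: gzip allows multiple members in one stream,
                # so no decompression is needed to merge the files.
                shutil.copyfileobj(src, out)
    return merged_path
```

For paired-end data, the same call would be made once per read direction, with the parts listed in the same run order for both.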
How to use
Automatic merging of multiple files of paired-end reads has not been tested yet, but it should work.
conda create --name fastq-download -c conda-forge -c hcc -c bioconda aspera-cli snakemake httpx lxml click beautifulsoup4 python=3.9
## download the gist mentioned above to thisname.smk, using whatever tool you like
## download the whl file from this project's GitHub release to thisname.whl
conda activate fastq-download
pip install thisname.whl
## create an infotsv first; you can copy the table straight from the GEO website,
## then open vim, type :set paste to enter paste mode, paste the table,
## save the file under any name you like, and exit vim
## whitespace will be converted to underscores automatically
## set refresh_acc to False if you don't want to query the accession numbers again;
## otherwise the link file is recreated and all files are downloaded again.
python3 -m fastq_download --infotsv thisname.tsv --outdir thisname --refresh_acc False
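The whitespace-to-underscore conversion mentioned in the comments can be illustrated with a short sketch. This is an assumption about the behaviour, not the package's implementation: `normalize_infotsv` is a hypothetical name, and the two-column layout in the usage example (accession, sample name) is only a guess, since the infotsv columns are not specified here.

```python
import csv
import io
import re


def normalize_infotsv(text):
    """Return the tab-separated table with whitespace in each cell replaced by '_'.

    Hypothetical helper for illustration; not part of fastq_download.
    """
    rows = csv.reader(io.StringIO(text), delimiter="\t")
    out = io.StringIO()
    writer = csv.writer(out, delimiter="\t", lineterminator="\n")
    for row in rows:
        # Trim each cell, then collapse every run of whitespace to one underscore.
        writer.writerow([re.sub(r"\s+", "_", cell.strip()) for cell in row])
    return out.getvalue()


if __name__ == "__main__":
    # Assumed layout: accession in column 1, free-text sample name in column 2.
    print(normalize_infotsv("GSM0000001\tliver sample 1\n"), end="")
```

Tabs are kept as column separators; only the whitespace inside a cell is replaced.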
todo
- test merging of runs for paired-end reads
- publish to bioconda
- retry on failure
- use a DAG to run the pipeline (partly done, implemented with snakemake)
- option to resume a download when the MD5 does not match
- option to continue from the last download
- implement second-level parallelization
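Two of the items above, retrying on failure and reacting to an MD5 mismatch, could look roughly like the sketch below. Nothing here exists in the package yet: `md5_of`, `download_with_retry`, and the `fetch` callable are hypothetical, and the sketch simply re-downloads the whole file instead of resuming a partial one.

```python
import hashlib
import time


def md5_of(path, chunk_size=1 << 20):
    """Compute the MD5 hex digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def download_with_retry(fetch, path, expected_md5, attempts=3, wait_seconds=0.0):
    """Call fetch(path) until the file's MD5 matches or the attempts run out.

    `fetch` is any callable that writes the download to `path` (an assumption
    for this sketch). Returns the number of the attempt that succeeded.
    """
    for attempt in range(1, attempts + 1):
        try:
            fetch(path)
            if md5_of(path) == expected_md5:
                return attempt
        except OSError:
            pass  # treat an I/O error like any other failed attempt
        if attempt < attempts:
            time.sleep(wait_seconds)
    raise RuntimeError(f"{path}: no valid download after {attempts} attempts")
```

Passing the downloader as a callable keeps the retry logic independent of the transfer tool, so the same loop would work for an aspera or HTTP download.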
Download files
Source Distribution
fastq-downloader-0.2.1.tar.gz (9.2 kB)
Built Distribution
Hashes for fastq_downloader-0.2.1-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 0063e0ee246a61d7a49961e5636de195b4deb95122a783cb33e0af86e2eabd0d
MD5 | e67b374ae6551b629c97b882dde646b4
BLAKE2b-256 | d9dd34d922880e50b95b3dde277ce89add34a88bc8bbdb4938165b4f6e5a447f
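Before running `pip install thisname.whl`, the downloaded wheel can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; `sha256_matches` is a hypothetical helper name, and `thisname.whl` is the placeholder file name used in the instructions.

```python
import hashlib

# SHA256 digest listed above for fastq_downloader-0.2.1-py3-none-any.whl
EXPECTED_SHA256 = "0063e0ee246a61d7a49961e5636de195b4deb95122a783cb33e0af86e2eabd0d"


def sha256_matches(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 hex digest equals `expected`."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in 1 MiB chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest() == expected.lower()
```

A call such as `sha256_matches("thisname.whl")` should return `True` for an intact copy of the 0.2.1 wheel.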