Fast metadata and fastq retrieval with ffq and aria2c
Project description
ngsfetch (v0.1.1)
A utility to retrieve fastq files with ffq and aria2. It is designed to be fast and efficient, allowing you to download large datasets quickly and easily. This tool can be used to fetch fastq files from various public repositories, including:
- GEO: Gene Expression Omnibus,
- SRA: Sequence Read Archive,
- EMBL-EBI: European Molecular BIology Laboratory’s European BIoinformatics Institute.
[!IMPORTANT]
- Fast: Uses
aria2to download files in parallel, which can significantly speed up the download process.- Integrity: Verifies the integrity of downloaded files using
md5sumto ensure that the files are not corrupted during the download process.- Retry Mechanism: Automatically attempts to re-download files if the initial download fails, ensuring successful retrieval of data.
Quick start
# Fetch fastq files of GSE52856
ngsfetch -i GSE52856 -o /path/to/output/GSE52856 -p 16
# Fetch fastq files of SRP175008
ngsfetch -i SRP175008 -o /path/to/output/SRP175008 -p 16
# Fetch fastq files of ERP126666
ngsfetch -i ERP126666 -o /path/to/output/ERP126666 -p 16
How to install
pip
pip install ngsfetch
or
git clone https://github.com/NaotoKubota/ngsfetch.git
cd ngsfetch
pip install .
conda
conda create -n ngsfetch python=3.9
conda activate ngsfetch
conda install -c bioconda ngsfetch
Docker
docker pull naotokubota/ngsfetch
Dependencies
Operating system
- Linux (i.e. where the
md5sumcommand is available)
python packages
- python (>=3.9)
- ffq (>=0.3.1)
- aria2 (>=0.0.1b0)
Usage
usage: ngsfetch [-h] [-i ID] [-o OUTPUT] [-p PROCESSES] [--attempts ATTEMPTS] [-v]
ngsfetch v0.1.1 - fast retrieval of metadata and fastq files with ffq and aria2c
optional arguments:
-h, --help show this help message and exit
-i ID, --id ID ID of the data to fetch
-o OUTPUT, --output OUTPUT
Output directory
-p PROCESSES, --processes PROCESSES
Number of processes to use (up to 16)
--attempts ATTEMPTS Number of attempts to fetch metadata and fastq files
-v, --verbose Increase verbosity
Contributing
Thank you for wanting to improve ngsfetch! If you have any bugs or questions, feel free to open an issue or pull request.
Authors
- Naoto Kubota (0000-0003-0612-2300)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ngsfetch-0.1.1.tar.gz.
File metadata
- Download URL: ngsfetch-0.1.1.tar.gz
- Upload date:
- Size: 7.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
23bf8c75e8ddd83bb066fc619e064ba7ee5b73bfaa3ba3fbef92eb891d8a2efe
|
|
| MD5 |
6f31a1bd8db4220cffd8371156df53a2
|
|
| BLAKE2b-256 |
730b39a3c6f31e28ce3c89df76efd10651fe004c07d55ed499fb2206f76abbaf
|
File details
Details for the file ngsfetch-0.1.1-py3-none-any.whl.
File metadata
- Download URL: ngsfetch-0.1.1-py3-none-any.whl
- Upload date:
- Size: 7.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.21
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aabae0883586d16d1a73b90d33a4834116a2fcb55f245874001fdf87dcd6eb03
|
|
| MD5 |
404429e74e141e221d12bde9c61d25f0
|
|
| BLAKE2b-256 |
bbe4634b00c6005464029ea5829bc294d5cdf28c66df3fa3ae36a9b750583770
|