The AsperaSRAgetter provides a easy way to download sequencing data from ENA by using Aspera.
Project description
AsperaSRAgetter
AsperaSRAgetter provides an easy way to download sequencing data (fastq.gz format) from European Nucleotide Archive (ENA) by using Aspera.
Installation
AsperaSRAgetter has been distributed on pypi. You can easily install AsperaSRAgetter through pip. AsperaSRAgetter depends on Aspera-CLI to retrive sequencing data from ENA. It is recommended to install Aspera-CLI with Conda.
# You may create a new invironment for AsperaSRAgetter, but this is optional
conda create -n AsperaSRAgetter python=3.10
conda activate AsperaSRAgetter
# Install AsperaSRAgetter using pip
pip install AsperaSRAgetter
# Install Aspera-CLI using conda
conda install -c hcc aspera-cli
Workflow
AsperaSRAgetter first inquiry for corresponding fastq.gz file report through ENA filereport API. Sencondly, the MD5 hash value and ftp url of each fastq.gz files are then resolved from the report. Lastly, ftp url is then passed to Aspera transfer command ascp
to download the fastq.gz file.
The file reports will be stored as a .tsv
table as records of the downloading process.
All files' MD5 hash values are saved in .md5
file which users can further verify the integrity of files.
Usage
The command name of AsperaSRAgetter is sragetter. It accepts either one SRA accession or one TXT file containing multiple accessions (see the usage example below). Note that users need to provide the path of public key authentication file of Aspera-CLI (normally should be ENVIRONMENT_PATH/etc/asperaweb_id_dsa.openssh)
usage: sragetter [-h] [-v] [-acc ACCESSION | -f FILE] -ssh SSH_KEY -o OUTDIR
options:
-h, --help show this help message and exit
-v, --version Show SRAdownloader version number and exit
-acc ACCESSION, --accession ACCESSION
SRA data accession
-f FILE, --file FILE TXT file with multiple SRA accessions
-ssh SSH_KEY, --ssh-key SSH_KEY
Public key authentication file provided by Aspera command line client download package as the 'asperaweb_id_dsa.openssh' file
-o OUTDIR, --outdir OUTDIR
Path to store the downloaded SRA data
Usage
-----------------
Download with one accession:
$ sragetter --accession sra_accession --ssh-key sshkey_path.openssh --outdir outdir_path
Download with TXT file containing multiple accessions:
$ sragetter --file sra_accessions.txt --ssh-key sshkey_path.openssh --outdir outdir_path
Contact
If you have any questions using AsperaSRAgetter, feel free to open an issue or contact me jirunjia@gmail.com.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for AsperaSRAgetter-2.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b83c4ffa9df25aa5484faadae131867bc72bcc0e1e1f526d98a9b6c23207877c |
|
MD5 | ef4ca13fbc0f43e02724d78bd51236cc |
|
BLAKE2b-256 | 6d2b43fe214918b581d4de327a4b699fb75fd5b4ef7909c1f6293f5229b4253b |