A command-line interface to the miSRA API. It allow to remotely query over 90,000 miRNA-seq samples from the Sequence Read Archive
Project description
miSRA
a command line tool to remotely query over 90,000 miRNA-seq samples from the Sequence Read Archive.
For more info, please visit the GitHub repository.
Dependencies
miSRA requires Python >= 3.7 and the python package requests (automatically installed by pip)
How to install
(optional but recommended) Create a virtual environment and activate it
python3 -m venv env
source env/bin/activate
Install with pip (recommended)
pip3 install miSRA
# to test that it worked
miSRA --help
Run miSRA:
miSRA --config your_config.json
An example config.json could include the following parameters (for a detailed explanation of the different query modes, see miSRA modes ):
{
"mode":"mirna", # There are different modes to query miSRA (mirna, lib and spike). The mode mirna performs alignments to miRNA annotations using sRNAbench
# mirna mode requires 2 miRNA annotation files, one for mature miRNAs and one for hairpins
"mature":"mature_hsa.fa", # path to mature miRNA annotations in fasta format
"hairpin":"hairpin_hsa.fa", # path to hairpin miRNA annotations in fasta format
# you can specify which samples you want to profile either by specifying comma-separated SRA study or experiment accessions
"studies":"SRP225193", # profile all samples from this study
# "experiments":"SRX2349199,SRX2349197,SRX546025,SRX546026", # this would include these experiments
"localOut":"RNAatlas", # local folder where the results will be downloaded to
"mm": "1", # number of mismatches (optional)
"alignType":"v" # bowtie alignment type (optional)
}
One could also download information about the database content. The following command will generate a file in the current working directory including the number of available samples and studies per species.
miSRA --db-stat
To obtain all samples and studies available for a specific species, you can do:
miSRA --taxonID TAXON_ID
## Where TAXON_ID is NCBI's taxonID. For instance, for human:
miSRA --taxonID 9606
miSRA modes:
There are 3 main modes to query samples in miSRA:
- miRNA: mature and hairpin miRNA sequences are used for profiling
- library: long reference sequences are used for profiling and mappings of reads to these sequences will be reported
- spike: short reference sequences are provided and only exact matches will be reported
For more info, please visit the GitHub repository or the tool manual.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file misra-0.0.3.tar.gz
.
File metadata
- Download URL: misra-0.0.3.tar.gz
- Upload date:
- Size: 8.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9a3f40ecf23e1ad92b6591fe4a96ed6fd70a018c0e5ec2ce51b2c3a7a0e00edc |
|
MD5 | e89f161dc6252c601c3bf1b34f6569af |
|
BLAKE2b-256 | e62ad5068ec4cd8a5721be420de8fb3565de1b2e3862f98cf55e193c8e28f89c |
File details
Details for the file misra-0.0.3-py3-none-any.whl
.
File metadata
- Download URL: misra-0.0.3-py3-none-any.whl
- Upload date:
- Size: 9.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1e48ba33d807c5128c39c2cdbf5bfa4268458fef5a5161483da61d509343d633 |
|
MD5 | 6956e9a884f113ddac35bf4d611826ce |
|
BLAKE2b-256 | ae280c9c9220ef6e3df9723c238da56737632003d1ece76736696f4fab5bc745 |