Skip to main content

A command-line interface to the miSRA API. It allow to remotely query over 90,000 miRNA-seq samples from the Sequence Read Archive

Project description

miSRA

a command line tool to remotely query over 90,000 miRNA-seq samples from the Sequence Read Archive.
For more info, please visit the GitHub repository.

Dependencies

miSRA requires Python >= 3.7 and the python package requests (automatically installed by pip)

How to install

(optional but recommended) Create a virtual environment and activate it

python3 -m venv env
source env/bin/activate

Install with pip (recommended)

pip3 install miSRA
# to test that it worked
miSRA --help

Run miSRA:

miSRA --config your_config.json

An example config.json could include the following parameters (for a detailed explanation of the different query modes, see miSRA modes ):

{
    "mode":"mirna",  # There are different modes to query miSRA (mirna, lib and spike). The mode mirna performs alignments to miRNA annotations using sRNAbench
    
    # mirna mode requires 2 miRNA annotation files, one for mature miRNAs and one for hairpins
    "mature":"mature_hsa.fa", # path to mature miRNA annotations in fasta format
    "hairpin":"hairpin_hsa.fa", # path to hairpin miRNA annotations in fasta format
    
    # you can specify which samples you want to profile either by specifying comma-separated SRA study or experiment accessions
    "studies":"SRP225193", # profile all samples from this study
    # "experiments":"SRX2349199,SRX2349197,SRX546025,SRX546026", # this would include these experiments
    
    "localOut":"RNAatlas", # local folder where the results will be downloaded to
    "mm": "1", # number of mismatches (optional)
    "alignType":"v" # bowtie alignment type (optional)
}

One could also download information about the database content. The following command will generate a file in the current working directory including the number of available samples and studies per species.

miSRA --db-stat

To obtain all samples and studies available for a specific species, you can do:

miSRA --taxonID TAXON_ID
## Where TAXON_ID is NCBI's taxonID. For instance, for human:
miSRA --taxonID 9606

miSRA modes:

There are 3 main modes to query samples in miSRA:

  • miRNA: mature and hairpin miRNA sequences are used for profiling
  • library: long reference sequences are used for profiling and mappings of reads to these sequences will be reported
  • spike: short reference sequences are provided and only exact matches will be reported

For more info, please visit the GitHub repository or the tool manual.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

misra-0.0.3.tar.gz (8.1 kB view details)

Uploaded Source

Built Distribution

misra-0.0.3-py3-none-any.whl (9.6 kB view details)

Uploaded Python 3

File details

Details for the file misra-0.0.3.tar.gz.

File metadata

  • Download URL: misra-0.0.3.tar.gz
  • Upload date:
  • Size: 8.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for misra-0.0.3.tar.gz
Algorithm Hash digest
SHA256 9a3f40ecf23e1ad92b6591fe4a96ed6fd70a018c0e5ec2ce51b2c3a7a0e00edc
MD5 e89f161dc6252c601c3bf1b34f6569af
BLAKE2b-256 e62ad5068ec4cd8a5721be420de8fb3565de1b2e3862f98cf55e193c8e28f89c

See more details on using hashes here.

File details

Details for the file misra-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: misra-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 9.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.9

File hashes

Hashes for misra-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 1e48ba33d807c5128c39c2cdbf5bfa4268458fef5a5161483da61d509343d633
MD5 6956e9a884f113ddac35bf4d611826ce
BLAKE2b-256 ae280c9c9220ef6e3df9723c238da56737632003d1ece76736696f4fab5bc745

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page