Bioinformatics Reference Data Manager is an application used to download, backup and update of reference data, required for various bioinformatics analysis

These details have not been verified by PyPI

Project links

Homepage

Project description

Bioinformatics Reference Data Manager (BRDM)

Description

BRDM is an application used to automatically update, backup and restore the reference data that required for bioinformatic analysis.

Requirements

miniconda

Deployment Procedures

Download rdm_env_setting.yaml from github

  wget https://raw.githubusercontent.com/AAFC-BICoE/reference-data-manager/master/rdm_env_setting.yaml

Create the conda environment for the program

  conda env create -n rdm_env --file rdm_env_setting.yaml

Run the program

Set up the config file
- The location of the sample configuration file config.yaml.sample
```
  /path/to/conda/envs/lib/python3.6/site-package/brdm
```
- Default option to set up the config file: Copy the sample configuration to config.yaml and modify config.yaml
```
  cd /path/to/conda/envs/lib/python3.6/site-package/brdm
  cp config.yaml.sample config.yaml
  nano config.yaml
```
- If the location or the name of your configuration file is different with that of default option, the path of your config file has to be provided by argument --config-file
view the options of the program

  source activate rdm_env
  brdm -h

Run the program

  source activate rdm_env
  brdm [option]
  (an example: brdm --update-ncbi-blast)

Some suggestions for executing the program

All the NCBI database should be updated outside of business hours. Abuse of the Entrez or NCBI services can lead to temporary loss of access.
Construct the precise queries for NCBI subsets
1. The purpose of the NCBI subsets is to construct database for specific markers in a group of taxa; it is suggested to provide information such as names of the markers, taxa and range of the sequence length in a query.
2. Confirm and refine your queries by testing them on NCBI website.
3. Due to the approach of constructing the subset database, sequences (e.g. wgs) that not exist in NCBI nt database cannot be included in subset database. The condition such as "and not wgs" is suggested to be added in your queries, in order to get the appropriate accession IDs and sequences.
  1. The approach of constructing the subset database: Only accession IDs for each subsets are downloaded directly from NCBI; Sequences are retrieved from NCBI nt blast database;
  2. Ncbi nt blast database consists of GenBank+EMBL+DDBJ+PDB+RefSeq sequences, but excludes EST, STS, GSS, WGS, TSA, patent sequences as well as phase 0, 1, and 2 HTGS sequences.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.8.4.2

Feb 17, 2021

0.8.4

Feb 17, 2021

0.8.3

Dec 8, 2020

0.8.2

Nov 19, 2019

0.8.1

Jan 3, 2019

0.8

Dec 28, 2018

0.6

Dec 21, 2018

0.5

Dec 19, 2018

0.3.1

Oct 22, 2018

0.3.0

Oct 19, 2018

0.2.9

Oct 19, 2018

0.2.8

Oct 19, 2018

0.2.6

Oct 19, 2018

0.2.3

Oct 19, 2018

0.2.2

Oct 19, 2018

0.2.1

Oct 18, 2018

0.2.0

Oct 18, 2018

0.1.9

Oct 16, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

reference-data-manager-0.8.4.2.tar.gz (27.5 kB view details)

Uploaded Feb 17, 2021 Source

File details

Details for the file reference-data-manager-0.8.4.2.tar.gz.

File metadata

Download URL: reference-data-manager-0.8.4.2.tar.gz
Upload date: Feb 17, 2021
Size: 27.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.22.0 setuptools/40.5.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for reference-data-manager-0.8.4.2.tar.gz
Algorithm	Hash digest
SHA256	`0a78ecf1572140cf31b76fe0bab824256192b1b1d6b78179b43785b1d36a8364`
MD5	`b35dd67d42de5c45b6327e5244cf7459`
BLAKE2b-256	`2b064dff994593d91c44fa826d8b5eec321340414719c568c172b05afc19924b`

See more details on using hashes here.

reference-data-manager 0.8.4.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Bioinformatics Reference Data Manager (BRDM)

Description

Requirements

Deployment Procedures

Run the program

Some suggestions for executing the program

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes