Seamlessly import and query all of the beacon protocol collections
Project description
Beacon2 Import Toolkit
Overview
Beacon Import facilitates the seamless transfer of local genetic data to the Beacon database server, enhancing collaboration and knowledge sharing. Meanwhile, Beacon Query empowers users to efficiently explore the database for specific genetic information, including genes, sequences, variants, cnv, and genomic ranges. The search tools also search the phenopacket and other metadata b2ri collections (individuals, runs, datasets, biosamples, cohorts and analyses), catalyzing genomic research and clinical applications
Requirements
- Python 3.6 or newer. (download instructions)
Installation and update
Using Pip3
-
Install beacon2-import using pip3.
sudo pip3 install beacon2-import
-
Update beacon2-import, if needed, using pip3.
pip3 install beacon2-import --upgrade
-
Test your pip3 installation by running beacon2-import or beacon2-search.
beacon2-import --help beacon2-search --help
Using conda (bioconda channel)
-
Install beacon2-import using conda.
conda config --add channels bioconda conda config --add channels conda-forge conda install beacon2-import
-
Update beacon2-import, if needed, using conda.
conda update beacon2-import
-
Test your conda installation by running beacon2-import or beacon2-search.
beacon2-import --help beacon2-search --help
Usage - beacon2-import
usage: beacon2_import.py [-h] [-H DATABASE_HOST] [-P DATABASE_PORT] [-a]
[-A DATABASE_AUTH_CONFIG] [-g] [-u GALAXY_URL]
[-k GALAXY_KEY] [-d DATABASE] [-c COLLECTION]
[-i INPUT_JSON_FILE] [-s] [-o] [-D] [-V] [-ca] [-cc]
[-r REMOVED_COLL_NAME]
Input arguments
optional arguments:
-h, --help show this help message and exit
Connection to MongoDB:
-H DATABASE_HOST, --db-host DATABASE_HOST
Hostname/IP of the beacon database
-P DATABASE_PORT, --db-port DATABASE_PORT
Port of the beacon database
Addvanced Connection to MongoDB:
-a, --advance-connection
Connect to beacon database with authentication
-A DATABASE_AUTH_CONFIG, --db-auth-config DATABASE_AUTH_CONFIG
JSON file containing credentials/config e.g.{'db_auth_
source':'admin','db_user':'root','db_password':'exampl
e'}
Connection to Galaxy:
-g, --galaxy Import data from Galaxy
-u GALAXY_URL, --galaxy-url GALAXY_URL
Galaxy hostname or IP
-k GALAXY_KEY, --galaxy-key GALAXY_KEY
API key of a galaxy user WITH ADMIN PRIVILEGES
Database Configuration:
-d DATABASE, --database DATABASE
The targeted beacon database
-c COLLECTION, --collection COLLECTION
The targeted beacon collection from the desired
database
Import Json Arguments:
-i INPUT_JSON_FILE, --input_json_file INPUT_JSON_FILE
Input the local path to the JSON file or it's name on
your Galaxy Hitory to import to beacon
store origin:
-s, --store-origins Make a local file containing variantIDs with the
dataset they stem from
-o , --origins-file Full file path of where variant origins should be
stored (if enabled)
control output:
-D, --debug
-V, --verbose Be verbose
Clear beacon database:
-ca, --clearAll Delete all data before the new import
-cc, --clearColl Delete specific collection before the new import
-r REMOVED_COLL_NAME, --removeCollection REMOVED_COLL_NAME
Define the collection name for deletion
Usage - beacon2-search
usage: beacon2_search.py [-h]
{sequence,range,gene,bracket,analyses,biosamples,cohorts,datasets,individuals,runs,cnv}
...
Query Beacon Database
positional arguments:
{sequence,range,gene,bracket,analyses,biosamples,cohorts,datasets,individuals,runs,cnv}
sequence Connect to MongoDB and perform sequence-based querys
to the genomicVariations collection
range Connect to MongoDB and perform range-based querys to
the genomicVariations collection
gene Connect to MongoDB and perform geneID-based querys to
the genomicVariations collection
bracket Connect to MongoDB and perform bracket-based querys to
the genomicVariations collection
analyses Connect to MongoDB and query the analyses collection
biosamples Connect to MongoDB and query the biosample collection
cohorts Connect to MongoDB and query the cohorts collection
datasets Connect to MongoDB and query the datasets collection
individuals Connect to MongoDB and query the individuals
collection
runs Connect to MongoDB and query the runs collection
cnv Connect to MongoDB and query the copy number variants
(cnv) collection
optional arguments:
-h, --help show this help message and exit
Beacon Query Tool
This script provides a command-line interface for querying different collections in a Beacon Database using various sub-commands for sequence, range, gene ID, or bracket criteria.
Example Usage:
1. Query genomicVariations collection by sequence:
beacon_search sequence -d database_name -c collection_name -rn reference_name -s start -ab alternate_bases
2. Query genomicVariations collection by range:
beacon_search range -d database_name -c collection_name -rn reference_name -s start -e end -v variant_type
3. Query genomicVariations collection by gene ID:
beacon_search gene -d database_name -c collection_name -g gene_id -vmin variant_min_length -vmax variant_max_length
4. Query genomicVariations collection by bracket:
beacon_search bracket -d database_name -c collection_name -rn reference_name -smin start_minimum -smax start_maximum -emin end_minimum -emax end_maximum -v variant_type
5. Query analyses collection:
beacon_search analyses -d database_name -c collection_name -al aligner -ad analysis_date -bi biosample_id -id identification -ii individual_id -pn pipeline_name -pr pipeline_ref -ri run_id -vc variant_caller
6. Query biosamples collection:
beacon_search biosamples -d database_name -c collection_name -bs biosample_status -cd collection_date -cm collection_moment -id identification -op obtention_procedure -so sample_origin_type
7. Query cohorts collection:
beacon_search cohorts -d database_name -c collection_name -ct cohort_data_types -cd cohort_design -cz cohort_size -t cohort_type -id identification -g genders -n name
8. Query datasets collection:
beacon_search datasets -d database_name -c collection_name -o ontology -om ontology_modifiers -id identification -n name
9. Query individuals collection:
beacon_search individuals -d database_name -c collection_name -g age_group -do disease_ontology -f family_history -se severity -st stage -e ethnicity -go geographic_origin -id identification -as assay_code -s sex
10. Query runs collection:
beacon_search runs -d database_name -c collection_name -id identification -ii individual_id -ll library_layout -ls library_selection -s library_source -st library_strategy -p platform -pm platform_model -r run_date
11. Query for cnv:
beacon_search cnv -d database_name -c collection_name -vi variantInternalId -ai analysisId -ii individualId -s start -e end -ch chromosome -g gene -vt variantType -p primarySite -dt diseaseType -si variantStateId -vs variantState -sd sequenceId
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file beacon2_import-2.2.4.tar.gz
.
File metadata
- Download URL: beacon2_import-2.2.4.tar.gz
- Upload date:
- Size: 20.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.31.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 238fe76cc0a82ff1c3881ea4a376f664885761e0e211477064147209442f7201 |
|
MD5 | 9bfc872852ff78b5dee859a8d557a692 |
|
BLAKE2b-256 | cf2c8a8e0c20512b8abbe8c85c59b3ffc7ba7621860f27f0927e85034bd369af |
File details
Details for the file beacon2_import-2.2.4-py3-none-any.whl
.
File metadata
- Download URL: beacon2_import-2.2.4-py3-none-any.whl
- Upload date:
- Size: 27.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.4.2 requests/2.31.0 setuptools/45.2.0 requests-toolbelt/0.8.0 tqdm/4.30.0 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5404733feba852f7bb641e8175e0ce0470053467d0f5e440b4098ae3266ebb1d |
|
MD5 | ea19ca9297d699b1f099c6ae4857ae73 |
|
BLAKE2b-256 | 83c48a94585a5d96b0641597bf3d6d5212e4630f7e9ba55de0506fa926a57dea |