This is a multithreaded version of various BioServices functions
Multi Bioservices
Description
multi_bioservices is a utility package for COMO. Its purpose is to provide better multithreaded access to the Bioservices package. This is done by running the synchronous Bioservices functions through a ThreadPoolExecutor. Currently, the only function this package provides is a wrapper around the db2db function in Bioservices.
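The idea of running a synchronous function on a thread pool can be sketched as follows. This is a minimal illustration under stated assumptions, not the package's actual implementation: `chunked`, `convert_concurrently`, and `fake_convert_batch` are hypothetical names, and `fake_convert_batch` stands in for a blocking network call so the sketch runs offline.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Iterator


def chunked(values: list[str], batch_length: int) -> Iterator[list[str]]:
    """Yield consecutive slices of `values`, each at most `batch_length` long."""
    for start in range(0, len(values), batch_length):
        yield values[start:start + batch_length]


def convert_concurrently(
    convert_batch: Callable[[list[str]], dict[str, str]],
    values: list[str],
    batch_length: int = 300,
    concurrency: int = 8,
) -> dict[str, str]:
    """Run a blocking, per-batch conversion function on a thread pool."""
    merged: dict[str, str] = {}
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        # Executor.map yields results in submission order, so the merged
        # mapping is deterministic even if batches finish out of order.
        for partial in pool.map(convert_batch, chunked(values, batch_length)):
            merged.update(partial)
    return merged


def fake_convert_batch(batch: list[str]) -> dict[str, str]:
    """Hypothetical stand-in for a network call; no real lookup happens."""
    return {value: f"SYMBOL_{value}" for value in batch}


result = convert_concurrently(
    fake_convert_batch, ["1", "2", "3", "4", "5"], batch_length=2, concurrency=2
)
print(result)
```

Threads suit this kind of work because each batch spends most of its time waiting on I/O, so several requests can be in flight at once without changing the synchronous function itself.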
Installation
To install multi_bioservices, you can use pip:
pip install multi_bioservices
Usage
To use multi_bioservices, simply import the db2db function and call it with the relevant parameters:
from multi_bioservices.biodbnet import db2db, InputDatabase, OutputDatabase, TaxonID
db2db(
    input_values=["1", "2", "3"],
    input_db=InputDatabase.GENE_ID,
    output_db=OutputDatabase.GENE_SYMBOL,
    taxon_id=TaxonID.HOMO_SAPIENS,
)
Parameters
Parameter | Type | Required? | Default Value | Description |
---|---|---|---|---|
input_values | list[str] | Yes | N/A | The input values to convert |
input_db | InputDatabase | Yes | N/A | The input type |
output_db | OutputDatabase or list[OutputDatabase] | No | tuple(OutputDatabase.GENE_SYMBOL.value, OutputDatabase.GENE_ID.value, OutputDatabase.CHROMOSOMAL_LOCATION.value) | The type to return |
taxon_id | TaxonID or int | No | TaxonID.HOMO_SAPIENS (9606) | The taxonomy of the input type |
quiet | bool | No | False | Should all output be suppressed? |
remove_duplicates | bool | No | False | Should duplicates be removed from the returned dataframe? |
cache | bool | No | True | Should cache be used? |
delay | int | No | 5 | How long of a delay should be enforced if the API is accessed too quickly? |
concurrency | int | No | 8 (max 20) | How many concurrent requests can be made at once? |
batch_length | int | No | 300 (max 500 if taxon_id is TaxonID.HOMO_SAPIENS) | How many items should be converted at once? |
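One way to picture the delay parameter is as a pause before retrying a request that the API rejected for arriving too quickly. The sketch below is an assumption-laden illustration, not the package's code: `TooManyRequests`, `call_with_delay`, `max_attempts`, and `flaky_request` are hypothetical, and the delay is assumed to be in seconds, which the table does not state.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TooManyRequests(Exception):
    """Hypothetical error for a request rejected as too frequent."""


def call_with_delay(request: Callable[[], T], delay: float = 5, max_attempts: int = 3) -> T:
    """Call `request`, sleeping `delay` (assumed seconds) between rejected attempts."""
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except TooManyRequests:
            if attempt == max_attempts:
                raise  # out of attempts: surface the error to the caller
            time.sleep(delay)
    raise ValueError("max_attempts must be at least 1")


attempts = {"count": 0}


def flaky_request() -> str:
    """Hypothetical request that is rejected twice before succeeding."""
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise TooManyRequests
    return "ok"


# delay=0 keeps the demonstration instant; a real caller would wait longer.
outcome = call_with_delay(flaky_request, delay=0, max_attempts=5)
print(outcome, attempts["count"])
```

A fixed delay is the simplest policy; whether the package retries, and how many times, is not stated in the table above.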
Returns
The db2db function returns a dataframe with the input and output databases as column names. The index of the dataframe has been reset and starts at 0.
An example dataframe is shown below:
Index | Gene ID | Gene Symbol |
---|---|---|
0 | 0 | - |
1 | 1 | A1BG |
2 | 2 | A2M |
3 | 3 | A2MP1 |
4 | 4 | - |
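In the example table, some rows carry `-` in the output column. Reading `-` as "no match found" is an assumption drawn from the table, not something the text confirms. Under that assumption, matched and unmatched inputs could be separated as sketched below. The rows are a plain list of dictionaries standing in for the returned dataframe so the sketch runs without extra dependencies, and `split_matches` is a hypothetical helper.

```python
# Stand-in for the returned dataframe, copied from the example table.
rows = [
    {"Gene ID": "0", "Gene Symbol": "-"},
    {"Gene ID": "1", "Gene Symbol": "A1BG"},
    {"Gene ID": "2", "Gene Symbol": "A2M"},
    {"Gene ID": "3", "Gene Symbol": "A2MP1"},
    {"Gene ID": "4", "Gene Symbol": "-"},
]

# Assumption: "-" marks an input value with no result in the output database.
PLACEHOLDER = "-"


def split_matches(
    rows: list[dict[str, str]], input_column: str, output_column: str
) -> tuple[list[dict[str, str]], list[str]]:
    """Return (rows with a real output value, input values that had none)."""
    matched = [row for row in rows if row[output_column] != PLACEHOLDER]
    unmatched = [row[input_column] for row in rows if row[output_column] == PLACEHOLDER]
    return matched, unmatched


matched, unmatched = split_matches(rows, "Gene ID", "Gene Symbol")
print([row["Gene Symbol"] for row in matched])
print(unmatched)
```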