Skip to main content

This is a multithreaded version of various BioServices functions

Project description

Multi Bioservices

PyPI - Version GitHub Workflow Status (with event) PyPI - Python Version Coveralls branch

Description

multi_bioservices is a utility package for COMO. Its purpose is to provie better multithreaded access to the Bioservices package. This is done by using ThreadPoolExecutors on the synchronous functions. Currently, the only function this package provides is a wrapper around the db2db function in Bioservices.

Installation

To install multi_bioservices, you can use pip:

pip install multi_bioservices

Usage

To use multi_bioservices, simply import the db2db function and call it with the relevant parameters

from multi_bioservices.biodbnet import db2db, InputDatabase, OutputDatabase, TaxonID

db2db(
    input_values=["1", "2", "3"],
    input_db=InputDatabase.GENE_ID,
    output_db=OutputDatabase.GENE_SYMBOL,
    taxon_id=TaxonID.HOMO_SAPIENS
)

Parameters

Parameter Type Required? Default Value Description
input_values list[str] Yes N/A The input values to convert
input_db InputDatabase Yes N/A The input type
output_db OutputDatabase or list[OutputDatabase] No tuple(OutputDatabase.GENE_SYMBOL.value,OutputDatabase.GENE_ID.value, OutputDatabase.CHROMOSOMAL_LOCATION.value The type to return
taxon_id TaxonID or int No TaxonID.HOMO_SAPIENS (9606) The taxonomy of the input type
quiet bool No False Should all output be suppressed?
remove_duplicates bool No False Should duplicates be removed from the returned dataframe?
cache bool No True Should cache be used?
delay int No 5 How long of a delay should be enforced if the API is accessed to quickly?
concurrency int No 8 (max 20) How many concurrent requests can be made at once?
batch_length int No 300 (max 500 if taxon_id is TaxonID.HOMO_SAPIENS) How many items should be converted at once?

Returns

multi_bioservices returns a dataframe with the input and output databases as column names. The index of the dataframe has been reset, starting at 0

An example dataframe is seen below

Index Gene ID Gene Symbol
0 0 -
1 1 A1BG
2 2 A2M
3 3 A2MP1
4 4 -

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multi_bioservices-3.1.2.tar.gz (8.2 kB view details)

Uploaded Source

Built Distribution

multi_bioservices-3.1.2-py3-none-any.whl (10.1 kB view details)

Uploaded Python 3

File details

Details for the file multi_bioservices-3.1.2.tar.gz.

File metadata

  • Download URL: multi_bioservices-3.1.2.tar.gz
  • Upload date:
  • Size: 8.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.10.11 Linux/6.2.0-1018-azure

File hashes

Hashes for multi_bioservices-3.1.2.tar.gz
Algorithm Hash digest
SHA256 60c4a5b6184429a2c02172f554214461696ef0ab0cf963f35bab5a47d99c9254
MD5 e576d354199ab79edb400f26df43767a
BLAKE2b-256 e90811548f650e324d0c148212333bb3d384217845b6caf3657246a8543bb460

See more details on using hashes here.

File details

Details for the file multi_bioservices-3.1.2-py3-none-any.whl.

File metadata

  • Download URL: multi_bioservices-3.1.2-py3-none-any.whl
  • Upload date:
  • Size: 10.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.10.11 Linux/6.2.0-1018-azure

File hashes

Hashes for multi_bioservices-3.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 e9d9dc7e7887500863dd7da56b97c0bfd819580a9f37c4e835d8c163cd5abd7c
MD5 a3338226e2799002dfe8955c386bed28
BLAKE2b-256 edc4ed3de205d32b5aa52a305357f54344de37cd59cd87f47e1d547bd29597a3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page