Skip to main content

Efficiently access the 'Bedrock Bio' library of open-access computational biology datasets.

Project description

bedrock-bio

Open-Access Computational Biology Datasets

Description

Efficiently access a curated library of open-access computational biology datasets. Tables support predicate pushdown and projection to the cloud storage backend, enabling quick, iterative access to otherwise massive, unwieldy tables.

bedrock_bio consists of three user-facing functions:

  • list_tables(): returns a list of available table identifiers
  • describe_table('<name>'): returns metadata, citation, and column definitions for a table
  • load_table('<name>', **filters): takes a table name and optional partition filters, and returns a lazy DuckDB relation

DuckDB methods (filter, select, limit) can be used on the relation returned by load_table to push down additional row filters and column selections to the storage backend.

Installation

To install the latest release from PyPI:

pip install bedrock-bio

Or install the current development version from GitHub:

pip install git+https://github.com/bedrock-bio/bedrock-bio-client.git@main#subdirectory=python

Examples

import bedrock_bio as bb

List available tables:

bb.list_tables()

Describe a table to see its metadata, citation, and columns:

bb.describe_table('ukb_ppp.pqtls')

Lazily load a table (optionally with partition filters for partitioned tables), select columns, and collect into an in-memory data frame:

df = bb.load_table('ukb_ppp.pqtls', ancestry='EUR', protein_id='A0FGR8', panel='Inflammation') \
  .select('chromosome, position, effect_allele, other_allele, beta, neg_log_10_p_value') \
  .fetchdf()

Dataset Requests

To request the addition of a new table to the library, open an issue.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bedrock_bio-1.3.1.tar.gz (3.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bedrock_bio-1.3.1-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file bedrock_bio-1.3.1.tar.gz.

File metadata

  • Download URL: bedrock_bio-1.3.1.tar.gz
  • Upload date:
  • Size: 3.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for bedrock_bio-1.3.1.tar.gz
Algorithm Hash digest
SHA256 04fa9a00908356dd7cf7d186179ded69bacfd13d01c59e62f607ae3b6ae58bc1
MD5 538472fb568f7bbd019d5f7687aa6009
BLAKE2b-256 dcf2d2f172e123ce1ea507fb02964c96a261500b6063d5ef41a320a66a2792cf

See more details on using hashes here.

File details

Details for the file bedrock_bio-1.3.1-py3-none-any.whl.

File metadata

  • Download URL: bedrock_bio-1.3.1-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for bedrock_bio-1.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 dd73583d7b1aa4ffd88758ce66d48cfa3ecb672b9ce15af76698cbd9ec01004d
MD5 f1b727091fb7bcab299ad36765436ea8
BLAKE2b-256 00165aa5fe92765d2c8411c137302c0a42bd6a62557f35ceca9c45da44964631

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page