BRACoD is a method to identify associations between bacteria and physiological variables in Microbiome data

Project description

BRACoD

Installation

Installation in python:

pip install BRACoD

There is also an R interface, which depends on the python version being installed. There is a helper function that will do it for you, but it might be easier to do it with pip.

devtools::install_github("ajverster/BRACoD/BRACoD.R")

Python Walkthrough

Simulate some data and normalize it

import BRACoD
sim_counts, sim_y, contributions = BRACoD.simulate_microbiome_counts(BRACoD.example_otu_data)
sim_relab = BRACoD.scale_counts(sim_counts)

Run BRACoD

trace = BRACoD.run_bracod(sim_relab, sim_y, n_sample = 1000, n_burn=1000, njobs=4)

Examine the diagnostics

BRACoD.convergence_tests(trace, sim_relab)

Examine the results

df_results = BRACoD.summarize_trace(trace, sim_counts.columns, 0.3)

Compare the results to the simulated truth

bugs_identified = df_results["bugs"].values
bugs_actual = np.where(contributions != 0)[0]

precision, recall, f1 = BRACoD.score(bugs_identified, bugs_actual)
print("Precision: {}, Recall: {}, F1: {}".format(precision, recall, f1))

Try with your real data. We have included some functions to help you threshold and process your data

df_counts = BRACoD.threshold_count_data(df_counts)
df_rel = BRACoD.scale_counts(df_counts)
df_rel, Y = remove_null(df_rel, Y)
trace = BRACoD.run_bracod(df_rel, Y, n_sample = 1000, n_burn=1000, njobs=4)
df_results = BRACoD.summarize_trace(trace, sim_counts.columns, 0.3)

R Walkthrough

Simulate some data and normalize it

library('BRACoD.R')
data(obesity)
r <- simulate_microbiome_counts(obesity)

sim_counts <- r[[1]]
sim_y <- r[[2]]
contributions <- r[[3]]
sim_relab <- scale_counts(sim_counts)

Run BRACoD

trace <- run_bracod(sim_relab, sim_y, n_sample = 1000, n_burn=1000, njobs=4)

Examine the diagnostics
```
convergence_tests(trace, sim_relab)
```

Examine the results

df_results <- summarize_trace(trace, colnames(sim_counts))

Compare the results to the simulated truth

bugs_identified <- df_results$bugs
bugs_actual <- which(contributions != 0)

r <- score(bugs_identified, bugs_actual)

precision <- r[[1]]
recall <- r[[2]]
f1 <- r[[3]]

print(sprintf("Precision: %.2f, Recall: %.2f, F1: %.2f",precision, recall, f1))

Try with your real data. We have included some functions to help you threshold and process your data

df_counts <- threshold_count_data(df_counts)
df_rel <- scale_counts(df_counts)
r <- remove_null(df_rel, Y)
df_rel <- r[[1]]
Y <- r[[2]]

trace <- run_bracod(df_rel, Y, n_sample = 1000, n_burn=1000, njobs=4)
df_results <- summarize_trace(trace, sim_counts.columns, 0.3)

Project details

Release history Release notifications | RSS feed

0.3.6

Mar 24, 2022

0.3.5

Feb 16, 2022

0.3.4

Feb 16, 2022

0.3.3

Jun 8, 2021

0.3.2

Jun 7, 2021

0.3.1

May 13, 2021

0.3.0

May 4, 2021

0.2.9

May 4, 2021

0.2.8

May 4, 2021

0.2.7

May 4, 2021

0.2.6

May 4, 2021

0.2.5

May 4, 2021

This version

0.2.4

Apr 28, 2021

0.2.3

Apr 27, 2021

0.2.2

Apr 23, 2021

0.2.1

Apr 23, 2021

0.2.0

Apr 21, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

BRACoD-0.2.4.tar.gz (62.4 kB view hashes)

Uploaded Apr 28, 2021 Source

Built Distribution

BRACoD-0.2.4-py3-none-any.whl (61.7 kB view hashes)

Uploaded Apr 28, 2021 Python 3

Hashes for BRACoD-0.2.4.tar.gz

Hashes for BRACoD-0.2.4.tar.gz
Algorithm	Hash digest
SHA256	`4780ddd89224431a4074d03102fb55266d7399e2cc4d4ad5eb6938d03cf32e3c`
MD5	`e4b9f05bb0bc9be86536563f4936200a`
BLAKE2b-256	`0ded5bbb8695e4b53edc72d5152dca3ff323283cddbb2a5ba7337f15583f9f96`

Hashes for BRACoD-0.2.4-py3-none-any.whl

Hashes for BRACoD-0.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`27aa363282b735bb3ca853bde817fdf36d12a40cea9bfa01d166b1e545b31bc8`
MD5	`4c9e11c00df92bf916d552153965d089`
BLAKE2b-256	`e79d977c245e8f0a86dc22a266cb9347467b40774e1db335a3494217ff8d8e10`