No project description provided
Project description
SAMBA (Smoothed imAge MicroBiome distAnce)
This code is attached to the paper "SAMBA: A Novel Smoothed Image-Based Metric for Microbial Sequencing Data Analysis". SAMBA is a novel microbial metric. SAMBA utilizes the iMic method to transform microbial data into images, incorporating phylogenetic structure and abundance similarity. This image-based representation enhances data visualization and analysis. Moreover, SAMBA employs a fast Fourier transform (FFT) with adjustable thresholding to smooth the images, reducing noise and accentuating meaningful information. Various distance metrics, such as SAM and MSE, can be applied to the processed images.
How to apply SAMBA
SAMBA's code is available at this GitHub as well as pypi.
SAMBA's GitHub
There is an example in example_use.py. You should follow the following steps:
-
Load the raw ASVs table in the following format: the first column names "ID", each row represents a sample and each column represents an ASV. The last row contains the taxonomy information, names "taxonomy".
df = pd.read_csv("example_data/for_preprocess.csv")
-
Apply the MIPMLP with the defaulting parameters (see MIPMLP for more explanations).
processed = MIPMLP.preprocess(df)
-
micro2matrix (translate microbiome into matrix according to iMic, and save the images in a prepared folder
folder = "example_data/2D_images" micro2matrix(processed, folder, save=True)
-
Calculate the distance matrix according to SAMBA One can choose the FFT cutoff (in the range of [0,1]), and the final metric (one of "sam","mse","d1","d2","d3").
DM = build_SAMBA_distance_matrix(folder,cutoff=CUTOFF,metric=METRIC)
SAMBA's Pypi
- Pip install the package
pip install samba-metric
- Apply SAMBA on a MIPMLP processed data:
from samba import *
processed = pd.read_csv("example_data.csv",index_col=0)
folder = "FOLDER_NAME"
micro2matrix(processed, folder, save=True)
# Calculate the distance matrix according to SAMBA
DM = build_SAMBA_distance_matrix(folder)
DM.to_csv(f"{folder}/samba_dists.csv")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for samba_metric-0.0.9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | fecc0b80396ee958349e496e6f14d950a00b1b87304849fb497879f97190eefc |
|
MD5 | 357b2d5be23fb164522ec692c901ef12 |
|
BLAKE2b-256 | f72bc3e69e94d6dcbaa70cc527913cf2aeae8759e0e3e1ad514b1a04f402edbe |