Batch Effect Correction framework for metagenomic data
Project description
Batch Effect Correction framework for metagenomic data
Table of Contents
- About the project
- Installation
- Features
- Usage
- Documentation
- License
- Contributing
- Credits and acknowledgements
- Contact and feedback
About the project
The integration of metagenomic data from multiple studies and experimental conditions is essential to understand the interactions between microbial communities in complex biological systems, but the inherent diversity and biological complexity pose methodological challenges that require refined strategies for atlas-level integration. ABaCo, a family of generative models based on Variational Autoencoders (VAEs) combined with an adversarial training, aim for the integration of metagenomic data from different studies by minimizing technical heterogeneity conserving biological significance. The VAE encodes the data into a latent space, while the discriminator is trained to detect the provenance of the data, eliminating variability associated with its origin; concurrently, the data is modeled using distributions suitable for raw counts, and the latent space follows a clustering prior to ensure biological conservation.
An overview of the ABaCo workflow is shown in the figure below:
Installation
[!TIP] It is recommended to install ABaCo inside a virtual environment to manage depenendencies and avoid conflicts with existing packages. You can use the virtual environment manager of your choice, such as
poetry,conda, orpipenv.
Pip
ABaCo is available on PyPI and can be installed using pip:
pip install abaco
You can also install the package for development by cloning this repository and running the following command:
[!WARNING] We assume you are in the root directory of the cloned repository when running this command. Otherwise, you need to specify the path to the
abacodirectory.
pip install -e .
Features
Usage
Documentation
ABaCo's documentation is hosted on Read the Docs. It includes detailed examples, configuration options, and the API reference.
License
The code in this repository is licensed under the MIT License, allowing you to use, modify, and distribute it freely as long as you include the original copyright and license notice.
The documentation and other creative content are licensed under the Creative Commons Attribution 4.0 International (CC BY 4.0) License, meaning you are free to share and adapt it with proper attribution.
Full details for both licenses can be found in the LICENSE file.
Contributing
ABaCo is an open-source project, and we welcome contributions of all kinds via GitHub issues and pull requests. You can report bugs, suggest improvements, propose new features, or implement changes. Please follow the guidelines in the CONTRIBUTING file to ensure that your contribution is easily integrated into the project.
Credits and acknowledgements
- ABaCo was developed by the Multiomics Network Analytics Group (MoNA) at the Novo Nordisk Foundation Center for Biosustainability (DTU Biosustain).
Contact and feedback
We appreciate your feedback! If you have any comments, suggestions, or run into issues while using ABaCo, feel free to open an issue in this repository. Your input helps us make ABaCo better for everyone.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file abaco-0.1.1.tar.gz.
File metadata
- Download URL: abaco-0.1.1.tar.gz
- Upload date:
- Size: 5.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fa2c58a3a1d1d8263607602357fb91d7f223fd5a0efa8ff7709fa5b12e2e7c22
|
|
| MD5 |
d62c727a75bf7cca8147530cbac98ed5
|
|
| BLAKE2b-256 |
db61ce589c8a8fa1fc163b3e55cfeb7b274c51315b3fde82046bc5a222b26ba1
|
Provenance
The following attestation bundles were made for abaco-0.1.1.tar.gz:
Publisher:
cicd.yml on Multiomics-Analytics-Group/abaco
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
abaco-0.1.1.tar.gz -
Subject digest:
fa2c58a3a1d1d8263607602357fb91d7f223fd5a0efa8ff7709fa5b12e2e7c22 - Sigstore transparency entry: 776909851
- Sigstore integration time:
-
Permalink:
Multiomics-Analytics-Group/abaco@3c63675faa9a7bb7c423e4f0bc86bbcf5c05fff8 -
Branch / Tag:
refs/tags/0.1.1 - Owner: https://github.com/Multiomics-Analytics-Group
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
cicd.yml@3c63675faa9a7bb7c423e4f0bc86bbcf5c05fff8 -
Trigger Event:
push
-
Statement type: