Skip to main content

A deep learning framework for batch effect correction in biological data

Project description

BioBatchNet

Installation

Clone the Repository

Clone the repository to your local machine:

git clone https://github.com/Manchester-HealthAI/BioBatchNet](https://github.com/Manchester-HealthAI/BioBatchNet

Set Up the Environment

Create a virtual environment and install dependencies using environment.yml:

Using Conda:

conda env create -f environment.yml
conda activate bbn

BioBatchNet Usage

Enter BioBatchNet

cd BioBatchNet

Construct dataset

For the IMC dataset, place the dataset inside:

mv <your-imc-dataset> Data/IMC/

For scRNA-seq data, create a folder named gene_data inside the Data directory and place the dataset inside:

mkdir -p Data/gene_data/
mv <your-scrna-dataset> Data/scRNA-seq/

Batch effect correction

For IMC Data To process IMC data, run the following command to train BioBatchNet:

python imc.py -c config/IMC/IMMUcan.yaml

For scRNA-seq Data To process scRNA-seq data, modify the dataset, run the following command to train BioBatchNet:

python scrna.py -c config/IMC/macaque.yaml

CPC Usage

CPC utilizes the embedding output from BioBatchNet as input. The provided sample data consists of the batch effect corrected embedding of IMMUcan IMC data.

To use CPC, ensure you are running in the same environment as BioBatchNet.
All experiment results can be found in the following directory:

cd CPC/IMC_experiment

Key Notes:

  • CPC requires embeddings from BioBatchNet as input.
  • Sample data includes batch-corrected IMMUcan IMC embeddings.
  • Ensure the same computational environment as BioBatchNet before running CPC.

📂 Data Download Link

To use BioBatchNet for batch effect correction, you need to download the corresponding dataset and place it in the appropriate directory.

🔹 Download scRNA-seq Data

The scRNA-seq dataset is available on OneDrive. Click the link below to download:

🔗 Download scRNA-seq Data

🔹 Download IMC Data

The IMC dataset can be accessed from the Bodenmiller Group IMC datasets repository. Visit the link below to explore and download the datasets:

🔗 IMC Datasets - Bodenmiller Group

To Do List

  • Data download link
  • Checkpoint
  • Benchmark method results

License

This project is licensed under the MIT License. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biobatchnet-0.1.1.tar.gz (26.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

biobatchnet-0.1.1-py3-none-any.whl (30.9 kB view details)

Uploaded Python 3

File details

Details for the file biobatchnet-0.1.1.tar.gz.

File metadata

  • Download URL: biobatchnet-0.1.1.tar.gz
  • Upload date:
  • Size: 26.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for biobatchnet-0.1.1.tar.gz
Algorithm Hash digest
SHA256 77010c2117105368bf71aa6700f7593130ce3ca8736d65daf761b715a00e52a6
MD5 a5904e3ab483b0ad76b9e827047a37a7
BLAKE2b-256 0c403aad2ce26357a5479ad4424d1611e869bfc5be147aa24338f6e0ff1ee975

See more details on using hashes here.

File details

Details for the file biobatchnet-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: biobatchnet-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 30.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for biobatchnet-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1a7fb7a364354a98dcaec103651a0e60b1c93709bc37b4c4353e557b85c50ed4
MD5 6510522a066e959671d4762fc5a90d54
BLAKE2b-256 938dcd2914e506925baf81bed4f901bc2a30eda31a90efb9abf6a3bfbbdcae38

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page