A deep learning framework for batch effect correction in biological data
Project description
BioBatchNet
Installation
Clone the Repository
Clone the repository to your local machine:
git clone https://github.com/Manchester-HealthAI/BioBatchNet](https://github.com/Manchester-HealthAI/BioBatchNet
Set Up the Environment
Create a virtual environment and install dependencies using environment.yml:
Using Conda:
conda env create -f environment.yml
conda activate bbn
BioBatchNet Usage
Enter BioBatchNet
cd BioBatchNet
Construct dataset
For the IMC dataset, place the dataset inside:
mv <your-imc-dataset> Data/IMC/
For scRNA-seq data, create a folder named gene_data inside the Data directory and place the dataset inside:
mkdir -p Data/gene_data/
mv <your-scrna-dataset> Data/scRNA-seq/
Batch effect correction
For IMC Data To process IMC data, run the following command to train BioBatchNet:
python imc.py -c config/IMC/IMMUcan.yaml
For scRNA-seq Data To process scRNA-seq data, modify the dataset, run the following command to train BioBatchNet:
python scrna.py -c config/IMC/macaque.yaml
CPC Usage
CPC utilizes the embedding output from BioBatchNet as input. The provided sample data consists of the batch effect corrected embedding of IMMUcan IMC data.
To use CPC, ensure you are running in the same environment as BioBatchNet.
All experiment results can be found in the following directory:
cd CPC/IMC_experiment
✅ Key Notes:
- CPC requires embeddings from BioBatchNet as input.
- Sample data includes batch-corrected IMMUcan IMC embeddings.
- Ensure the same computational environment as BioBatchNet before running CPC.
📂 Data Download Link
To use BioBatchNet for batch effect correction, you need to download the corresponding dataset and place it in the appropriate directory.
🔹 Download scRNA-seq Data
The scRNA-seq dataset is available on OneDrive. Click the link below to download:
🔹 Download IMC Data
The IMC dataset can be accessed from the Bodenmiller Group IMC datasets repository. Visit the link below to explore and download the datasets:
🔗 IMC Datasets - Bodenmiller Group
To Do List
- Data download link
- Checkpoint
- Benchmark method results
License
This project is licensed under the MIT License. See the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file biobatchnet-0.1.1.tar.gz.
File metadata
- Download URL: biobatchnet-0.1.1.tar.gz
- Upload date:
- Size: 26.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
77010c2117105368bf71aa6700f7593130ce3ca8736d65daf761b715a00e52a6
|
|
| MD5 |
a5904e3ab483b0ad76b9e827047a37a7
|
|
| BLAKE2b-256 |
0c403aad2ce26357a5479ad4424d1611e869bfc5be147aa24338f6e0ff1ee975
|
File details
Details for the file biobatchnet-0.1.1-py3-none-any.whl.
File metadata
- Download URL: biobatchnet-0.1.1-py3-none-any.whl
- Upload date:
- Size: 30.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.5
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1a7fb7a364354a98dcaec103651a0e60b1c93709bc37b4c4353e557b85c50ed4
|
|
| MD5 |
6510522a066e959671d4762fc5a90d54
|
|
| BLAKE2b-256 |
938dcd2914e506925baf81bed4f901bc2a30eda31a90efb9abf6a3bfbbdcae38
|