HEMnet package
Project description
HEMnet - Haematoxylin & Eosin and Molecular neural network
Description
A deep learning automated cancer diagnosis software using molecular labelling to improve pathological annotation of Haematoxylin and Eosin (H&E) stained tissue.
Installation
-
Docker
You can download and run the docker image using the following commands:
docker pull andrewsu1/hemnet docker run -it andrewsu1/hemnet -
Conda
Install Openslide (this is necessary to open whole slide images) - download it here
Create a conda environment from the
environment.ymlfileconda env create -f environment.yml conda activate HEMnet
Usage
Slide Preparation
Name slides in the format: slide_id_TP53 for TP53 slides and slide_id_HandE for H&E slides
The TP53 and HandE suffix is used by HEMnet to identify the stain used.
1. Generate training and testing datasets
a. Generate train dataset
python HEMnet_train_dataset.py -b /path/to/base/directory -s relative/path/to/slides -o relative/path/to/output/directory -t relative/path/to/template_slide.svs -v
b. Generate test dataset
python HEMnet_test_dataset.py -b /path/to/base/directory -s /relative/path/to/slides -o /relative/path/to/output/directory -t relative/path/to/template_slide -m tile_mag -a align_mag -c cancer_thresh -n non_cancer_thresh
Other parameters:
-tis the relative path to the template slide from which all other slides will be normalised against. The template slide should be the same for each step.-mis the tile magnification. e.g. if the input is10then the tiles will be output at 10x-ais the align magnification. Paired TP53 and H&E slides will be registered at this magnification. To reduce computation time we recommend this be less than the tile magnification - a five times downscale generally works well.-ccancer threshold to apply to the DAB channel. DAB intensities less than this threshold indicate cancer.-nnon-cancer threshold to apply to the DAB channel. DAB intensities greater than this threshold indicate no cancer.
2. Train and evaluate model
a. Training model
python train.py -b /path/to/base/directory -t relative/path/to/training_tile_directory -l relative/path/to/validation_tile_directory -o /relative/path/to/output/directory -m cnn_base -g num_gpus -e epochs -a batch_size -s -w -f -v
Other parameters:
-mis CNN base model. eg.resnet50,vgg16,vgg19,inception_v3andxception.-gis number of GPUs for training.-eis training epochs. Default is100epochs.-ais batch size. Default is32-sis option to save the trained model weights.-wis option to used transfer learning. Model will used pre-trained weights from ImageNet at the initial stage.-fis fine-tuning option. Model will re-train CNN base.
b. Test model prediction
python test.py -b /path/to/base/directory -t relative/path/to/test_tile_directory -o /relative/path/to/output/directory -w model_weights -m cnn_base -g num_gpus -v
Other parameters:
-wis path to trained model. eg.trained_model.h5.-mis CNN base model (same to training step).-gis number of GPUs for prediction.
c. Evaluate model performance and visualise model prediction
python visualisation.py -b /path/to/base/directory -t /relative/path/to/training_output_directory -p /relative/path/to/test_output_directory -o /relative/path/to/output/directory -i sample
Other parameters:
-tis path to training outputs.-pis path to test outputs.-iis name of Whole Slide Image for visualisation.
3. Apply model to diagnose new images
python HEMnet_inference.py -s '/path/to/new/HE/Slides/' -o '/path/to/output/directory/' -t '/path/to/template/slide/' -nn '/path/to/trained/model/' -v
Predict on TCGA images with our pretrained model for colorectal cancer using google colab
Results
Citing HEMnet
The Team
Please contact Dr Quan Nguyen (quan.nguyen@uq.edu.au), Andrew Su (a.su@uqconnect.edu.au), and Xiao Tan (xiao.tan@uqconnect.edu.au) for issues, suggestions, and we are very welcome to collaboration opportunities.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file HEMnet-1.0.0.tar.gz.
File metadata
- Download URL: HEMnet-1.0.0.tar.gz
- Upload date:
- Size: 3.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6aac023efe1b1cff2d7b8fbd1a6f2c157adac00f3a66bc3fde64f42982ef1437
|
|
| MD5 |
f0664c6156eabbaf56c6a99a633f0387
|
|
| BLAKE2b-256 |
75a7812515f9ce4c908aaab5850d4801f9e49703bc25627f94d598ca624daa0a
|
File details
Details for the file HEMnet-1.0.0-py3-none-any.whl.
File metadata
- Download URL: HEMnet-1.0.0-py3-none-any.whl
- Upload date:
- Size: 3.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.54.1 CPython/3.7.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
367e417dd9639ce41ae17f23dab79fd54e67eeb786fe60091cbea952a61aa7c7
|
|
| MD5 |
ac3107353206cb3836772b0de98cffa9
|
|
| BLAKE2b-256 |
87df306349a9783a6ff24ce3df8761e9fae7637f2ee75343442d35df2477bdc7
|