ExoDeepFinder is an original deep learning approach to localize macromolecules in cryo electron tomography images. The method is based on image segmentation using a 3D convolutional neural network.

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

ExoDeepFinder

ExoDeepFinder is an exocytosis event detection tool.

This work is based on DeepFinder which has been customized for TIRF microscopy.

Installation guide

ExoDeepFinder depends on Tensorflow which is only GPU-accelerated on Linux. There is currently no official GPU support for MacOS and native Windows, so the CPU will be used on those platform, but you can still use it (it will just be slower, yet the training might be very slow). On Windows, WSL2 can be used to run tensorflow code with GPU; see the install instructions for more information.

ExoDeepFinder binaries are available for Windows, Linux and Mac, so there is no need to install anything if you just want to use the graphical user interface. The Linux release is big (over 4Gb) because it contains the libraries required for the GPU acceleration. Thus they are split in two parts (ExoDeepFinder_Linux-x86_64_part1.tar.gz and ExoDeepFinder_Linux-x86_64_part2.tar.gz). To uncompress them, use the following command: tarcat ExoDeepFinder_Linux-x86_64_part*.tar.gz | tar -xvzf -.

Alternatively, to install ExoDeepFinder and use it with command lines, create and activate a virtual environment with python 3.11 or later (see the Virtual environments section for more information), install dependencies (on Linux only, see bellow), and run pip install exodeepfinder[GUI].

On Linux you will need to install wxPython dependencies manually (sudo apt install libgtk-3-dev, etc.) or use one precompiled wxPython version (use pip install -f https://extras.wxpython.org/wxPython4/extras/linux/gtk3/ubuntu-16.04 wxPython with your Ubuntu version number, or use conda install wxpython to install a compiled wxPython from conda).

Note that on Windows, the python command is often replaced by py and pip by py -m pip; so you migth need adapt the commands in this documentation depending on your system settings.

Usage

To detect exocytose events, you can either use the pretrained model (available in examples/analyze/in/net_weights_FINAL.h5) to generate segmetations, or you can annotate your exocytose movies and train your own model.

Here are all ExoDeepFinder commands (described later):

edf_convert_tiff_to_h5              # convert tiff folders to a single h5 file
edf_segment                         # segment a movie
edf_generate_annotation             # generate an annotation file from a segmentation by clustering it
edf_generate_segmentation           # generate a segmentation from an annotation file
edf_detect_spots                    # detect bright spots in movies
edf_merge_detector_expert           # merge the expert annotations with the detector segmentations for training
edf_structure_training_dataset      # structure dataset files for training
edf_train                           # train a new model
exodeepfinder                       # combine all above commands

The ExoDeepFinder main GUI enables to execute each of those commands (listed on the Actions panel, although the edf_ prefix is removed). The following documentation explains how to use ExoDeepFinder with command lines (when installed with pip install exodeepfinder), but you can use the available GUI if you prefer.

For more information about an ExoDeepFinder command, use the --help option (run edf_detect_spots --help to know more about edf_detect_spots).

To open a Graphical User Interface (GUI) for a given command, run it without any argument. For example, edf_segment opens a GUI which can execute the edf_segment command with the arguments specified with the graphical interface.

exodeepfinder runs any of the other command as a subcommand (for example exodeepfinder segment -m movie.h5 is equivalent to edf_segment -m movie.h5); and it opens a GUI for all other commands when called without any argument.

If you installed ExoDeepFinder as a developer (see [Development section](## Development)), all commands can either be called directly (edf_segment -m movie.h5) or with python and the proper path (python deepfinder/commands/segment.py -m movie.h5 when in the project root directory).

Exocytose events segmentation

ExoDeepFinder handles exocytose movies made from tiff files, where each tiff file is a frame of the movie, and their name ends with the frame number; like in the following structure:

exocytose_data/
├── movie1/
│   ├── frame_1.tiff
│   ├── frame_2.tiff
│   └── ...

The frame extensions can be .tif, .tiff, .TIF or .TIFF.

There is no constraint on the file names, but they must contain the frame number (the last number in the file name must be the frame number), and be in the tiff format (it could work with other format like .png since images are read with the skimage.io.imread() function of the scikit-image library). For example frame_1.tiff could also be named IMAGE32_1.TIF. Similarly, there is no constraint on the movie names. In addition, although there is no strict constraint on the file names, be aware that it is much simpler to work with simple file names with no space or special characters.

The movie folders (containing the frames in tiff format) can be converted into a single .h5 file with the edf_convert_tiff_to_h5 command. Most ExoDeepFinder commands take h5 files as input, so the first step is to convert the data to h5 format with the following command: edf_convert_tiff_to_h5 --tiff path/to/movie/folder/ --output path/to/output/movie.h5

You can also generate all your movie folders at once using the --batch option:

edf_convert_tiff_to_h5 --batch path/to/movies/ --output path/to/outputs/ --make_subfolder

where path/to/movies/ contains movies folders (which in turn contains tiff files). The --make_subfolder option enable to put all tiff files in a till/ subfolder; which is useful in batch mode. The --batch option enables to process multiple movie folders at once and work in the same way in all ExoDeepFinder commands.

The above command will turn the following file structure:

exocytose_data/
├── movie1/
│   ├── frame_1.tiff
│   ├── frame_2.tiff
│   └── ...
├── movie2/
│   ├── frame_1.tiff
│   └── ...
└── ...

into this one:

exocytose_data/
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   └── movie.h5
├── movie2/
│   ├── tiff/
│   |   ├── frame_1.tiff
│   │   └── ...
│   └── movie.h5
└── ...

To generate segmentations, you can either use ExoDeepFinder or napari-exodeepfinder.

To segment a movie, use: edf_segment --movie path/to/movie.h5 --model_weights examples/analyze/in/net_weights_FINAL.h5 --patch_size 160 --visualization

This will generate a segmentation named path/to/movie_semgmentation.h5 with the pretrained weigths in examples/analyze/in/net_weights_FINAL.h5 and patches of size 160. It will also generate visualization images.

See edf_segment --help for more information about the input arguments.

To cluster a segmentation and create an annotation file from it, use: edf_generate_annotation --segmentation path/to/movie_segmentation.h5 --cluster_radius 5

Using napari-exodeepfinder

The napari-exodeepfinder plugin can be used to compute predictions. Open the movie you want to segment in napari (it must be in h5 format). In the menu, choose Plugins > Napari DeepFinder > Segmentation to open the segmentation tools. Choose the image layer you want to segment. Select the examples/analyze/in/net_weights_FINAL.h5 net weights; or the path of the model weights you want to use for the segmentation. Use 3 for the number of class (0: background, 1: bright spots, 2: exocytose events), and 160 for the patch size. Choose an output image name (with the .h5 extension), then launch the segmentation.

Training

To train a model, your data should be organized in the following way:

exocytose_data/
├── movie1/
│   ├── frame_1.tiff
│   ├── frame_2.tiff
│   └── ...
├── movie2/
│   ├── frame_1.tiff
│   └── ...
└── ...

Convert movies to h5 format

For each movie, tiff files must be converted to a single .h5 using the edf_convert_tiff_to_h5 command:

edf_convert_tiff_to_h5 --batch path/to/exocytose_data/ --make_subfolder

This will change the exocytose_data structrue into the following one:

exocytose_data/
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   └── movie.h5
├── movie2/
│   ├── tiff/
│   |   ├── frame_1.tiff
│   │   └── ...
│   └── movie.h5
└── ...

Detect bright spots

Then, bright spots must be detected in each frame with a spot detector such as Atlas. The Atlas installation instructions are detailed in the repository.

Once atlas (or the detector of your choice) is installed, you can detect spots in each frame using the edf_detect_spots command:

edf_detect_spots --detector_path path/to/atlas/ --batch path/to/exocytose_data/

where path/to/atlas/ is the root path of atlas (containing the build/ directory with the binaries inside if you followed the default installation instructions).

This will generate detector_segmentation.h5 files (the semgentations of spots) in the movie folders:

exocytose_data/
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   ├── detector_segmentation.h5
│   └── movie.h5
├── movie2/
└── ...

You can make sure that the detector segmentations are correct by opening them in napari with the corresponding movie. Open both .h5 files in napari, put the detector_segmentation.h5 layer on top, then right-click on it and select "Convert to labels". You should see the detections in red on top of the movie.

Annotate exocytose events

Annotate the exocytose events in the movies with the napari-exodeepfinder plugin:

Follow the install instructions, and open napari.
In the menu, choose Plugins > Napari DeepFinder > Annotation to open the annotation tools.
Open a movie (for example exocytose_data/movie1/movie.h5).
Create a new points layer, and name it movie_1 (any name with the _1 suffix, since we want to annotate with the class 1).
In the annotation panel, select the layer you just created in the "Points layer" select box (you can skip this step and use the "Add points" and "Delete selected point" buttons from the layer controls).
You can use the Orthoslice view to easily navigate in the volume, by using the Plugins > Napari DeepFinder > Orthoslice view menu.
Scroll in the movie until you find and exocytose event.
If you opened the Orthoslice view, you can click on an exocytose event to put the red cursor at its location, then click the "Add point" button in the annotation panel to annotate the event.
You can also use the "Add points" and "Delete selected point" buttons from the layer controls.
When you annotated all events, save your annotations to xml by choosing the File > Save selected layer(s)... menu, or by using ctrl+S (command+S on a mac), and choose the Napadi DeepFinder (*.xml) format. Save the file beside the movie, and name it expert_annotation.xml (this should result in the exocytose_data/movie1/expert_annotation.xml with the above example).

Annotate all training and validation movies with this procedure; you should end up with the following folder structure:

exocytose_data
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   ├── detector_segmentation.h5
│   ├── expert_annotation.xml
│   └── movie.h5
├── movie2/
└── ...

Make sure that the expert_annotation.xml files you just created have the following format:

<objlist>
  <object tomo_idx="0" class_label="1" x="71" y="152" z="470"/>
  <object tomo_idx="0" class_label="1" x="76" y="184" z="445"/>
  <object tomo_idx="0" class_label="1" x="141" y="150" z="400"/>
  <object tomo_idx="0" class_label="1" x="200" y="237" z="420"/>
  <object tomo_idx="0" class_label="1" x="95" y="229" z="438"/>
  ...
</objlist>

The class_label must be 1, and tomo_idx must be 0.

Convert expert annotations to expert segmentations

Use the edf_generate_segmentation command to convert the annotations to segmentations:

edf_generate_segmentation --batch path/to/exocytose_data/

You will end up with the following structure:

exocytose_data/
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   ├── detector_segmentation.h5
│   ├── expert_annotation.xml
│   ├── expert_segmentation.h5
│   └── movie.h5
├── movie2/
└── ...

Again, you can check on napari that everything went right by opening all images and checking that expert_segmentation.h5 corresponds to expert_annotation.xml and the movie.

Merge detector and expert data

Then, merge detector detections with expert annotations with the edf_merge_detector_expert command:

edf_merge_detector_expert --batch path/to/exocytose_data/

This will create two new files merged_annotation.xml (the merged annotations) and merged_segmentation.h5 (the merged segmentations). The exocytose events are first removed from the detector segmentation (detector_segmentation.h5), then the remaining events (from the dector and the expert) are transfered to the merged segmentation (merged_segmentation.h5), with class 2 for exocytose events and class 1 for others events. The maximum number of other events in the annotation is 9800; meaning that if there are more than 9800 other events, only 9800 events will be picked randomly and the others will be discarded.

The exocytose_data/ folder will then follow this structure:

exocytose_data/
├── movie1/
│   ├── tiff/
│   │   ├── frame_1.tiff
│   │   ├── frame_2.tiff
│   │   └── ...
│   ├── detector_segmentation.h5
│   ├── expert_annotation.xml
│   ├── expert_segmentation.h5
│   ├── merged_annotation.xml
│   ├── merged_segmentation.h5
│   └── movie.h5
├── movie2/
└── ...

Again, make sure everything looks right in napari.

Organize training files

Finally, the training data should be organized in the following way:

dataset/
├── train/
│   ├── movie1.h5
│   ├── movie1_objl.xml
│   ├── movie1_target.h5
│   ├── movie2.h5
│   ├── movie2_objl.xml
│   ├── movie2_target.h5
...
└── valid/
    ├── movie3.h5
    ├── movie3_objl.xml
    ├── movie3_target.h5
...

Use edf_structure_training_dataset to convert the current folder structure into the training structure:

edf_structure_training_dataset --input path/to/exocytose_data/ --output path/to/dataset/

This will organize the files with the above structure, by putting 70% of the movies in the train/ folder, and 30% of them in the valid/ folder.

Make sure the output folder is correct, and that you can open its content in napari.

Train your custom model

Finally, launch the training with edf_train --dataset path/to/dataset/ --output path/to/model/.

Summary

Here is all the steps you should execute to train a new model:

Convert tiff frames to h5 file: edf_convert_tiff_to_h5 --batch path/to/exocytose_data/ --make_subfolder
Use napari-exodeepfinder to annotation exocytose events in the movies
Detect all spots: edf_detect_spots --detector_path path/to/atlas/ --batch path/to/exocytose_data/
Generate detector segmentations: edf_generate_segmentation --batch path/to/exocytose_data/
Merge expert and detector segmentation: edf_merge_detector_expert --batch path/to/exocytose_data/
Structure the files: edf_structure_training_dataset --dataset path/to/exocytose_data/ --training path/to/dataset/
Train the model: edf_train --dataset path/to/dataset/ --output path/to/model/

Virtual environments & package managers

There are two major ways of creating virtual environments in Python: venv and conda ; and two major ways of installing packages: pip and conda.

Virtual environment: venv & conda

The simplest way of creating a virtual environment in python is to use venv. Make sure your Python version greater or equal to 3.10, and simply run python -m venv ExoDeepFinder/ (py -m venv ExoDeepFinder/ on Windows) to create your environment (replace ExoDeepFinder by the name you want for your environment). Then run source ExoDeepFinder/bin/activate to activate it (ExoDeepFinder\Scripts\activate on Windows).

Alternatively, you can use Conda (or a nice minimalist alternative like Micromamba, see bellow) to create a Python 3.10 environment, even if your python version is different.

Once conda is installed, run conda create -n ExoDeepFinder python=3.10 to create the environment with python 3.10, and conda activate ExoDeepFinder to activate it.

Conda alternatives

The simplest way to install and use Conda is via Micromamba, which a minimalist drop-in replacement. Once you installed it, just use micromamba instead of conda for all you conda commands (some unusual commands might not be implemented in micromamba, but it is really sufficient for most use cases).

For example, run micromamba create -n ExoDeepFinder python=3.10 to create the environment with python 3.10, and micromamba activate ExoDeepFinder to activate it.

Package managers: pip & conda

The Numpy documentation explains the main differences between pip and conda:

The two main tools that install Python packages are pip and conda. Their functionality partially overlaps (e.g. both can install numpy), however, they can also work together. We’ll discuss the major differences between pip and conda here - this is important to understand if you want to manage packages effectively.

The first difference is that conda is cross-language and it can install Python, while pip is installed for a particular Python on your system and installs other packages to that same Python install only. This also means conda can install non-Python libraries and tools you may need (e.g. compilers, CUDA, HDF5), while pip can’t.

The second difference is that pip installs from the Python Packaging Index (PyPI), while conda installs from its own channels (typically “defaults” or “conda-forge”). PyPI is the largest collection of packages by far, however, all popular packages are available for conda as well.

The third difference is that conda is an integrated solution for managing packages, dependencies and environments, while with pip you may need another tool (there are many!) for dealing with environments or complex dependencies.

Development

To install ExoDeepFinder for development, clone the repository (git clone git@github.com:deep-finder/tirfm-deepfinder.git), create and activate a virtual environment (see section above), and install it with pip install -e ./tirfm-deepfinder/[GUI].

To generate the release binaries, install PyInstaller with pip install pyinstaller in your virtual environment ; and package ExoDeepFinder with pyinstaller exodeepfinder.spec. You must run this command on the destination platform (run on Windows for a Windows release, on Mac for a Mac release, and Linux for a Linux release).

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

This version

0.2.5

Jun 28, 2024

0.2.4

Jun 28, 2024

0.2.3

Jun 20, 2024

0.2.2

Jun 18, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

exodeepfinder-0.2.5.tar.gz (66.7 kB view hashes)

Uploaded Jun 28, 2024 Source

Built Distribution

exodeepfinder-0.2.5-py3-none-any.whl (67.3 kB view hashes)

Uploaded Jun 28, 2024 Python 3

Hashes for exodeepfinder-0.2.5.tar.gz

Hashes for exodeepfinder-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`a96ee91fb790bf4fd8d124238ca343724077b8788570f0212687273ff30340d2`
MD5	`c11f53f8249170e6517d31b418018049`
BLAKE2b-256	`a85e7602f53747e2a7c984bd3264b67db876af6f0a7010a4d0bc93d44812d3b2`

Hashes for exodeepfinder-0.2.5-py3-none-any.whl

Hashes for exodeepfinder-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8570323b276fc62fb023f82b69c04f482abc28be28dff3e4e6a660df367c10b3`
MD5	`6188b37847ab1e6a02abf2f13def88e0`
BLAKE2b-256	`cbabaa42943ac3a845e9d1a8e46546330e4f224717e41c7f1d1adba95a5f4447`