3D U-Net Convolution Neural Network with Keras

Overview of this fork and its branches

This fork of the original ellisdg repository is used to test and demonstrate TensorFlow Large Model Support (TFLMS). The training, prediction, and evaluation programs have been modified so that file names and training parameters can be set from the command line.

The original ellisdg code has been modified to use TensorFlow Keras (tf.keras), and has been updated to use later versions of TensorFlow, including TensorFlow 2 with its eager execution.

TensorFlow Large Model Support has several implementations, which have been provided with various versions of IBM Watson Machine Learning Community Edition or IBM PowerAI. This model has been updated to work with them over time and separate branches have been created for the different implementations.

Notable branches

  • current_lms - This branch contains changes to support the current implementation of TFLMS. It currently supports the TFLMS version available in the IBM Watson Machine Learning Community Edition 1.7.0 with source and documentation here: https://github.com/IBM/tensorflow-large-model-support
  • tflmsv2_tf2.0.0 - This branch contains changes to support running the model with TensorFlow 2.0.0 using the compat.v1 mode, while using the TFLMSv2 implementation.
  • tflmsv2 - This branch contains changes to support the TFLMS implementation named "TFLMSv2" which is provided with IBM PowerAI 1.6.0 and IBM Watson Machine Learning Community Edition 1.6.x.
  • tflmsv1 - This branch contains changes to support the TFLMS implementation named "TFLMSv1" which is provided in the IBM PowerAI 1.5.x releases and open sourced as a separate module in the tflmsv1 branch of https://github.com/IBM/tensorflow-large-model-support.
  • master - This branch is the original master as it was at the time the ellisdg repository was forked.

Tumor Segmentation Example

Background

The model was originally designed after this paper on volumetric segmentation with a 3D U-Net. The code was written to be trained on the BRATS data set for brain tumors, but it can be modified for use in other 3D applications.

Tutorial using BRATS Data

Training

  1. Download the BRATS 2017 GBM and LGG data. Place the unzipped folders in the brats/data/original folder.
  2. Build dependencies

The ANTs tooling that is used for preprocessing must be built from source, and the SimpleITK conda package must also be built before installation.

The following steps will build the SimpleITK conda package and place it in your local conda repository for future install:

git clone https://github.com/SimpleITK/SimpleITKCondaRecipe.git
cd SimpleITKCondaRecipe
conda build --python 3.6 recipe

The following steps will create a conda environment for building ANTs, install the cmake and gcc tools as conda packages, and then build the ANTs binaries in the ~/ants_build/bin/ants/bin/ directory:

conda create -n my_build_env python=3.6
conda activate my_build_env
conda install -y cmake gxx_linux-ppc64le=7
cd ~
mkdir ants_build
cd ants_build
git clone https://github.com/ANTsX/ANTs.git
cd ANTs
git checkout v2.3.1
mkdir -p ~/ants_build/bin/ants
cd ~/ants_build/bin/ants
cmake ~/ants_build/ANTs
make -j 120 ANTS
  3. Install dependencies:
conda install pytables lxml scikit-image scikit-learn scipy pandas
pip install nibabel nilearn nipype
conda install --use-local simpleitk

(nipype is required for preprocessing only)

  4. Add the location of the ANTs binaries to the PATH environment variable. If you built the dependencies as described above, this will be ~/ants_build/bin/ants/bin/

  5. Add the repository directory to the PYTHONPATH environment variable:

$ cd 3DUNetCNN
$ export PYTHONPATH=${PWD}:$PYTHONPATH
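The PATH and PYTHONPATH setup above can be sanity-checked before preprocessing. The following is a minimal sketch, not part of this repository: the helper names are hypothetical, and N4BiasFieldCorrection is assumed to be one of the ANTs binaries the preprocessing needs, so adjust the list to match your build.

```python
import os
import shutil


def missing_tools(tool_names, search_path=None):
    """Return the tool names that cannot be found on PATH (or on search_path)."""
    return [name for name in tool_names
            if shutil.which(name, path=search_path) is None]


def in_pythonpath(directory, environ=os.environ):
    """Return True if directory is listed in the PYTHONPATH environment variable."""
    entries = environ.get("PYTHONPATH", "").split(os.pathsep)
    target = os.path.abspath(directory)
    return any(entry and os.path.abspath(entry) == target for entry in entries)


if __name__ == "__main__":
    # Assumption: N4BiasFieldCorrection is among the required ANTs binaries.
    for name in missing_tools(["N4BiasFieldCorrection"]):
        print(f"not found on PATH: {name}")
    # Run from the repository root, matching the export command above.
    if not in_pythonpath(os.getcwd()):
        print("repository directory is not listed in PYTHONPATH")
```

The check reads the PYTHONPATH variable directly rather than sys.path, because sys.path always includes the current directory when Python is started interactively.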
  6. Convert the data to NIfTI format and perform image-wise normalization and correction:

cd into the brats subdirectory:

$ cd brats

Import the conversion function and run the preprocessing:

$ python
>>> from preprocess import convert_brats_data
>>> convert_brats_data("data/original", "data/preprocessed")

Note: By default the preprocessing will process 120 subjects at a time. You can modify the thread count variable NUM_FOLDER_PROCESS_THREADS in preprocess.py to change the concurrency.
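To illustrate what a thread count like NUM_FOLDER_PROCESS_THREADS controls, here is a minimal sketch of bounded concurrency over subject folders. It assumes a thread pool is used; the actual structure of preprocess.py may differ, and process_subjects and convert_one are hypothetical names introduced only for this example.

```python
from concurrent.futures import ThreadPoolExecutor

# Same name as the variable in preprocess.py; the value here is only an
# example for a machine with fewer cores than the default assumes.
NUM_FOLDER_PROCESS_THREADS = 8


def process_subjects(subject_folders, convert_one,
                     max_workers=NUM_FOLDER_PROCESS_THREADS):
    """Apply convert_one to every folder, at most max_workers at a time.

    convert_one is a hypothetical per-subject callable supplied by the caller.
    Results come back in the same order as subject_folders.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(convert_one, subject_folders))


if __name__ == "__main__":
    folders = [f"subject_{i:03d}" for i in range(20)]
    done = process_subjects(folders, lambda folder: folder.upper())
    print(len(done), "subjects processed")
```

Lowering the worker count reduces peak memory and CPU load at the cost of a longer preprocessing run.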

  7. Run the training:

To run training using an improved UNet model:

$ python train_isensee2017.py

The train_isensee2017.py program has command line parameters that allow changing many things including image resolution, output file names, profiling, and Large Model Support enablement. See python train_isensee2017.py --help for more information.

Write prediction images from the validation data

In the training above, part of the data was held out for validation purposes. To write the predicted label maps to file:

$ python predict.py

The predictions will be written in the prediction folder along with the input data and ground truth labels for comparison.

If you have trained the isensee2017 model with the default parameters, the model file will have been given a randomly generated name. The predict.py program supports command line parameters to specify input and output locations. See python predict.py --help for more information.

Write loss graph and validation score box plots

To create the loss graph and validation score box plot png files run:

$ python evaluate.py

The evaluate.py program supports one positional parameter that specifies the directory containing the predictions. If unspecified, it defaults to prediction, which is the default output directory of predict.py.
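An optional positional parameter with a default, as described above, can be expressed with argparse as follows. This is a sketch of the pattern only, not the code in evaluate.py; the argument name prediction_dir is an assumption.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(
        description="Plot the loss graph and validation score box plots.")
    # nargs="?" makes the positional argument optional; when it is omitted,
    # the default value is used instead.
    parser.add_argument("prediction_dir", nargs="?", default="prediction",
                        help="directory containing the predictions")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print("reading predictions from", args.prediction_dir)
```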

Results from patch-wise training using original UNet

[Figures: patch-wise training loss graph; patch-wise box plot of scores]

In the box plot above, the 'whole tumor' area is any labeled area. The 'tumor core' area corresponds to the combination of labels 1 and 4. The 'enhancing tumor' area corresponds to label 4. This is how the BRATS competition is scored. Both the loss graph and the box plot were created by running the evaluate.py script in the 'brats' folder after training had completed.
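The three scored regions described above can be derived from an integer label map as in the following sketch. It assumes label 0 is background and uses the standard Dice overlap as the score; the function names are hypothetical, and evaluate.py may compute its scores differently (for example in how empty regions are handled).

```python
import numpy as np


def region_masks(label_map):
    """Build the three boolean region masks from an integer label map."""
    labels = np.asarray(label_map)
    return {
        "whole tumor": labels > 0,               # any labeled voxel
        "tumor core": np.isin(labels, (1, 4)),   # labels 1 and 4 combined
        "enhancing tumor": labels == 4,          # label 4 only
    }


def dice(mask_a, mask_b):
    """Dice overlap 2*|A and B| / (|A| + |B|); nan when both masks are empty."""
    total = mask_a.sum() + mask_b.sum()
    if total == 0:
        return float("nan")
    return 2.0 * np.logical_and(mask_a, mask_b).sum() / total


def region_scores(truth, prediction):
    """Score each region of a predicted label map against the ground truth."""
    truth_masks = region_masks(truth)
    pred_masks = region_masks(prediction)
    return {name: dice(truth_masks[name], pred_masks[name])
            for name in truth_masks}
```

The same functions work unchanged on 3D arrays, since the masks and sums are computed over all voxels.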

Results from Isensee et al. 2017 model

I (ellisdg) also trained a model with the architecture as described in the 2017 BRATS proceedings on page 100. This architecture employs a number of changes to the basic UNet including an equally weighted dice coefficient, residual weights, and deep supervision. This network was trained using the whole images rather than patches. As the results below show, this network performed much better than the original UNet.

[Figures: Isensee training loss graph; Isensee box plot of scores]
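One common reading of the "equally weighted dice coefficient" mentioned above is a soft Dice score computed per label and then averaged, so that a small label counts as much as a large one. The sketch below shows that reading in NumPy. It is an assumption about the intent, not a copy of the loss function used by this repository; the (labels, x, y, z) layout and the smooth term are also assumptions.

```python
import numpy as np


def mean_label_dice(y_true, y_pred, smooth=1e-5):
    """Average soft Dice over labels for arrays shaped (labels, x, y, z)."""
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    # Sum over every axis except the leading label axis.
    spatial_axes = tuple(range(1, y_true.ndim))
    intersection = (y_true * y_pred).sum(axis=spatial_axes)
    denominator = y_true.sum(axis=spatial_axes) + y_pred.sum(axis=spatial_axes)
    # smooth keeps the ratio defined when a label is absent from both arrays.
    per_label = (2.0 * intersection + smooth) / (denominator + smooth)
    return float(per_label.mean())
```

Because each label contributes one term to the mean, missing a label that covers a single voxel lowers the score as much as missing one that covers the whole volume.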

TensorFlow Large Model Support

You can enable TensorFlow Large Model Support by passing command line parameters. Additional parameters control the input data files, image sizes, profiling, logging of LMS statistics, and more. See the training usage for more information:

python train_isensee2017.py --help

An example command to run the 320^3 size with TFLMS (possible on a 32GB GPU) is:

export TF_GPU_HOST_MEM_LIMIT_IN_MB=300000

numactl --cpunodebind=0 --membind=0 python train_isensee2017.py --lms --data_file_path=320_data.h5 --image_size 320
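A rough back-of-the-envelope calculation shows why a 320^3 volume strains GPU memory. The sketch below assumes 4 input channels, 16 feature maps at full resolution, and 4-byte float32 values; these figures are illustrative assumptions, not values read from train_isensee2017.py.

```python
MIB = 1024 ** 2


def tensor_bytes(edge_voxels, channels, bytes_per_value=4):
    """Bytes for one cubic tensor with the given edge length and channel count."""
    return edge_voxels ** 3 * channels * bytes_per_value


if __name__ == "__main__":
    # Assumptions: 4 input channels, 16 feature maps, float32 values.
    print(tensor_bytes(320, 4) / MIB, "MiB for one input volume")
    print(tensor_bytes(320, 16) / MIB, "MiB for one full-resolution feature tensor")
```

Under these assumptions a single input volume takes 500 MiB and a single 16-channel full-resolution activation takes 2000 MiB. Training keeps many such tensors alive for the backward pass, which is the situation that swapping tensors to host memory with TFLMS is meant to address.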

Using this code on other 3D datasets

If you want to train a 3D UNet on a different set of data, you can copy either the train.py or the train_isensee2017.py script and modify it to read in your data rather than the preprocessed BRATS data that it is currently set up to train on.

Pre-trained Models

The following Keras models were trained on the BRATS 2017 data:

Citations

GBM Data Citation:

  • Spyridon Bakas, Hamed Akbari, Aristeidis Sotiras, Michel Bilello, Martin Rozycki, Justin Kirby, John Freymann, Keyvan Farahani, and Christos Davatzikos. (2017) Segmentation Labels and Radiomic Features for the Pre-operative Scans of the TCGA-GBM collection. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2017.KLXWJJ1Q

LGG Data Citation:

  • Spyridon Bakas, Hamed Akbari, Aristeidis Sotiras, Michel Bilello, Martin Rozycki, Justin Kirby, John Freymann, Keyvan Farahani, and Christos Davatzikos. (2017) Segmentation Labels and Radiomic Features for the Pre-operative Scans of the TCGA-LGG collection. The Cancer Imaging Archive. https://doi.org/10.7937/K9/TCIA.2017.GJQ7R0EF