DATA-SCIENCE-BOWL-2018

Find the nuclei in divergent images to advance medical discovery

Spot Nuclei. Speed Cures.

Imagine speeding up research for almost every disease, from lung cancer and heart disease to rare disorders. The 2018 Data Science Bowl offers our most ambitious mission yet: create an algorithm to automate nucleus detection.

We’ve all seen people suffer from diseases like cancer, heart disease, chronic obstructive pulmonary disease, Alzheimer’s, and diabetes. Many have seen their loved ones pass away. Think how many lives would be transformed if cures came faster.

By automating nucleus detection, you could help unlock cures faster—from rare disorders to the common cold.

Deep Learning Tutorial for Kaggle Find the nuclei in divergent images to advance medical discovery competition, using Keras

This tutorial shows how to use Keras library to build deep neural network for Find the nuclei in divergent images to advance medical discovery More info on this Kaggle competition can be found on https://www.kaggle.com/c/data-science-bowl-2018.

This deep neural network achieves ~0.302 score on the leaderboard based on test images, and can be a good staring point for further, more serious approaches.

The architecture was inspired by U-Net: Convolutional Networks for Biomedical Image Segmentation.

Overview

Data

  cd data
  mkdir stage1_train stage1_test
  unzip stage1_train.zip -d stage1_train/
  unzip stage1_test.zip -d stage1_test/

Data for the competition is available in the data folder.data_util.py just loads the images and saves them into NumPy binary format files .npy for faster loading later.

Pre-processing

The images are not pre-processed in any way,except resizing 256 x 256

Run the model

  python main.py

It will train,predict and generate submission file

Run the Data-science-bowl-2018 notebook on Google colab

Download the Data-science-bowl-2018.ipynb notebook from this repo
Goto Colab
Goto File-->Upload Notebook . Upload the notebook
Goto menu Runtime-->Change runtime and select HardWare accelerator GPU (Free Nvidia K80 GPU from google,it can run continues for 12hrs)
Execute all cells and download the submission files from colab (Codes included at end of Notebook for downloading files to local system from colab)

To learn more about colab Click Here

Training

The model is trained for 150 epochs,where each epoch took 8sec on NVIDIA K80 GPU

loss function used keras binary_cross_entropy

The weights are updated by Adam optimizer, with a 1e-5 learning rate.

Dependencies

skimage
Tensorflow
Keras >= 2.1.2
Pandas

Python version 3

Model

The provided model is basically a convolutional auto-encoder, but with a twist - it has skip connections from encoder layers to decoder layers that are on the same "level". See picture below (note that image size and numbers of convolutional filters in this tutorial differs from the original U-Net architecture).

This deep neural network is implemented with Keras functional API, which makes it extremely easy to experiment with different interesting architectures.

Output from the network is a 128 x 128 which represents mask that should be learned. Sigmoid activation function makes sure that mask pixels are in [0, 1] range.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
.gitattributes		.gitattributes
Data_Science_Bowl_2018.ipynb		Data_Science_Bowl_2018.ipynb
LICENSE		LICENSE
README.md		README.md
data_util.py		data_util.py
main.py		main.py
model.png		model.png
model.py		model.py
u-net-architecture.png		u-net-architecture.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DATA-SCIENCE-BOWL-2018

Find the nuclei in divergent images to advance medical discovery

Spot Nuclei. Speed Cures.

Deep Learning Tutorial for Kaggle Find the nuclei in divergent images to advance medical discovery competition, using Keras

Overview

Data

Pre-processing

Run the model

Run the Data-science-bowl-2018 notebook on Google colab

Training

Dependencies

Model

Model Constructed using KERAS API

About

Releases

Packages

Languages

License

fenglf/DATA-SCIENCE-BOWL-2018

Folders and files

Latest commit

History

Repository files navigation

DATA-SCIENCE-BOWL-2018

Find the nuclei in divergent images to advance medical discovery

Spot Nuclei. Speed Cures.

Deep Learning Tutorial for Kaggle Find the nuclei in divergent images to advance medical discovery competition, using Keras

Overview

Data

Pre-processing

Run the model

Run the Data-science-bowl-2018 notebook on Google colab

Training

Dependencies

Model

Model Constructed using KERAS API

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages