Skip to content

Data cleaning & preprocessing

Viet edited this page May 17, 2019 · 12 revisions

Basic scheme: https://drive.google.com/file/d/1ooGtgptBMmHt6cuFXs1TAkPeOpHFMQOL/view?usp=sharing

Augmentation: artificially increase the train data

2 possible approaches:

  • offline augmentation: Extend the present data set.
  • online augmentation (augmentation on the fly): Extend the data set on each mini-batch.

Different styles:

light augmentation: only flipping etc. heavier augmentation: see light and heavier augmentation (link)

Data augmentation (medium) introduction

light and heavier augmentation

Using GANs to generate new data for x-ray

Using GANs to improve CNN classification

Fast classification

Data augmentation techniques

Data augmentation techniques II

Preprocessing x-ray data