(43909856) ViT (Visual Transformer) for image classification of ADNI dataset #172

retinotopyproject · 2023-10-27T05:13:37Z

Hi there,

This fork of the 'topic-recognition' branch contains code for a ViT (a Visual Transformer) model. This is based on the ViT-S/16
model, from the IEEE conference paper "Scaling Vision Transformers" (Zhai et al., 2022).

This model was used to perform binary classification of images from the Alzheimer's Disease Neuroimaging Initiative (ADNI) MRI dataset. The ViT attempts to classify 2D MRI slices as belonging to either Cognitive Normal (NC) or Alzheimer's Detected (AD) patients.

For more detailed information about the model, its performance, and any dependencies/requirements for training or testing it, please see the README file. All files related to the ViT model, including the README, are in the directory location 'recognition/TRANSFORMER_43909856'.

The files added include:

dataset.py: Loads the ADNI dataset images and their corresponding image labels, and performs the required train/validation/test set splits. Also applies any required pre-processing or transforms to the data. A basic plot of the input images (after pre-processing transforms) can be generated from a method in this file.
modules.py: Contains the modules required to create the ViT model network and its sub-components.
train.py: Trains the model on the ADNI dataset over the specified number of epochs. Evaluates the model on the validation set during the training process, at the end of every epoch. Also contains methods for plotting training and validation loss (and validation accuracy) throughout training.
predict.py: Used to run inference (make predictions of classes) on the model. Also contains a method for generating a confusion matrix, based on the test set's predicted classes and empirical/observed classes.
Additional plots can be found in the 'plots' subdirectory.

LinfengLiu98 · 2023-11-09T02:55:02Z

This is an initial inspection, no action is required at this point

Difficulty: Hard

Readme:
Excellent

Commit messages: Excellent, detailed.

Code:

Uses Vision transformer
Well commented
Good model design

Functionality/Performance:

Test accuracy is 60.37%
Did patient-level split
Preprocssing data correctly
Performed data augmentation
Well done

shakes76 · 2023-11-20T23:28:18Z

Marking

Good Practice (Design/Commenting, TF/Torch Usage)

Adequate design and implementation
Good spacing and comments
Header blocks missing -1

Recognition Problem

Solves problem
Driver Script present
File structure present
Shows Usage & Demo & Visualisation & Data usage
Module present
Commenting
No Data leakage
Difficulty: Hard

Commit Log

Meaningful commit messages
Progressive commits used

Documentation

ReadMe acceptable/good
Model/technical explanation
Good Description and Comments
Markdown used and PDF submitted

Pull Request

Successful Pull Request (Working Algorithm Delivered on Time in Correct Branch)
No Feedback required
Request Description good

shakes76 · 2023-11-20T23:28:42Z

Please remove gitignore file for merge, does not affect grade only merge.

retinotopyproject · 2023-11-21T23:37:46Z

Hi @shakes76, I have resolved the merge conflict related to the .gitignore file. Thanks

wangzhaomxy · 2023-11-22T04:40:44Z

Feedback attempt and no feedback marks lost.

retinotopyproject added 30 commits October 16, 2023 10:28

Added gitignore file

e419e45

Added python file and README stubs

2d23cd1

Fixed incorrect file import

cfb7ce8

added file and readme stubs for 2D transformer

9f868fe

implemented multi-head-attention class/module

a548211

implemented Feed-Forward module/component

b38da5f

added class for combining FF and Attention modules

1822bde

added helper function for positional encoding

003959f

added ViT model implementation

16ad862

added dataset/model saving dirs to gitignore

12ab005

added dir to gitignore

6a22cc6

added ADNI dataset dir to gitignore

33c17fb

added basic class for ADNI dataset init

a0e06c9

changed ADNI data load to use ImageFolder (torch)

cb4b793

added basic training code for benchmark dataset

0c14c94

Added testing for benchmark data (Imagenette)

de1986b

fixed bugs with matmul and einops formatting

8032def

added patient-based stratified train-val split

c232ef8

removed some commented out code

9c57aa5

changed train/test scripts to use ADNI dataset

b99bdcc

added saving/output of basic model loss/metrics

1020b17

modified docstrings in ViT modules

dfefe2f

added opening PIL images, img transforms

74feab3

fixed bugs with multiprocessing (Windows devices)

4eb1d47

fixed issues with loading model/data for training

af3a607

fixed more bugs with multiprocessing on Windows

9982e84

added method for plotting ground truth images

4d3bde5

saves predictions/empirical class values (testing)

34365ec

saving of predicted & empirical classes (training)

f5fb21a

added dependencies info to README

688d11a

retinotopyproject added 19 commits October 26, 2023 21:04

added train test split info to README

d45511c

added more preprocessing info to README

ffd418d

fixed bugs with dataset load; modifying comments

8016a15

fixed bugs with test set loading for predictions

a769a7a

fixed bugs with train/val split & validation set

ed37c08

added to examples & improvements sections (README)

575d830

added basic READMEs to subdirs

9d49fdd

excluded READMEs from gitignore

e5e08d0

added validation accuracy plot

1e6e2cc

modified comments/docstrings

5877dd3

added model info, etc to README

c1077b3

fixed issue with array elements not sent to cpu

d8bc747

fixed issue with incorrect validation accuracy

acf9ed8

added reproducibility info, some results to README

fe7b59c

added confusion matrix plots for test set

5eacdac

added plot for precision-recall curve (test set)

a6bb88a

removed precision-recall plotting method

aa3ad35

added more plots and results to readme

9fb287c

minor tweaks/changes to train and test scripts

336a505

nathasha-naranpanawa added the Transformer label Oct 29, 2023

shakes76 added the Extension Extension approved label Nov 20, 2023

shakes76 added the question Further information is requested label Nov 20, 2023

Merge branch 'topic-recognition' into topic-recognition

a2aa69b

wangzhaomxy added Question_attempt and removed question Further information is requested labels Nov 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

(43909856) ViT (Visual Transformer) for image classification of ADNI dataset #172

(43909856) ViT (Visual Transformer) for image classification of ADNI dataset #172

retinotopyproject commented Oct 27, 2023

LinfengLiu98 commented Nov 9, 2023

shakes76 commented Nov 20, 2023

shakes76 commented Nov 20, 2023

retinotopyproject commented Nov 21, 2023

wangzhaomxy commented Nov 22, 2023

(43909856) ViT (Visual Transformer) for image classification of ADNI dataset #172

Are you sure you want to change the base?

(43909856) ViT (Visual Transformer) for image classification of ADNI dataset #172

Conversation

retinotopyproject commented Oct 27, 2023

LinfengLiu98 commented Nov 9, 2023

shakes76 commented Nov 20, 2023

Marking

Good Practice (Design/Commenting, TF/Torch Usage)

Recognition Problem

Commit Log

Documentation

Pull Request

shakes76 commented Nov 20, 2023

retinotopyproject commented Nov 21, 2023

wangzhaomxy commented Nov 22, 2023