ai-kmu/Medical_CBAM_ViT

Medical_CBAM_ViT

Leveraging Inductive Bias in ViT for Medical Image Diagnosis

Official PyTorch implementation of Jungmin Ha, Euihyun Yoon, Sungsik Kim, Jinkyu Kim, and Jaekoo Lee, "Leveraging Inductive Bias in ViT for Medical Image Diagnosis", BMVC, 2024.

Description

[Figure: overview of the proposed model]

An overview of our proposed model. Built upon Vision Transformer, it uses three building blocks: (1) a Stem Block, (2) SWA Blocks in the 1st and 2nd stages, and (3) DA Blocks in the 3rd and 4th stages. For image classification, the output feature map passes through Global Average Pooling (GAP) and an MLP. For segmentation, the feature maps from the stages are fused with a Fused Feature Pyramid Network (FPN). (b, c, d) Detailed explanation of Local Attention, Shifted-Window Attention, and Deformable Attention.
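The stage layout described in the caption can be summarized as a small data structure. The following is a minimal sketch in plain Python, not the repository's code: the names `Stage` and `build_layout` and the string labels are hypothetical and chosen for illustration only. It records which block type each of the four stages uses and which head follows for each task.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    index: int  # 1-based stage number
    block: str  # attention block type used in this stage ("SWA" or "DA")


def build_layout(task):
    """Describe the pipeline for 'classification' or 'segmentation'.

    Hypothetical helper: it only mirrors the figure caption and
    does not construct any actual network layers.
    """
    if task not in ("classification", "segmentation"):
        raise ValueError(f"unknown task: {task!r}")
    # Stages 1-2 use SWA blocks, stages 3-4 use DA blocks.
    stages = [Stage(i, "SWA" if i <= 2 else "DA") for i in range(1, 5)]
    if task == "classification":
        head = "GAP -> MLP"
    else:
        head = "Fused FPN over stage feature maps"
    return {"stem": "Stem", "stages": stages, "head": head}
```

Calling `build_layout("classification")` returns the stem, the four stages, and the GAP plus MLP head; passing `"segmentation"` swaps in the FPN-based head.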

[Figure: results table]

Comparison of classification and segmentation performance on various datasets. Scores in parentheses are results obtained with the black-hat transform as preprocessing. Bold text indicates the best performance, and underlined text indicates the second-best performance among all models.

Requirements

  • PyTorch (> 1.2.0)
  • torchvision
  • numpy
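
One way to confirm that an environment meets the requirements listed above is a short check using only the Python standard library. This is a sketch, not part of the repository: the helper names `parse_version` and `check_requirements` are hypothetical, and it assumes the packages are installed under the distribution names `torch`, `torchvision`, and `numpy`.

```python
from importlib import metadata


def parse_version(text):
    """Turn a version string such as '2.1.0+cu118' into (2, 1, 0).

    Local build tags after '+' and non-numeric suffixes are ignored.
    """
    parts = []
    for piece in text.split("+")[0].split(".")[:3]:
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def check_requirements(min_torch=(1, 2, 0)):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    for name in ("torch", "torchvision", "numpy"):
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            problems.append(f"{name} is not installed")
            continue
        # The README asks for PyTorch strictly newer than 1.2.0.
        if name == "torch" and parse_version(installed) <= min_torch:
            problems.append(f"torch {installed} found, need > 1.2.0")
    return problems
```

Running `check_requirements()` in the target environment and printing the returned list shows any missing or outdated package before training is started.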
