sarcasm-detector

This project implements a multimodal sarcasm detection model that combines image and text features to classify different types of sarcasm. The model is designed to effectively fuse features from multiple modalities, handle imbalanced datasets, and improve generalization using auxiliary tasks and specialized loss functions.

Features of the Architecture

Low-Rank Fusion Module: Compresses and combines features from the text and image embeddings to reduce dimensionality while maintaining essential information.
Text Encoder: The model uses a pre-trained text encoder ViSoBERT
Image Encoder: A CLIP-based image encoder extracts visual features from input images.

Overcoming Imbalanced Datasets

Focal Loss: This loss function gives more focus to minority classes (e.g., text-sarcasm and image-sarcasm) by dynamically scaling the gradient of easy samples.
Auxiliary Losses:
- For text-sarcasm: The model ensures text-based features dominate the prediction by leveraging text-specific auxiliary outputs.
- For image-sarcasm: Fused embeddings are emphasized for prediction, ensuring better understanding of visual sarcasm.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
checkpoints		checkpoints
data		data
models		models
text_finetune		text_finetune
utils		utils
.editorconfig		.editorconfig
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
dataset.py		dataset.py
ocr_extract.py		ocr_extract.py
requirements.txt		requirements.txt
train.py		train.py
visualize.py		visualize.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

sarcasm-detector

Features of the Architecture

Overcoming Imbalanced Datasets

About

Releases

Packages

Languages

License

LongBaoCoder2/sarcasm-detector

Folders and files

Latest commit

History

Repository files navigation

sarcasm-detector

Features of the Architecture

Overcoming Imbalanced Datasets

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages