GitHub - amanmohanty/idl-nncompress

This repo contains work for the paper 'Efficient CNN-LSTM based Image Captioning using Neural Network Compression'

Team Members

Harshit Rampal
Aman Mohanty

Abstract

Modern Neural Networks are eminent in achieving state of the art performance on tasks under Computer Vision, Natural Language Processing and related verticals. However, they are notorious for their voracious memory and compute appetite which further obstructs their deployment on resource limited edge devices. In order to achieve edge deployment, researchers have developed pruning and quantization algorithms to compress such networks without compromising their efficacy. Such compression algorithms are broadly experimented on standalone CNN and RNN architectures while in this work, we present an unconventional end to end compression pipeline of a CNN-LSTM based Image Captioning model. The model is trained using VGG16 or ResNet50 as an encoder and an LSTM decoder on the flickr8k dataset. We then examine the effects of different compression architectures on the model and design a compression architecture that achieves a 71.3% reduction in model size, 73.1% reduction in inference time and a 7.7% increase in BLEU score as compared to its uncompressed counterpart.

Project Dependencies

tensorflow, tensorflow-model-optimization, keras, nltk, numpy, pickle, PIL, requests, BytesIO

Dataset

For this project, we train our model on the flickr8k dataset

Results

Performance comparison between baseline model and the quantized encoder quantized decoder model. The baseline image captioning model has VGG16 encoder and LSTM decoder and is trained on the flickr8k dataset.

Models	BLEU Score	Model Size(MB)	Inference time for 2000 samples(mins)
Baseline-Baseline	0.527	578.4	5.68
Quantized-Quantized	0.568	155.39	1.63

Generated Captions

Baseline: two dogs are playing together in the grass

Best: dog is running through the grass

Baseline: two men are playing soccer on the grass

Best: two men are playing soccer

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
code		code
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

This repo contains work for the paper 'Efficient CNN-LSTM based Image Captioning using Neural Network Compression'

Team Members

Abstract

Project Dependencies

Dataset

Results

Generated Captions

About

Releases

Packages

Contributors 2

Languages

amanmohanty/idl-nncompress

Folders and files

Latest commit

History

Repository files navigation

This repo contains work for the paper 'Efficient CNN-LSTM based Image Captioning using Neural Network Compression'

Team Members

Abstract

Project Dependencies

Dataset

Results

Generated Captions

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages