Skip to content

This project aims to speed up data collection for mathematical equation detection and OCR by employing a CNN to eliminate square grids from handwritten notes, thus generating clean images for efficient text extraction.

Notifications You must be signed in to change notification settings

cleb98/Handwritten-notes-Denoising

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Handwritten-notes-Denoising

This project provides tools to accelerate the collection of data needed to address mathematical equation detection and optical character recognition (OCR) in handwritten notes. Handwritten notes pose several difficulties for automatic extraction of text and equations, mainly because of the inherent noise created when notes are taken on lined or squared sheets, rather than on a blank sheet of paper. Our goal is to have a Convolutional Neural Network (CNN) trained to remove square grids from handwritten notes and to obtain a text-only, noise-free image. The work was developed as part of the "Computer Vision and cognitive systems" examination at the University of Modena and Reggio Emilia.

Paper: Handwritten Notes Denoising

Presentation: Handwritten Notes Denoising

Aknowledgements

AUTHORs CONTACTs GITHUBs
Olmo Baldoni [email protected] olmobaldoni
Cristian Bellucci [email protected] cleb98
Danilo Caputo [email protected] Ilodan

Usage

The data generation folder contains all the scripts to generate the dataset and compute the mean and variance.

The Unet folder contains the codes for training and inferring the model.

The crop-warp folder contains the scripts for image warping.

The retrieval folder contains the code for image retrieval.

License

MIT

About

This project aims to speed up data collection for mathematical equation detection and OCR by employing a CNN to eliminate square grids from handwritten notes, thus generating clean images for efficient text extraction.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 98.3%
  • Python 1.6%
  • Shell 0.1%