This project provides tools to accelerate the collection of data needed to address mathematical equation detection and optical character recognition (OCR) in handwritten notes. Handwritten notes pose several difficulties for automatic extraction of text and equations, mainly because of the inherent noise created when notes are taken on lined or squared sheets, rather than on a blank sheet of paper. Our goal is to have a Convolutional Neural Network (CNN) trained to remove square grids from handwritten notes and to obtain a text-only, noise-free image. The work was developed as part of the "Computer Vision and cognitive systems" examination at the University of Modena and Reggio Emilia.
Paper: Handwritten Notes Denoising
Presentation: Handwritten Notes Denoising
AUTHORs | CONTACTs | GITHUBs |
---|---|---|
Olmo Baldoni | [email protected] | olmobaldoni |
Cristian Bellucci | [email protected] | cleb98 |
Danilo Caputo | [email protected] | Ilodan |
The data generation folder contains all the scripts to generate the dataset and compute the mean and variance.
The Unet folder contains the codes for training and inferring the model.
The crop-warp folder contains the scripts for image warping.
The retrieval folder contains the code for image retrieval.
MIT