Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

quality in very low contrast regime #38

Open
bertsky opened this issue May 13, 2022 · 0 comments
Open

quality in very low contrast regime #38

bertsky opened this issue May 13, 2022 · 0 comments
Labels
question Further information is requested

Comments

@bertsky
Copy link
Contributor

bertsky commented May 13, 2022

I have material with typewritten forms that is very challenging (to any binarization method), because the typewriter sometimes fades out, while the printing ink near it blasts in a dark black. The scan/photography also seems to cause a non-normalized histogram:

  • original
    OCR-D-IMG_Ansiedlung_Korotschin_UZS_Sign_22a_0000
  • default-2021-03-09
    OCR-D-BIN_Ansiedlung_Korotschin_UZS_Sign_22a_0000 IMG-BIN
  • (after contrast normalization)
    OCR-D-BIN_Ansiedlung_Korotschin_UZS_Sign_22a_0000 IMG-BIN
  • (after +20% brightness)
    OCR-D-BIN_Ansiedlung_Korotschin_UZS_Sign_22a_0000 IMG-BIN
  • (after -30% brightness)
    OCR-D-BIN_Ansiedlung_Korotschin_UZS_Sign_22a_0000 IMG-BIN
  • Olena with Wolf's algorithm
    OCR-D-BIN-WOLF_Ansiedlung_Korotschin_UZS_Sign_22a_0000-BIN_wolf

So it seems that the autoencoder gets confused by the normalized image, but benefits from making the image even darker. May that be a general tendency (as in: if you loose fg, make it darker, and conversely if you get bg, make it brighter)? Can we derive any metrics that might hint at quality problems from the intermediate activation between encoder and decoder? Any recommendations/considerations?

@cneud cneud added the question Further information is requested label Sep 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants