get_denoised adding spaces in output so half sentances is not coming in output #17

kbrajwani · 2019-10-24T07:45:47Z

[AM] Torserane TVcom in Bankiner wo Finnnce
[D ] T o r e v e n g e T o m i n B a

see am is full sentance but [D ] is cutting down sentance. tell me where can i change the code to get correct output.

jonomon · 2019-10-24T17:47:41Z

Hi kbrajwani,
[AM] is the correct sentence?

kbrajwani · 2019-10-24T17:51:55Z

AM is not the correct sentence but my concern is not about correct prediction but i want to know why [D] is adding spaces in character.
Sometimes [D] is giving correct prediction

jonomon · 2019-10-24T17:53:49Z

Which code are you running and on what dataset?

kbrajwani · 2019-10-24T17:54:41Z

Latest code of your repository with pretrained models.

kbrajwani · 2019-10-24T17:55:09Z

Tesing on iam dataset images.

jonomon · 2019-10-24T17:57:25Z

are all predictions like this or this particular one?

kbrajwani · 2019-10-24T17:59:49Z

No, only first and last line of prediction is come like that.

jonomon · 2019-10-24T18:02:07Z

Did you make any changes?
I am do not see the lines you mentioned in https://github.com/awslabs/handwritten-text-recognition-for-apache-mxnet/blob/master/0_handwriting_ocr.ipynb.

kbrajwani · 2019-10-24T18:07:49Z

Yes i made lots of change in your code,
See following code generating output which i am talking about
decoded_line_denoiser = get_denoised(line_character_probs, ctc_bs=False)
print("[D ]",decoded_line_denoiser)

jonomon · 2019-10-24T18:21:58Z

This file presents the methods the denoiser was trained. The data was modelled after the noise associated with the previous steps of our model.

Judging from your example output "Torserane TVcom in Bankiner wo Finnnce", the noise seems quite different from our model and the words are not really recognisable. Most likely the pretrained denoiser wont work well.

I found that the better the handwriting recognition, the better the denoiser. It might be beneficial for you to first improve the handwriting (i.e., focus on improving the output [AM]) then work on the denoiser.

Please note that the output for our handwriting recognition is "Can't go lighting bonfites on this bus," and the denoiser only changed bonfites to bonfires.

kbrajwani · 2019-11-04T09:20:44Z

see in this image print(generator.generate_sequences(inputs, states, sentence)) this line is generating output like this
This sentnce has an eror
Choice
T h i s s e n t e n c e

output last line is putting one space after every character its your notebook file

jonomon · 2019-11-12T21:49:37Z

Did you edit any of the functions (denoiser.encode, generator.generate_sequences etc)?

devbaseh · 2020-01-07T13:16:59Z

Same Issue With me, the final prediction has spaces and gives only half of the sentence like this,

"This sentnce has an eror
Choice
T h i s s e n t e n c e "

I have just changed
"ctx_nlp = mx.gpu(3)" to "ctx_nlp = mx.gpu(0) if mx.context.num_gpus() > 0 else mx.cpu()"
I couldn't figure out why this happens...

raghav-menon · 2020-03-12T15:29:30Z

Hi, I am having the same problem too. Does this require more than one GPU to do the job. If I am not wrong ctx_nlp = mx.gpu(3) command indicates that the variable is being assigned to the 4th GPU in case there are 4 GPUs. If the line is left as such it gives out an error telling that the number should be 1 less than the number of GPU devices. Since I have only one, I had assigned it as ctx_nlp = mx.gpu(0). But that in turn cuts out half the sentence adding a space between the alphabets as already mentioned by a few here. Not sure whether it is exactly a GPU problem. Was wondering whether this can be run on a CPU rather !! Any help is appreciated!!

jonomon · 2020-03-12T21:08:04Z

@raghav-menon
It shouldn't matter whether it's on GPU or CPU.
Are you using the IAM dataset?

@devbaseh
Which notebook are you running?

raghav-menon · 2020-03-13T00:59:58Z

@raghav-menon
It shouldn't matter whether it's on GPU or CPU.
Are you using the IAM dataset?

Thank you for the quick reply. I am using the IAM dataset. I have tried using both CPU and GPU and the problem persists. The demoing output only gives partial sentences with a space in between. The system I am using is AWS with 64 GB of RAM and a Tesla K80 attached.

jonomon · 2020-03-13T03:07:56Z

@ThomasDelteil ?

raghav-menon · 2020-03-13T03:26:11Z

Would be grateful if you could advise me on how to solve it!! Thanks

ThomasDelteil · 2020-03-13T03:31:50Z

I'll try to have a look this weekend, have you tried retraining from the denoising notebook?

raghav-menon · 2020-03-13T04:01:35Z

I'll try to have a look this weekend, have you tried retraining from the denoising notebook?

Thanks Thomas. I have only used the trained model provided and ran the code. Have not tried retraining!!

mahin003 · 2020-09-27T19:09:57Z

If anybody executed it on Google colab ,please sharethe edited iam_dataset.py it with me , [email protected]

yangyingxiang · 2020-10-03T21:36:07Z

Same issue here, didn't modify any code besides changing gpu from 3 to 0:

"This sentnce has an eror
Choice
T h i s s e n t e n c e "

mahin003 · 2020-10-04T10:10:38Z

i'm facing this error ... Can u help me on this ?? i didnt find the problem

…

On Sun, Oct 4, 2020 at 3:36 AM yangyingxiang ***@***.***> wrote: Same issue here, didn't modify any code besides changing gpu from 3 to 0: "This sentnce has an eror Choice T h i s s e n t e n c e " — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#17 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AMR2YQOOWDI4QEGW44MPWYTSI6KNHANCNFSM4JEQEEXQ> .

jalvathi · 2021-07-23T13:16:33Z

@jonomon @ThomasDelteil Is there any update on the space issue we are getting in denoiser?? An update from you guys will make my day.

Thanks for this wonderful repository. 👍

jalvathi · 2021-07-24T02:14:07Z

@mahin003 @yangyingxiang @raghav-menon @devbaseh @kbrajwani

I had initiated a merge here

I hope, this solves your issues as well. I just got it solved in my code. :)

cc: @jonomon @ThomasDelteil

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

get_denoised adding spaces in output so half sentances is not coming in output #17

get_denoised adding spaces in output so half sentances is not coming in output #17

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Nov 4, 2019

jonomon commented Nov 12, 2019

devbaseh commented Jan 7, 2020

raghav-menon commented Mar 12, 2020 •

edited

Loading

jonomon commented Mar 12, 2020

raghav-menon commented Mar 13, 2020

jonomon commented Mar 13, 2020

raghav-menon commented Mar 13, 2020 •

edited

Loading

ThomasDelteil commented Mar 13, 2020

raghav-menon commented Mar 13, 2020

mahin003 commented Sep 27, 2020

yangyingxiang commented Oct 3, 2020

mahin003 commented Oct 4, 2020 via email

jalvathi commented Jul 23, 2021

jalvathi commented Jul 24, 2021 •

edited

Loading

get_denoised adding spaces in output so half sentances is not coming in output #17

get_denoised adding spaces in output so half sentances is not coming in output #17

Comments

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Oct 24, 2019

jonomon commented Oct 24, 2019

kbrajwani commented Nov 4, 2019

jonomon commented Nov 12, 2019

devbaseh commented Jan 7, 2020

raghav-menon commented Mar 12, 2020 • edited Loading

jonomon commented Mar 12, 2020

raghav-menon commented Mar 13, 2020

jonomon commented Mar 13, 2020

raghav-menon commented Mar 13, 2020 • edited Loading

ThomasDelteil commented Mar 13, 2020

raghav-menon commented Mar 13, 2020

mahin003 commented Sep 27, 2020

yangyingxiang commented Oct 3, 2020

mahin003 commented Oct 4, 2020 via email

jalvathi commented Jul 23, 2021

jalvathi commented Jul 24, 2021 • edited Loading

raghav-menon commented Mar 12, 2020 •

edited

Loading

raghav-menon commented Mar 13, 2020 •

edited

Loading

jalvathi commented Jul 24, 2021 •

edited

Loading