Transliteration from Arabic not working for continuous text #7

twardoch · 2021-08-11T02:12:40Z

Transliteration from Arabic is not working for continuous text. It works only for single, space-separated characters. The Arabic Wiktionary module is a bit complex, so we need to investigate and add some special processing.

kbatsuren · 2021-08-12T15:01:18Z

Should I implement preprocessing and postprocessing functions in this case? The idea would be to tokenize the continuous text in preprocessing and concatenate the transliteration results in postprocessing.
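
For illustration, here is a rough Python sketch of that tokenize-then-concatenate idea. The names `split_units` and `transliterate_continuous` are hypothetical and not part of the project, and `tr_unit` is a stand-in for whatever per-character transliteration call already works today. It assumes that transliterating one unit at a time is acceptable, which may not hold if the Arabic module relies on the surrounding letters.

```python
import unicodedata
from typing import Callable, List


def split_units(text: str) -> List[str]:
    """Preprocessing: split text into single characters, keeping combining
    marks (such as Arabic short-vowel diacritics) attached to their base."""
    units: List[str] = []
    for ch in text:
        # unicodedata.combining() returns a non-zero class for combining marks.
        if units and unicodedata.combining(ch) and not units[-1].isspace():
            units[-1] += ch
        else:
            units.append(ch)
    return units


def transliterate_continuous(text: str, tr_unit: Callable[[str], str]) -> str:
    """Transliterate each unit separately, then concatenate the results
    (postprocessing). Whitespace is passed through unchanged.

    tr_unit is a hypothetical callback wrapping the existing single-character
    transliteration; it is an assumption of this sketch."""
    return "".join(u if u.isspace() else tr_unit(u) for u in split_units(text))
```

This only works around the symptom: each unit is transliterated without its neighbours, so any context-dependent rules in the underlying module would be lost.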

twardoch · 2021-08-20T20:15:54Z

I think it’d be best to find out WHY it’s happening. There are multiple modules involved:

ar-translit has an unusual tr function: function export.tr(text, lang, sc, omit_i3raab, gray_i3raab, force_translit).

I could try to find out how to deal with this, or you might :)

We ought

skalyan91 · 2021-08-29T09:38:01Z

I would add that when the language is set as fas (Persian), even single letters are not transliterated.

twardoch · 2021-08-29T22:01:51Z

Yeah, there are a few different Arabic-script transliterators, and the whole notion of Arabic needs some special handling in our Python code.
