-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Transliteration from Arabic not working for continuous text #7
Comments
Should I implement preprocessing and postprocessing functions in this case? It is like tokenizing continuous text in preprocessing and concat the transliteration results in postprocessing. |
I think it’d be best to find out WHY it’s happening. There are multiple modules:
I could try to find out how to deal with this, or you might :) We ought |
I would add that when the language is set as |
Yeah, there are a few different Arabic-script transliterators and the whole notion of Arabic needs some special handling in our Py code. |
Transliteration from Arabic is not working for continuous text. It works for single space-separated characters. The Arabic Wiktionary module is a bit complex, need to investigate and add some special processing.
The text was updated successfully, but these errors were encountered: