-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Trying to add Ukrainian and failing miserably #73
Comments
You did everything right (except setting max memory to 16000 GB 😅). Words are likely not getting matched because wiktionary has diacritics on the headwords, and they aren't getting handled: As for the skipped term tags/parts of speech, that's normal. The parts of speech don't matter unless/until there are deinflection rules written for that language. Adding tags to a P.S. I remember reading this issue of yours back when the official policy in the yomitan readme was 'no other languages'. I might not have even tried to merge my fork with yomitan and do all this if it wasn't for that hint that there would be support for it, so thanks 🙏 |
Thanks!! Just copying the Russian |
Feel free to PR, then Ukrainian dicts will be included automatically from the next release! Also check out the language docs to properly add Ukrainian to Yomitan. Texts with no diacritics or full diacritics should work with these dicts, but you'll want to add the same diacritics processing to yomitan (like yomidevs/yomitan#1057) so texts with partial diacritics and other dicts will work. |
Yeah, normally I would be really into doing something like that but I'm just doing this for a friend who is learning Ukrainian. I don't have any knowledge of Ukrainian (the most I can do is read the Russian alphabet and read a few basic words) so just getting a dictionary set up is sufficient for my needs. |
Thanks for creating this project! I'm trying to add Ukrainian, here's what I got so far:
{"iso": "uk", "language": "Ukrainian", "flag": "🇺🇦"},
tolanguages.json
ran
./auto.sh Ukrainian English
This creates 2 zips, which if I put into Yomitan, suck. If you go to a random Ukrainian wiki page, very few of the words highlight, including words that are for sure in kaikki like критика.
We are skipping a ton of term tags, e.g.
etc.
as well as skipped parts of speech
so maybe this is part of the problem. Look forward to any advice on how to resolve!
The text was updated successfully, but these errors were encountered: