Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translation to Russian #109

Open
50 tasks done
artyomboyko opened this issue Jul 26, 2023 · 36 comments
Open
50 tasks done

Translation to Russian #109

artyomboyko opened this issue Jul 26, 2023 · 36 comments

Comments

@artyomboyko
Copy link
Contributor

artyomboyko commented Jul 26, 2023

Hi there 👋

Let's translate the course to Russian so that the whole community can benefit from this resource 🌎!

Below are the chapters and files that need translating - let us know here if you'd like to translate any and we'll add your name to the list. Once you're finished, open a pull request and tag this issue by including #issue-number in the description, where issue-number is the number of this issue.

🙋 If you'd like others to help you with the translation, you can also post in our forums or tag @_lewtun on Twitter to gain some visibility.

Chapters

UNIT 0. WELCOME TO THE COURSE!

UNIT 1. WORKING WITH AUDIO DATA

UNIT 2. A GENTLE INTRODUCTION TO AUDIO APPLICATIONS

UNIT 3. TRANSFORMER ARCHITECTURES FOR AUDIO

UNIT 4. BUILD A MUSIC GENRE CLASSIFIER

UNIT 5. AUTOMATIC SPEECH RECOGNITION

UNIT 6. From text to speech

UNIT 7. Putting it all together

UNIT 8. Finish line

Course Events

Adding extra material:

@artyomboyko
Copy link
Contributor Author

Good evening, I have added a file translation_agreements.txt in which I enter various words and phrases that need to be translated in a uniform way. This is necessary to maintain a uniform translation style in case there will be several translators.

@artyomboyko artyomboyko reopened this Jul 27, 2023
@artyomboyko artyomboyko changed the title Translation of the course into Russian Translation to Russian Jul 31, 2023
@artyomboyko
Copy link
Contributor Author

artyomboyko commented Jul 31, 2023

@lewtun Hello. Can you post a review? As soon as I finish translating Unit 1 I plan to push everything already translated into the official repository....

@Lightmourne
Copy link
Contributor

Hi! I can take chapter 2 for translation.

@artyomboyko
Copy link
Contributor Author

@Lightmourne Good news! The two of us can do more!

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good afternoon, Maria. Can you suggest who can do a review of our contribution to the repository?

@MKhalusova
Copy link
Contributor

@blademoon Thanks for initiating and organizing this effort! I can handle the reviews.

@artyomboyko
Copy link
Contributor Author

@MKhalusova That's great!

@artyomboyko
Copy link
Contributor Author

@MKhalusova I have one question, will there be an example in the course on how to train Whisper for multilingual ASR? For example how to fine-tune Whisper for two languages - English and Russian? That would be very good.

@MKhalusova
Copy link
Contributor

@blademoon Whisper is already multilingual, and you can further fine-tune on any language. In the course, we show how to fine-tune it on a language it wasn't trained on - Dhivehi. But you can apply the same principles to other languages.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good afternoon. Yes, the Whisper fine-tuning notebook is something I've already looked into. I thought that there is some difference if you fine-tune Whisper for two languages at the same time. After all, at least each language needs its own tokenizer configured accordingly. But so far, I don't understand how to do it....

@Lightmourne
Copy link
Contributor

Hi. Chapter 4 [BUILD A MUSIC GENRE CLASSIFIER] translated to Russian.

@Lightmourne
Copy link
Contributor

@blademoon i take chapter 5 for translation [AUTOMATIC SPEECH RECOGNITION].

@artyomboyko
Copy link
Contributor Author

@Lightmourne OK 😉

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good afternoon, Maria. The new part of the translation has been sent to PR #122

Besides, we have agreed with Sergey that after the translation is finished we will reread the whole course again (at the same time we will go through it to check everything) and make minor edits, this should improve the quality of our work.

@artyomboyko
Copy link
Contributor Author

@Lightmourne @MKhalusova Good afternoon. I decided to add a new marker to our task list - MINOR_FIX_DONE. This marker will be used to mark those files that we have reread and corrected minor errors.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good evening Maria, can you tell me what a "rainbow passage" is? Is it a book or?

@MKhalusova
Copy link
Contributor

@blademoon The rainbow passage is a specific piece of text (this one) that is often used in English language speech and voice research to assess different aspects of speech. It includes a variety of phonetic sounds and linguistic patterns that can help researchers understand how speech sounds are produced by individuals with different accents or speech characteristics.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Thank you, I will add your clarification to the translated version. It will seriously simplify understanding.

@artyomboyko
Copy link
Contributor Author

artyomboyko commented Aug 11, 2023

@MKhalusova In file "pre-trained_model.mdx" (Chapter 6):

## SpeechT5 

[SpeechT5](https://arxiv.org/abs/2110.07205) is a model published by Junyi Ao et al. from Microsoft that is capable of

Junyi Ao - is a person's name. But what this - et al? Could it be a typo?

@artyomboyko
Copy link
Contributor Author

@MKhalusova In Catalan et al means "and others"))) But I don't know Catalan)))

@MKhalusova
Copy link
Contributor

@blademoon et al. comes from Latin, and does mean "and others". It's often encountered in paper citations, and is very common in English:
https://www.merriam-webster.com/dictionary/et%20al.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good afternoon Maria, can you do a review of Sergei's PR #123 if possible. I am working hard on the translation of the last two units. Sergey is busy checking minor bugs and coordinating the translation of all our work. We are trying to bring the translation to the end and improve the quality. Thank you.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good afternoon Maria. A small question about the translation of the title House-keeping. Almost all translations known so far are related to housekeeping and agriculture. But this is clearly not it. The context of the content of the section itself didn't help much in choosing synonyms either.... Can you explain the meaning of this word combination more precisely?

@artyomboyko
Copy link
Contributor Author

@MKhalusova Good evening, Maria. For your convenience, my PR #124 should go after Sergey's PR #123.

@artyomboyko
Copy link
Contributor Author

@Lightmourne When the PR is accepted and you're ready, tag me. We'll take the course and make minor adjustments to the translation. Okay?

@artyomboyko
Copy link
Contributor Author

@MKhalusova Добрый день Мария ;) Можешь сориентировать, когда нам сюда вернуться чтобы "причесать" наш перевод) Получилось быстро перевести, но хотелось бы еще и согласовать по терминологии. Есть объективные недостатки и их нужно устранить.

@MKhalusova
Copy link
Contributor

@blademoon
Chapter 5 нужно починить (make style), затем можно мерджить.
Completion of the stage of translation of the course into Russian #124 я посмотрю сегодня или завтра.
Потом сверху отдельным PR можно добавить поправки после "причесания"

@artyomboyko
Copy link
Contributor Author

@MKhalusova мы так и планировали. Сначала смерджить, затем просто мелкие правки.

Отдельный вопрос, курс по RL переводить будут?

@artyomboyko
Copy link
Contributor Author

@MKhalusova по переводу одного предложения, погорячился, сделал лишний коммит. Ваш коммит делает тоже самое, его я тоже подтвердил. Просто видимо мне тоже нужно выспаться, а еще лучше в отпуск))) Спасибо вам за внимательность Мария!

@artyomboyko
Copy link
Contributor Author

artyomboyko commented Aug 30, 2023

@Lightmourne Have a great birthday and thanks so much for your help! 🍸

@MKhalusova
Copy link
Contributor

@blademoon PR с Chapter 5 смерджен. Почини, пожалуйста, конфликт в TOC в #124 и я посмотрю тогда на PR.
RL пока никто не переводил, если хотите, можете взяться за перевод :) если будут какие-то сложности, пингуйте меня.

@artyomboyko
Copy link
Contributor Author

@MKhalusova Добрый вечер Мария. Сделал.

@Lightmourne
Copy link
Contributor

@Lightmourne Have a great birthday and thanks so much for your help! 🍸

@blademoon, thanks for the congratulations! 😊

@artyomboyko
Copy link
Contributor Author

artyomboyko commented Sep 1, 2023

@MKhalusova @Lightmourne

Good evening, Maria and Sergei.

  1. Yesterday's additional materials have been translated.
  2. Regarding the RL course, here is the link
  3. Yesterday I started translating this course a bit, since there are no plans to translate it, I'm working in my repository for now.
  4. Since I won't be able to translate and edit at the same time, Sergey agreed to take on the task of editing the already translated material. Many thanks to him.

@artyomboyko
Copy link
Contributor Author

We need to add a translation from #137.

@artyomboyko
Copy link
Contributor Author

#137 Translated

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants