-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Reproducing the Experiment Results #2
Comments
Hi, thanks for your interest.
|
Thank you for the update! I have one more question. While trying to reproduce the results for German data, I encountered the For your reference, I am currently using Thank you for your help. |
I suspect that insufficient training might have caused the issue with text generation. I re-ran the model on my end and didn't encounter the error you mentioned. To address this, you may want to consider increasing the number of training epochs, for example, setting it to 3 or 5. Additionally, for the GEC task, the ROUGE metric is optional and can be removed directly from the source code (specifically, lines 456 to 465 and 407 to 416 in trainer.py). This is because the ROUGE metric is not necessary for the GEC task. |
Hi, thank you for your interesting work. I was trying to reproduce the experiment results using the code provided and had some questions.
train_and_eval.sh
script to reproduce the results for English dataset after generating speech data for development set (5.7K sentences) and test set (5.7K sentences) with a sample speech data (10K sentences) for training set to test the script. However, I have encountered "IndexError: Dimension specified as 0 but tensor has no dimensions" error. Below is the full error log. Would you be able to provide some guidance on how I can fix this error?soundfile
,transformers
,spacy
, have all not been mentioned in the README file. Would you provide the full list of dependencies, along with the version information, required to reproduce the experiment results?train_and_eval.sh
to train and evaluate for German dataset? I have looked attrain_and_eval.sh
andgec_speech_moe_mse/main.py
and they both seemed to be specific to English dataset. If it is not possible to run these code on German dataset, would it be possible to upload the code used to train and evaluate for German dataset?Any help would be greatly appreciated. Thank you!
The text was updated successfully, but these errors were encountered: