Tensorflow = 1.4.0
The speech stored in this git is enhanced by our 2D-SA.
We evaluate the performance on two datasets.
(1) An open-source dataset [1].
(2) A large-scale dataset (Designed and generated by ourselves).
In this Git, enhanced speech, models, and the enhanced edges are uploaded.
The details and scripts of training and testing are included in folder Scripts
In the folder appendix, more explanations about model structure and hyper-parameters will be added.
If you have questions please contact: Email: [email protected]
References:
[1] Cassia Valentini-Botinhao, Xin Wang, Shinji Takaki, and Junichi Yamagishi, “Investigating rnn-based speech enhancement methods for noise-robust text-to-speech,” in 9th ISCA Speech Synthesis Workshop, pp. 146–152.
A Pytorch implemention will be released soon.