Visual Speech Recognition for Multiple Languages

Latest

Latest

mpc001 released this 09 Sep 15:03

· 11 commits to master since this release

14b3378

This is the repository of Visual Speech Recognition for Multiple Languages, which is the successor of End-to-End Audio-Visual Speech Recognition with Conformers. The repository is mainly based on ESPnet. We provide state-of-the-art algorithms for end-to-end visual speech recognition in the wild.

Assets 2