Is there an audio-visual Chinese model? #15

cooelf · 2023-06-01T11:55:29Z

Thanks for releasing the awesome work! I noticed that the Chinese lip reading model is based on the visual modality. I used the visual model but it achieved poor performance on the example video clips like #5. Is there an audio-visual version that hopefully achieves better results?

Thanks.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there an audio-visual Chinese model? #15

Is there an audio-visual Chinese model? #15

cooelf commented Jun 1, 2023

Is there an audio-visual Chinese model? #15

Is there an audio-visual Chinese model? #15

Comments

cooelf commented Jun 1, 2023