TM-Seq2seq
|
Deep Audio-Visual Speech Recognition
|
8.50
|
2018-09-06
|
|
TM-CTC
|
Deep Audio-Visual Speech Recognition
|
8.20
|
2018-09-06
|
|
CTC/Attention
|
Audio-Visual Speech Recognition With A Hybrid CTC…
|
7.00
|
2018-09-28
|
|
LF-MMI TDNN
|
Audio-visual Recognition of Overlapped speech for…
|
5.90
|
2020-01-06
|
|
End2end Conformer
|
End-to-end Audio-visual Speech Recognition with C…
|
3.70
|
2021-02-12
|
|
MoCo + wav2vec (w/o extLM)
|
Leveraging Unimodal Self-Supervised Learning for …
|
2.60
|
2022-02-24
|
|
CTC/Attention
|
Auto-AVSR: Audio-Visual Speech Recognition with A…
|
1.50
|
2023-03-25
|
|
Whisper-Flamingo
|
Whisper-Flamingo: Integrating Visual Features int…
|
1.40
|
2024-06-14
|
|