SyncVSR (Word Boundary)
|
SyncVSR: Data-Efficient Visual Speech Recognition…
|
58.20
|
2024-06-18
|
|
3D Conv + ResNet-18 + MS-TCN + Multi-Head Visual-Audio Memory
|
Distinguishing Homophenes Using Multi-Head Visual…
|
53.80
|
2022-04-04
|
|
3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR (Word Boundary)
|
Learn an Effective Lip Reading Model without Pains
|
|
2020-11-15
|
|
3D Conv + ResNet-18 + Bi-GRU + Visual-Audio Memory
|
Multi-modality Associative Bridging through Memor…
|
|
2022-04-04
|
|
3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR
|
Learn an Effective Lip Reading Model without Pains
|
|
2020-11-15
|
|
3D Conv + ResNet-18 + Bi-GRU (Face Cutout)
|
Can We Read Speech Beyond the Lips? Rethinking Ro…
|
|
2020-03-06
|
|
DFTN
|
Deformation Flow Based Two-Stream Network for Lip…
|
|
2020-03-12
|
|
GLMIM
|
Mutual Information Maximization for Effective Lip…
|
|
2020-03-13
|
|
PCPG
|
Pseudo-Convolutional Policy Gradient for Sequence…
|
|
2020-03-09
|
|