ML Research Wiki / Benchmarks / Lipreading / CAS-VSR-W1k (LRW-1000)

CAS-VSR-W1k (LRW-1000)

Lipreading Benchmark

Performance Over Time

📊 Showing 9 results | 📏 Metric: Top-1 Accuracy

Top Performing Models

Rank Model Paper Top-1 Accuracy Date Code
1 SyncVSR (Word Boundary) SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization 58.20 2024-06-18 📦 KAIST-AILab/SyncVSR
2 3D Conv + ResNet-18 + MS-TCN + Multi-Head Visual-Audio Memory Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading 53.80 2022-04-04 📦 ms-dot-k/Multi-head-Visual-Audio-Memory
3 3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR (Word Boundary) Learn an Effective Lip Reading Model without Pains 0.00 2020-11-15 📦 Fengdalu/learn-an-effective-lip-reading-model-without-pains
4 3D Conv + ResNet-18 + Bi-GRU + Visual-Audio Memory Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video 0.00 2022-04-04 📦 ms-dot-k/Visual-Audio-Memory
5 3D-ResNet + Bi-GRU + MixUp + Label Smooth + Cosine LR Learn an Effective Lip Reading Model without Pains 0.00 2020-11-15 📦 Fengdalu/learn-an-effective-lip-reading-model-without-pains
6 3D Conv + ResNet-18 + Bi-GRU (Face Cutout) 📚 Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition 0.00 2020-03-06 📦 sailordiary/deep-face-vsr
7 DFTN Deformation Flow Based Two-Stream Network for Lip Reading 0.00 2020-03-12 📦 jingyunx/Deformation-Flow-Based-Two-stream-Network
8 GLMIM Mutual Information Maximization for Effective Lip Reading 0.00 2020-03-13 📦 xing96/MIM-lipreading
9 PCPG Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading 0.00 2020-03-09 -

All Papers (9)