GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Jan 31, 2023 Face Generation Lip Reading
Code Code Available 45 Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Feb 23, 2024 Lipreading Lip Reading
Code Code Available 35 Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert Mar 29, 2023 Contrastive Learning Face Generation
Code Code Available 25 Training Strategies for Improved Lip-reading Sep 3, 2022 Data Augmentation Lipreading
Code Code Available 25 Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language Sep 2, 2024 Lip Reading Sentence
Code Code Available 15 Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Do VSR Models Generalize Beyond LRS3? Nov 23, 2023 Lip Reading speech-recognition
Code Code Available 15 Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 15 Visual Keyword Spotting with Attention Oct 29, 2021 Lip Reading Visual Keyword Spotting
Code Code Available 15 Selective Listening by Synchronizing Speech with Lips Jun 14, 2021 Lip Reading Target Speaker Extraction
Code Code Available 15 Contrastive Learning of Global-Local Video Representations Apr 7, 2021 Classification Contrastive Learning
Code Code Available 15 Learn an Effective Lip Reading Model without Pains Nov 15, 2020 Lipreading Lip Reading
Code Code Available 15 Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading Apr 4, 2022 Lipreading Lip Reading
Code Code Available 15 Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset Jan 16, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 15 End-to-End Speech-Driven Facial Animation with Temporal GANs May 23, 2018 Lip Reading
Code Code Available 15 Mutual Information Maximization for Effective Lip Reading Mar 13, 2020 Lipreading Lip Reading
Code Code Available 15 End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition Mar 9, 2023 Lip Reading Machine Translation
Code Code Available 15 LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading Jun 5, 2023 Lip Reading
Code Code Available 15 OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment Jun 10, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 15 SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces Jun 19, 2023 3D Face Animation Lip Reading
Code Code Available 15 Seeing wake words: Audio-visual Keyword Spotting Sep 2, 2020 Keyword Spotting Lip Reading
Code Code Available 15 Lipreading using Temporal Convolutional Networks Jan 23, 2020 Lipreading Lip Reading
Code Code Available 15 Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis May 17, 2020 Lip Reading Lip to Speech Synthesis
Code Code Available 15 Deformation Flow Based Two-Stream Network for Lip Reading Mar 12, 2020 Knowledge Distillation Lipreading
Code Code Available 15 Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism Dec 11, 2023 Face Generation Lip Reading
Code Code Available 15 Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video Apr 4, 2022 Lip Reading
Code Code Available 15 Lip-reading with Densely Connected Temporal Convolutional Networks Sep 29, 2020 Lip Reading
Code Code Available 15 Talking Face Generation by Adversarially Disentangled Audio-Visual Representation Jul 20, 2018 Face Generation Lip Reading
Code Code Available 05 Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems May 9, 2024 Audio-Visual Speech Recognition Lipreading
Code Code Available 05 Speaker-adaptive Lip Reading with User-dependent Padding Aug 9, 2022 Lip Reading speech-recognition
Code Code Available 05 Synchronous Bidirectional Learning for Multilingual Lip Reading May 8, 2020 Lip Reading
Code Code Available 05 Transforming faces into video stories -- VideoFace2.0 May 4, 2025 Face Detection Face Recognition
Code Code Available 05 Combining Residual Networks with LSTMs for Lipreading Mar 12, 2017 Lipreading Lip Reading
Code Code Available 05 Relaxed Attention for Transformer Models Sep 20, 2022 Decoder Image Classification
Code Code Available 05 MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading Apr 18, 2024 Lip Reading
Code Code Available 05 Multi-Perspective LSTM for Joint Visual Representation Learning May 6, 2021 Face Recognition Lip Reading
Code Code Available 05 DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation Nov 8, 2023 Lip Reading
Code Code Available 05 Lip Sync Matters: A Novel Multimodal Forgery Detector Nov 7, 2022 DeepFake Detection Face Swapping
Code Code Available 05 LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild Oct 16, 2018 Lipreading Lip Reading
Code Code Available 05 AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements Dec 4, 2020 Benchmarking Lip password classification
Code Code Available 05 Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading Oct 8, 2023 Lip Reading
Code Code Available 05 A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus Jun 5, 2023 Lip Reading
Code Code Available 05 Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication Mar 11, 2025 Lip Reading Prompt Engineering
Code Code Available 05 Lip2AudSpec: Speech reconstruction from silent lip movements video Oct 26, 2017 Lip Reading
Code Code Available 05 Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers Nov 26, 2019 Knowledge Distillation Lipreading
Code Code Available 05 Estimating speech from lip dynamics Aug 3, 2017 Lip Reading Position
Code Code Available 05