GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis Jan 31, 2023 Face Generation Lip Reading
Code Code Available 4Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Feb 23, 2024 Lipreading Lip Reading
Code Code Available 3Training Strategies for Improved Lip-reading Sep 3, 2022 Data Augmentation Lipreading
Code Code Available 2Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert Mar 29, 2023 Contrastive Learning Face Generation
Code Code Available 2Deformation Flow Based Two-Stream Network for Lip Reading Mar 12, 2020 Knowledge Distillation Lipreading
Code Code Available 1OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment Jun 10, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 1Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces Jun 19, 2023 3D Face Animation Lip Reading
Code Code Available 1Selective Listening by Synchronizing Speech with Lips Jun 14, 2021 Lip Reading Target Speaker Extraction
Code Code Available 1End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading Apr 4, 2022 Lipreading Lip Reading
Code Code Available 1Contrastive Learning of Global-Local Video Representations Apr 7, 2021 Classification Contrastive Learning
Code Code Available 1Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis May 17, 2020 Lip Reading Lip to Speech Synthesis
Code Code Available 1Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Mutual Information Maximization for Effective Lip Reading Mar 13, 2020 Lipreading Lip Reading
Code Code Available 1MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition Mar 9, 2023 Lip Reading Machine Translation
Code Code Available 1Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism Dec 11, 2023 Face Generation Lip Reading
Code Code Available 1Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language Sep 2, 2024 Lip Reading Sentence
Code Code Available 1Do VSR Models Generalize Beyond LRS3? Nov 23, 2023 Lip Reading speech-recognition
Code Code Available 1Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 1Visual Keyword Spotting with Attention Oct 29, 2021 Lip Reading Visual Keyword Spotting
Code Code Available 1Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Learn an Effective Lip Reading Model without Pains Nov 15, 2020 Lipreading Lip Reading
Code Code Available 1Seeing wake words: Audio-visual Keyword Spotting Sep 2, 2020 Keyword Spotting Lip Reading
Code Code Available 1Lipreading using Temporal Convolutional Networks Jan 23, 2020 Lipreading Lip Reading
Code Code Available 1OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset Jan 16, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 1Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video Apr 4, 2022 Lip Reading
Code Code Available 1LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading Jun 5, 2023 Lip Reading
Code Code Available 1Lip-reading with Densely Connected Temporal Convolutional Networks Sep 29, 2020 Lip Reading
Code Code Available 1End-to-End Speech-Driven Facial Animation with Temporal GANs May 23, 2018 Lip Reading
Code Code Available 1Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading Feb 18, 2024 Lipreading Lip Reading
— Unverified 0Contrastive Self-Supervised Learning of Global-Local Audio-Visual Representations Jan 1, 2021 Classification DeepFake Detection
— Unverified 0An Empirical Analysis of Deep Audio-Visual Models for Speech Recognition Dec 21, 2018 Lip Reading Sensitivity
— Unverified 0Contrastive Learning of Global and Local Video Representations Dec 1, 2021 Classification Contrastive Learning
— Unverified 0Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advances and Challenges in Deep Lip Reading Oct 15, 2021 Deep Learning Lip Reading
— Unverified 0Contextual Audio-Visual Switching For Speech Enhancement in Real-World Environments Aug 28, 2018 Lip Reading Speech Enhancement
— Unverified 0Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition Jan 31, 2024 Lip Reading speech-recognition
— Unverified 0Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices Feb 17, 2023 Audio-Visual Speech Recognition Gesture Recognition
— Unverified 0Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation Oct 24, 2022 Action Recognition Lip Reading
— Unverified 0End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Apr 27, 2021 Lip Reading Speech Synthesis
— Unverified 0A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset Jan 21, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations Jul 26, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Finding phonemes: improving machine lip-reading Oct 3, 2017 Lip Reading Phoneme Recognition
— Unverified 0End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition Oct 7, 2023 Domain Adaptation Lip Reading
— Unverified 0Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides Apr 21, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder Apr 8, 2024 Lipreading Lip Reading
— Unverified 0Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert Jul 1, 2024 Lip Reading
— Unverified 0Emotional Speech-Driven Animation with Content-Emotion Disentanglement Jun 15, 2023 Disentanglement Lip Reading
— Unverified 0