GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Jan 31, 2023 | Face Generation, Lip Reading | Code Available | 4
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing | Feb 23, 2024 | Lipreading, Lip Reading | Code Available | 3
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert | Mar 29, 2023 | Contrastive Learning, Face Generation | Code Available | 2
Training Strategies for Improved Lip-reading | Sep 3, 2022 | Data Augmentation, Lipreading | Code Available | 2
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction | Jan 5, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Sep 2, 2024 | Lip Reading, Sentence | Code Available | 1
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism | Dec 11, 2023 | Face Generation, Lip Reading | Code Available | 1
Do VSR Models Generalize Beyond LRS3? | Nov 23, 2023 | Lip Reading, speech-recognition | Code Available | 1
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces | Jun 19, 2023 | 3D Face Animation, Lip Reading | Code Available | 1
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment | Jun 10, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available | 1
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading | Jun 5, 2023 | Lip Reading | Code Available | 1
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition | Mar 9, 2023 | Lip Reading, Machine Translation | Code Available | 1
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset | Jan 16, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available | 1
Audio-Visual Efficient Conformer for Robust Speech Recognition | Jan 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video | Apr 4, 2022 | Lip Reading | Code Available | 1
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading | Apr 4, 2022 | Lipreading, Lip Reading | Code Available | 1
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Feb 24, 2022 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Visual Keyword Spotting with Attention | Oct 29, 2021 | Lip Reading, Visual Keyword Spotting | Code Available | 1
Selective Listening by Synchronizing Speech with Lips | Jun 14, 2021 | Lip Reading, Target Speaker Extraction | Code Available | 1
Contrastive Learning of Global-Local Video Representations | Apr 7, 2021 | Classification, Contrastive Learning | Code Available | 1
End-to-end Audio-visual Speech Recognition with Conformers | Feb 12, 2021 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Learn an Effective Lip Reading Model without Pains | Nov 15, 2020 | Lipreading, Lip Reading | Code Available | 1
Lip-reading with Densely Connected Temporal Convolutional Networks | Sep 29, 2020 | Lip Reading | Code Available | 1
Seeing wake words: Audio-visual Keyword Spotting | Sep 2, 2020 | Keyword Spotting, Lip Reading | Code Available | 1
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis | May 17, 2020 | Lip Reading, Lip to Speech Synthesis | Code Available | 1
Mutual Information Maximization for Effective Lip Reading | Mar 13, 2020 | Lipreading, Lip Reading | Code Available | 1
Deformation Flow Based Two-Stream Network for Lip Reading | Mar 12, 2020 | Knowledge Distillation, Lipreading | Code Available | 1
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | Mar 6, 2020 | Lipreading, Lip Reading | Code Available | 1
Lipreading using Temporal Convolutional Networks | Jan 23, 2020 | Lipreading, Lip Reading | Code Available | 1
Deep Audio-Visual Speech Recognition | Sep 6, 2018 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
End-to-End Speech-Driven Facial Animation with Temporal GANs | May 23, 2018 | Lip Reading | Code Available | 1
VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech Recognition, Lip Reading | Unverified | 0
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | May 7, 2025 | Audio-Visual Speech Recognition, Lip Reading | Unverified | 0
Transforming faces into video stories -- VideoFace2.0 | May 4, 2025 | Face Detection, Face Recognition | Code Available | 0
Development and evaluation of a deep learning algorithm for German word recognition from lip movements | Apr 22, 2025 | Lip Reading, speech-recognition | Unverified | 0
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides | Apr 21, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0
VALLR: Visual ASR Language Model for Lip Reading | Mar 27, 2025 | Automatic Speech Recognition, Language Modeling | Unverified | 0
Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication | Mar 11, 2025 | Lip Reading, Prompt Engineering | Code Available | 0
Integrating Persian Lip Reading in Surena-V Humanoid Robot for Human-Robot Interaction | Jan 23, 2025 | Landmark Tracking, Lip Reading | Unverified | 0
GLaM-Sign: Greek Language Multimodal Lip Reading with Integrated Sign Language Accessibility | Jan 9, 2025 | Lip Reading, Sign Language Translation | Unverified | 0
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Jan 8, 2025 | Lip Reading, speech-recognition | Unverified | 0
Spatio-temporal Transformers for Action Unit Classification with Event Cameras | Oct 29, 2024 | Lip Reading | Unverified | 0
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective | Sep 29, 2024 | Audio-Visual Speech Recognition, Lip Reading | Unverified | 0
Neuromorphic Facial Analysis with Cross-Modal Supervision | Sep 16, 2024 | Lip Reading | Unverified | 0
RAL:Redundancy-Aware Lipreading Model Based on Differential Learning with Symmetric Views | Sep 9, 2024 | Lipreading, Lip Reading | Unverified | 0
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert | Jul 1, 2024 | Lip Reading | Unverified | 0
Robust Multi-Modal Speech In-Painting: A Sequence-to-Sequence Approach | Jun 2, 2024 | Lip Reading, Multi-Task Learning | Unverified | 0
Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | May 9, 2024 | Audio-Visual Speech Recognition, Lipreading | Code Available | 0
Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | Descriptive, Gesture Generation | Unverified | 0
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading | Apr 18, 2024 | Lip Reading | Code Available | 0