VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech Recognition, Lip Reading | Unverified (0)
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | May 7, 2025 | Audio-Visual Speech Recognition, Lip Reading | Unverified (0)
Transforming faces into video stories -- VideoFace2.0 | May 4, 2025 | Face Detection, Face Recognition | Code Available (0)
Development and evaluation of a deep learning algorithm for German word recognition from lip movements | Apr 22, 2025 | Lip Reading, speech-recognition | Unverified (0)
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides | Apr 21, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified (0)
VALLR: Visual ASR Language Model for Lip Reading | Mar 27, 2025 | Automatic Speech Recognition, Language Modeling | Unverified (0)
Lend a Hand: Semi Training-Free Cued Speech Recognition via MLLM-Driven Hand Modeling for Barrier-free Communication | Mar 11, 2025 | Lip Reading, Prompt Engineering | Code Available (0)
Integrating Persian Lip Reading in Surena-V Humanoid Robot for Human-Robot Interaction | Jan 23, 2025 | Landmark Tracking, Lip Reading | Unverified (0)
GLaM-Sign: Greek Language Multimodal Lip Reading with Integrated Sign Language Accessibility | Jan 9, 2025 | Lip Reading, Sign Language Translation | Unverified (0)
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition | Jan 8, 2025 | Lip Reading, speech-recognition | Unverified (0)
Spatio-temporal Transformers for Action Unit Classification with Event Cameras | Oct 29, 2024 | Lip Reading | Unverified (0)
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective | Sep 29, 2024 | Audio-Visual Speech Recognition, Lip Reading | Unverified (0)
Neuromorphic Facial Analysis with Cross-Modal Supervision | Sep 16, 2024 | Lip Reading | Unverified (0)
RAL: Redundancy-Aware Lipreading Model Based on Differential Learning with Symmetric Views | Sep 9, 2024 | Lipreading, Lip Reading | Unverified (0)
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Sep 2, 2024 | Lip Reading, Sentence | Code Available (1)
Enhancing Speech-Driven 3D Facial Animation with Audio-Visual Guidance from Lip Reading Expert | Jul 1, 2024 | Lip Reading | Unverified (0)
Robust Multi-Modal Speech In-Painting: A Sequence-to-Sequence Approach | Jun 2, 2024 | Lip Reading, Multi-Task Learning | Unverified (0)
Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | May 9, 2024 | Audio-Visual Speech Recognition, Lipreading | Code Available (0)
Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | Descriptive, Gesture Generation | Unverified (0)
MTGA: Multi-View Temporal Granularity Aligned Aggregation for Event-Based Lip-Reading | Apr 18, 2024 | Lip Reading | Code Available (0)
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder | Apr 8, 2024 | Lipreading, Lip Reading | Unverified (0)
Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization | Mar 24, 2024 | Lip Reading | Unverified (0)
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing | Feb 23, 2024 | Lipreading, Lip Reading | Code Available (3)
Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading | Feb 18, 2024 | Lipreading, Lip Reading | Unverified (0)
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition | Jan 31, 2024 | Lip Reading, speech-recognition | Unverified (0)
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism | Dec 11, 2023 | Face Generation, Lip Reading | Code Available (1)
Do VSR Models Generalize Beyond LRS3? | Nov 23, 2023 | Lip Reading, speech-recognition | Code Available (1)
Exploring Lip Segmentation Techniques in Computer Vision: A Comparative Analysis | Nov 20, 2023 | Edge-computing, Lip Reading | Unverified (0)
DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation | Nov 8, 2023 | Lip Reading | Code Available (0)
Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading | Oct 8, 2023 | Lip Reading | Code Available (0)
End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition | Oct 7, 2023 | Domain Adaptation, Lip Reading | Unverified (0)
Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge | Aug 18, 2023 | Lip Reading | Unverified (0)
Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping | Aug 11, 2023 | Lip Reading, speech-recognition | Unverified (0)
Leveraging Visemes for Better Visual Speech Representation and Lip Reading | Jul 19, 2023 | Lip Reading, Sentence | Unverified (0)
SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces | Jun 19, 2023 | 3D Face Animation, Lip Reading | Code Available (1)
Emotional Speech-Driven Animation with Content-Emotion Disentanglement | Jun 15, 2023 | Disentanglement, Lip Reading | Unverified (0)
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment | Jun 10, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available (1)
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading | Jun 5, 2023 | Lip Reading | Code Available (1)
A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus | Jun 5, 2023 | Lip Reading | Code Available (0)
Deep Learning-based Spatio Temporal Facial Feature Visual Speech Recognition | Apr 30, 2023 | Deep Learning, Face Recognition | Unverified (0)
PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors | Apr 11, 2023 | Gesture Recognition, Hand Gesture Recognition | Unverified (0)
Word-level Persian Lipreading Dataset | Apr 8, 2023 | Lipreading, Lip Reading | Unverified (0)
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Mar 30, 2023 | Lip Reading, speech-recognition | Unverified (0)
Seeing What You Said: Talking Face Generation Guided by a Lip Reading Expert | Mar 29, 2023 | Contrastive Learning, Face Generation | Code Available (2)
A large-scale multimodal dataset of human speech recognition | Mar 15, 2023 | Lip Reading, Motion Detection | Unverified (0)
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition | Mar 9, 2023 | Lip Reading, Machine Translation | Code Available (1)
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices | Feb 17, 2023 | Audio-Visual Speech Recognition, Gesture Recognition | Unverified (0)
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis | Jan 31, 2023 | Face Generation, Lip Reading | Code Available (4)
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified (0)
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset | Jan 16, 2023 | Audio-Visual Speech Recognition, Lip Reading | Code Available (1)