Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder Apr 8, 2024 Lipreading Lip Reading
— Unverified 0Landmark-Guided Cross-Speaker Lip Reading with Mutual Information Regularization Mar 24, 2024 Lip Reading
— Unverified 0Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading Feb 18, 2024 Lipreading Lip Reading
— Unverified 0Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition Jan 31, 2024 Lip Reading speech-recognition
— Unverified 0Exploring Lip Segmentation Techniques in Computer Vision: A Comparative Analysis Nov 20, 2023 Edge-computing Lip Reading
— Unverified 0DualTalker: A Cross-Modal Dual Learning Approach for Speech-Driven 3D Facial Animation Nov 8, 2023 Lip Reading
Code Code Available 0Learning Separable Hidden Unit Contributions for Speaker-Adaptive Lip-Reading Oct 8, 2023 Lip Reading
Code Code Available 0End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition Oct 7, 2023 Domain Adaptation Lip Reading
— Unverified 0Lip Reading for Low-resource Languages by Learning and Combining General Speech Knowledge and Language-specific Knowledge Aug 18, 2023 Lip Reading
— Unverified 0Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping Aug 11, 2023 Lip Reading speech-recognition
— Unverified 0Leveraging Visemes for Better Visual Speech Representation and Lip Reading Jul 19, 2023 Lip Reading Sentence
— Unverified 0Emotional Speech-Driven Animation with Content-Emotion Disentanglement Jun 15, 2023 Disentanglement Lip Reading
— Unverified 0A Novel Interpretable and Generalizable Re-synchronization Model for Cued Speech based on a Multi-Cuer Corpus Jun 5, 2023 Lip Reading
Code Code Available 0Deep Learning-based Spatio Temporal Facial Feature Visual Speech Recognition Apr 30, 2023 Deep Learning Face Recognition
— Unverified 0PixelRNN: In-pixel Recurrent Neural Networks for End-to-end-optimized Perception with Neural Sensors Apr 11, 2023 Gesture Recognition Hand Gesture Recognition
— Unverified 0Word-level Persian Lipreading Dataset Apr 8, 2023 Lipreading Lip Reading
— Unverified 0SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision Mar 30, 2023 Lip Reading speech-recognition
— Unverified 0A large-scale multimodal dataset of human speech recognition Mar 15, 2023 Lip Reading Motion Detection
— Unverified 0Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices Feb 17, 2023 Audio-Visual Speech Recognition Gesture Recognition
— Unverified 0A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset Jan 21, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Speech Driven Video Editing via an Audio-Conditioned Diffusion Model Jan 10, 2023 Denoising Face Model
— Unverified 0Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss Nov 20, 2022 Face Hallucination Generative Adversarial Network
— Unverified 0Lip Sync Matters: A Novel Multimodal Forgery Detector Nov 7, 2022 DeepFake Detection Face Swapping
Code Code Available 0Streaming Audio-Visual Speech Recognition with Alignment Regularization Nov 3, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation Oct 24, 2022 Action Recognition Lip Reading
— Unverified 0A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids Oct 24, 2022 Lip Reading Speech Enhancement
— Unverified 0VCSE: Time-Domain Visual-Contextual Speaker Extraction Network Oct 9, 2022 Lip Reading
— Unverified 0Relaxed Attention for Transformer Models Sep 20, 2022 Decoder Image Classification
Code Code Available 0Visual Speech Recognition in a Driver Assistance System Aug 29, 2022 Data Augmentation Lipreading
— Unverified 0Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale Aug 21, 2022 Lipreading Lip Reading
— Unverified 0Speaker-adaptive Lip Reading with User-dependent Padding Aug 9, 2022 Lip Reading speech-recognition
Code Code Available 0Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models Jun 5, 2022 Knowledge Distillation Lipreading
— Unverified 0Learning Speaker-specific Lip-to-Speech Generation Jun 4, 2022 Decoder Lip Reading
— Unverified 0RUSAVIC Corpus: Russian Audio-Visual Speech in Cars Jun 1, 2022 Audio-Visual Speech Recognition Lip Reading
— Unverified 0Expression-preserving face frontalization improves visually assisted speech processing Apr 6, 2022 Face Model Lip Reading
— Unverified 0A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning Feb 27, 2022 Lip Reading Transfer Learning
— Unverified 0Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading Jan 1, 2022 Action Recognition Lip Reading
— Unverified 0LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading Dec 9, 2021 Decoder Lip Reading
— Unverified 0Audio-Visual Synchronisation in the wild Dec 8, 2021 Lip Reading
— Unverified 0Contrastive Learning of Global and Local Video Representations Dec 1, 2021 Classification Contrastive Learning
— Unverified 0Leveraging Uni-Modal Self-Supervised Learning for Multimodal Audio-visual Speech Recognition Nov 16, 2021 Audio-Visual Speech Recognition Language Modelling
— Unverified 0Advances and Challenges in Deep Lip Reading Oct 15, 2021 Deep Learning Lip Reading
— Unverified 0Sub-word Level Lip Reading With Visual Attention Oct 14, 2021 Audio-Visual Active Speaker Detection Automatic Speech Recognition
— Unverified 0Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks Oct 13, 2021 Lip Reading speech-recognition
— Unverified 0Audio-Visual Speech Recognition is Worth 32328 Voxels Sep 20, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0LRWR: Large-Scale Benchmark for Lip Reading in Russian language Sep 14, 2021 Lipreading Lip Reading
— Unverified 0SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory Aug 31, 2021 Lip Reading
— Unverified 0Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading Aug 16, 2021 Landmark-based Lipreading Lip Reading
— Unverified 0Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading Aug 7, 2021 Audio-Visual Speech Recognition Knowledge Distillation
— Unverified 0