Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing | Feb 23, 2024 | Lipreading, Lip Reading | Code Available | 3
SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization | Jun 18, 2024 | Landmark-based Lipreading, Lipreading | Code Available | 2
Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels | Mar 25, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 2
Training Strategies for Improved Lip-reading | Sep 3, 2022 | Data Augmentation, Lipreading | Code Available | 2
Visual Speech Recognition for Multiple Languages in the Wild | Feb 26, 2022 | Hyperparameter Optimization, Lipreading | Code Available | 2
Robust Self-Supervised Audio-Visual Speech Recognition | Jan 5, 2022 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 2
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction | Jan 5, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Feb 9, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 1
Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs | Nov 4, 2024 | Lipreading, speech-recognition | Code Available | 1
Watch Your Mouth: Silent Speech Recognition with Depth Sensing | May 11, 2024 | Deep Learning, Lipreading | Code Available | 1
LipLearner: Customizable Silent Speech Interactions on Mobile Devices | Feb 12, 2023 | Contrastive Learning, Incremental Learning | Code Available | 1
Jointly Learning Visual and Auditory Speech Representations from Raw Data | Dec 12, 2022 | Audio-Visual Speech Recognition, Lipreading | Code Available | 1
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading | Apr 4, 2022 | Lipreading, Lip Reading | Code Available | 1
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Feb 24, 2022 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
End-to-end Audio-visual Speech Recognition with Conformers | Feb 12, 2021 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection | Dec 14, 2020 | DeepFake Detection, Lipreading | Code Available | 1
Learn an Effective Lip Reading Model without Pains | Nov 15, 2020 | Lipreading, Lip Reading | Code Available | 1
Towards Practical Lipreading with Distilled and Efficient Models | Jul 13, 2020 | Knowledge Distillation, Lipreading | Code Available | 1
Discriminative Multi-modality Speech Recognition | May 12, 2020 | Audio-Visual Speech Recognition, Lipreading | Code Available | 1
Mutual Information Maximization for Effective Lip Reading | Mar 13, 2020 | Lipreading, Lip Reading | Code Available | 1
Deformation Flow Based Two-Stream Network for Lip Reading | Mar 12, 2020 | Knowledge Distillation, Lipreading | Code Available | 1
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | Mar 6, 2020 | Lipreading, Lip Reading | Code Available | 1
Lipreading using Temporal Convolutional Networks | Jan 23, 2020 | Lipreading, Lip Reading | Code Available | 1
Deep Audio-Visual Speech Recognition | Sep 6, 2018 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
LipNet: End-to-End Sentence-level Lipreading | Nov 5, 2016 | General Classification, Lipreading | Code Available | 1
Learning Speaker-Invariant Visual Features for Lipreading | Jun 9, 2025 | Disentanglement, Lipreading | Unverified | 0
UniCUE: Unified Recognition and Generation Framework for Chinese Cued Speech Video-to-Speech Generation | Jun 4, 2025 | cross-modal alignment, Lipreading | Unverified | 0
OXSeg: Multidimensional attention UNet-based lip segmentation using semi-supervised lip contours | May 8, 2025 | Generative Adversarial Network, Lipreading | Unverified | 0
Target Speaker Lipreading by Audio-Visual Self-Distillation Pretraining and Speaker Adaptation | Feb 9, 2025 | Cross-Lingual Transfer, Lipreading | Unverified | 0
Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions | Feb 1, 2025 | Lipreading, speech-recognition | Code Available | 0
RAL: Redundancy-Aware Lipreading Model Based on Differential Learning with Symmetric Views | Sep 9, 2024 | Lipreading, Lip Reading | Unverified | 0
Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | May 9, 2024 | Audio-Visual Speech Recognition, Lipreading | Code Available | 0
Enhancing Lip Reading with Multi-Scale Video and Multi-Encoder | Apr 8, 2024 | Lipreading, Lip Reading | Unverified | 0
Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading | Feb 18, 2024 | Lipreading, Lip Reading | Unverified | 0
ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations | Jan 1, 2024 | Audio-Visual Speech Recognition, Lipreading | Unverified | 0
Analysis of Visual Features for Continuous Lipreading in Spanish | Nov 21, 2023 | Lipreading, speech-recognition | Unverified | 0
Investigating the dynamics of hand and lips in French Cued Speech using attention mechanisms and CTC-based decoding | Jun 14, 2023 | Lipreading | Unverified | 0
Audio-Visual Speech Enhancement with Score-Based Generative Models | Jun 2, 2023 | Automatic Speech Recognition, Lipreading | Unverified | 0
Word-level Persian Lipreading Dataset | Apr 8, 2023 | Lipreading, Lip Reading | Unverified | 0
Conformers are All You Need for Visual Speech Recognition | Feb 17, 2023 | All, Lipreading | Unverified | 0
LipFormer: Learning to Lipread Unseen Speakers based on Visual-Landmark Transformers | Feb 4, 2023 | Lipreading, Sentence | Unverified | 0
Relaxed Attention for Transformer Models | Sep 20, 2022 | Decoder, Image Classification | Code Available | 0
Visual Speech Recognition in a Driver Assistance System | Aug 29, 2022 | Data Augmentation, Lipreading | Unverified | 0
Bayesian Neural Network Language Modeling for Speech Recognition | Aug 28, 2022 | Data Augmentation, Language Modeling | Code Available | 0
Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale | Aug 21, 2022 | Lipreading, Lip Reading | Unverified | 0
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models | Jun 5, 2022 | Knowledge Distillation, Lipreading | Unverified | 0
Is Lip Region-of-Interest Sufficient for Lipreading? | May 28, 2022 | Lipreading, Self-Supervised Learning | Unverified | 0
Accurate and Resource-Efficient Lipreading with Efficientnetv2 and Transformers | May 23, 2022 | image-classification, Image Classification | Unverified | 0
Multistream neural architectures for cued-speech recognition using a pre-trained visual feature extractor and constrained CTC decoding | Apr 11, 2022 | Decoder, Lipreading | Unverified | 0
Self-supervised Transformer for Deepfake Detection | Mar 2, 2022 | Contrastive Learning, DeepFake Detection | Unverified | 0