Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Feb 23, 2024 Lipreading Lip Reading
Code Code Available 3Visual Speech Recognition for Multiple Languages in the Wild Feb 26, 2022 Hyperparameter Optimization Lipreading
Code Code Available 2Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels Mar 25, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 2SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization Jun 18, 2024 Landmark-based Lipreading Lipreading
Code Code Available 2Training Strategies for Improved Lip-reading Sep 3, 2022 Data Augmentation Lipreading
Code Code Available 2Robust Self-Supervised Audio-Visual Speech Recognition Jan 5, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 2Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 1Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Mutual Information Maximization for Effective Lip Reading Mar 13, 2020 Lipreading Lip Reading
Code Code Available 1Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs Nov 4, 2024 Lipreading speech-recognition
Code Code Available 1Deformation Flow Based Two-Stream Network for Lip Reading Mar 12, 2020 Knowledge Distillation Lipreading
Code Code Available 1End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading Apr 4, 2022 Lipreading Lip Reading
Code Code Available 1LipNet: End-to-End Sentence-level Lipreading Nov 5, 2016 General Classification Lipreading
Code Code Available 1Jointly Learning Visual and Auditory Speech Representations from Raw Data Dec 12, 2022 Audio-Visual Speech Recognition Lipreading
Code Code Available 1Watch Your Mouth: Silent Speech Recognition with Depth Sensing May 11, 2024 Deep Learning Lipreading
Code Code Available 1Towards Practical Lipreading with Distilled and Efficient Models Jul 13, 2020 Knowledge Distillation Lipreading
Code Code Available 1Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Learn an Effective Lip Reading Model without Pains Nov 15, 2020 Lipreading Lip Reading
Code Code Available 1Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 1Lipreading using Temporal Convolutional Networks Jan 23, 2020 Lipreading Lip Reading
Code Code Available 1LipLearner: Customizable Silent Speech Interactions on Mobile Devices Feb 12, 2023 Contrastive Learning Incremental Learning
Code Code Available 1Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection Dec 14, 2020 DeepFake Detection Lipreading
Code Code Available 1Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Accurate and Resource-Efficient Lipreading with Efficientnetv2 and Transformers May 23, 2022 image-classification Image Classification
— Unverified 0Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading Jan 16, 2017 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Lipper: Synthesizing Thy Speech using Multi-View Lipreading Jun 28, 2019 Lipreading
— Unverified 0ASR is all you need: cross-modal distillation for lip reading Nov 28, 2019 All Automatic Speech Recognition
— Unverified 0Audio-visual Multi-channel Recognition of Overlapped Speech May 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading May 4, 2019 General Classification Lipreading
— Unverified 0Decoding visemes: improving machine lipreading Oct 3, 2017 Classification General Classification
— Unverified 0Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture Sep 28, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lip Reading Sentences in the Wild Nov 16, 2016 Lipreading Lip Reading
— Unverified 0Decoding visemes: improving machine lipreading Oct 3, 2017 Clustering General Classification
— Unverified 0Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading Feb 18, 2024 Lipreading Lip Reading
— Unverified 0Analysis of Visual Features for Continuous Lipreading in Spanish Nov 21, 2023 Lipreading speech-recognition
— Unverified 0Large-Scale Visual Speech Recognition Jul 13, 2018 Decoder Lipreading
— Unverified 0Conformers are All You Need for Visual Speech Recognition Feb 17, 2023 All Lipreading
— Unverified 0Is Lip Region-of-Interest Sufficient for Lipreading? May 28, 2022 Lipreading Self-Supervised Learning
— Unverified 0Comparing phonemes and visemes with DNN-based lipreading May 8, 2018 Decoder Lipreading
— Unverified 0Large-vocabulary Audio-visual Speech Recognition in Noisy Environments Sep 10, 2021 Audio-Visual Speech Recognition Lipreading
— Unverified 0Audio-Visual Speech Enhancement with Score-Based Generative Models Jun 2, 2023 Automatic Speech Recognition Lipreading
— Unverified 0A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading Aug 14, 2019 Lipreading Lip Reading
— Unverified 0Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition Feb 15, 2022 Audio-Visual Speech Recognition Lipreading
— Unverified 0Learning from Videos with Deep Convolutional LSTM Networks Apr 9, 2019 Lipreading Lip Reading
— Unverified 0Investigating the dynamics of hand and lips in French Cued Speech using attention mechanisms and CTC-based decoding Jun 14, 2023 Lipreading
— Unverified 0Learning Speaker-Invariant Visual Features for Lipreading Jun 9, 2025 Disentanglement Lipreading
— Unverified 0Improving Speaker-Independent Lipreading with Domain-Adversarial Training Aug 4, 2017 Lipreading speech-recognition
— Unverified 0Comparing heterogeneous visual gestures for measuring the diversity of visual speech signals May 8, 2018 Clustering Diversity
— Unverified 0