Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing Feb 23, 2024 Lipreading Lip Reading
Code Code Available 35 Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Robust Self-Supervised Audio-Visual Speech Recognition Jan 5, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 25 Visual Speech Recognition for Multiple Languages in the Wild Feb 26, 2022 Hyperparameter Optimization Lipreading
Code Code Available 25 Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels Mar 25, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 25 SyncVSR: Data-Efficient Visual Speech Recognition with End-to-End Crossmodal Audio Token Synchronization Jun 18, 2024 Landmark-based Lipreading Lipreading
Code Code Available 25 Training Strategies for Improved Lip-reading Sep 3, 2022 Data Augmentation Lipreading
Code Code Available 25 Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deformation Flow Based Two-Stream Network for Lip Reading Mar 12, 2020 Knowledge Distillation Lipreading
Code Code Available 15 Deep Audio-Visual Speech Recognition Sep 6, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Jointly Learning Visual and Auditory Speech Representations from Raw Data Dec 12, 2022 Audio-Visual Speech Recognition Lipreading
Code Code Available 15 Learn an Effective Lip Reading Model without Pains Nov 15, 2020 Lipreading Lip Reading
Code Code Available 15 Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading Apr 4, 2022 Lipreading Lip Reading
Code Code Available 15 Watch Your Mouth: Silent Speech Recognition with Depth Sensing May 11, 2024 Deep Learning Lipreading
Code Code Available 15 LipNet: End-to-End Sentence-level Lipreading Nov 5, 2016 General Classification Lipreading
Code Code Available 15 Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 15 Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection Dec 14, 2020 DeepFake Detection Lipreading
Code Code Available 15 Towards Practical Lipreading with Distilled and Efficient Models Jul 13, 2020 Knowledge Distillation Lipreading
Code Code Available 15 Unified Speech Recognition: A Single Model for Auditory, Visual, and Audiovisual Inputs Nov 4, 2024 Lipreading speech-recognition
Code Code Available 15 Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 15 Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 LipLearner: Customizable Silent Speech Interactions on Mobile Devices Feb 12, 2023 Contrastive Learning Incremental Learning
Code Code Available 15 Lipreading using Temporal Convolutional Networks Jan 23, 2020 Lipreading Lip Reading
Code Code Available 15 Mutual Information Maximization for Effective Lip Reading Mar 13, 2020 Lipreading Lip Reading
Code Code Available 15 Bayesian Neural Network Language Modeling for Speech Recognition Aug 28, 2022 Data Augmentation Language Modeling
Code Code Available 05 Recurrent Neural Network Transducer for Audio-Visual Speech Recognition Nov 8, 2019 Audio-Visual Speech Recognition Lipreading
Code Code Available 05 Deep word embeddings for visual speech recognition Oct 30, 2017 Lipreading speech-recognition
Code Code Available 05 Combining Residual Networks with LSTMs for Lipreading Mar 12, 2017 Lipreading Lip Reading
Code Code Available 05 End-to-end Audiovisual Speech Recognition Feb 18, 2018 Lipreading speech-recognition
Code Code Available 05 Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems May 9, 2024 Audio-Visual Speech Recognition Lipreading
Code Code Available 05 Relaxed Attention for Transformer Models Sep 20, 2022 Decoder Image Classification
Code Code Available 05 Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers Nov 26, 2019 Knowledge Distillation Lipreading
Code Code Available 05 Evaluation of End-to-End Continuous Spanish Lipreading in Different Data Conditions Feb 1, 2025 Lipreading speech-recognition
Code Code Available 05 LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild Oct 16, 2018 Lipreading Lip Reading
Code Code Available 05 SpotFast Networks with Memory Augmented Lateral Transformers for Lipreading May 21, 2020 Action Recognition Lipreading
Code Code Available 05 Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading Jan 16, 2017 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 00 Audio-visual Multi-channel Recognition of Overlapped Speech May 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Decoding visemes: improving machine lipreading Oct 3, 2017 Classification General Classification
— Unverified 00 Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture Sep 28, 2018 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 ASR is all you need: cross-modal distillation for lip reading Nov 28, 2019 All Automatic Speech Recognition
— Unverified 00 Accurate and Resource-Efficient Lipreading with Efficientnetv2 and Transformers May 23, 2022 image-classification Image Classification
— Unverified 00 Learning from Videos with Deep Convolutional LSTM Networks Apr 9, 2019 Lipreading Lip Reading
— Unverified 00 Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition Feb 15, 2022 Audio-Visual Speech Recognition Lipreading
— Unverified 00 Decoding visemes: improving machine lipreading Oct 3, 2017 Clustering General Classification
— Unverified 00 Cross-Attention Fusion of Visual and Geometric Features for Large Vocabulary Arabic Lipreading Feb 18, 2024 Lipreading Lip Reading
— Unverified 00 Learning Spatio-Temporal Features with Two-Stream Deep 3D CNNs for Lipreading May 4, 2019 General Classification Lipreading
— Unverified 00 Learning Speaker-Invariant Visual Features for Lipreading Jun 9, 2025 Disentanglement Lipreading
— Unverified 00 Large-vocabulary Audio-visual Speech Recognition in Noisy Environments Sep 10, 2021 Audio-Visual Speech Recognition Lipreading
— Unverified 00 Large-Scale Visual Speech Recognition Jul 13, 2018 Decoder Lipreading
— Unverified 00