| Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands | Jul 6, 2022 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 |
| Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models | Jun 5, 2022 | Knowledge DistillationLipreading | —Unverified | 0 |
| CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command Recognition | Jun 1, 2022 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| RUSAVIC Corpus: Russian Audio-Visual Speech in Cars | Jun 1, 2022 | Audio-Visual Speech RecognitionLip Reading | —Unverified | 0 |
| Is Lip Region-of-Interest Sufficient for Lipreading? | May 28, 2022 | LipreadingSelf-Supervised Learning | —Unverified | 0 |
| Deep Learning for Visual Speech Analysis: A Survey | May 22, 2022 | Deep Learningspeech-recognition | —Unverified | 0 |
| Visual Speech Recognition for Multiple Languages in the Wild | Feb 26, 2022 | Hyperparameter OptimizationLipreading | CodeCode Available | 2 |
| Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Feb 24, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition | Feb 15, 2022 | Audio-Visual Speech RecognitionLipreading | —Unverified | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | Jan 25, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| Recent Progress in the CUHK Dysarthric Speech Recognition System | Jan 15, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Jan 11, 2022 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| Robust Self-Supervised Audio-Visual Speech Recognition | Jan 5, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 2 |
| Leveraging Uni-Modal Self-Supervised Learning for Multimodal Audio-visual Speech Recognition | Nov 16, 2021 | Audio-Visual Speech RecognitionLanguage Modelling | —Unverified | 0 |
| Advances and Challenges in Deep Lip Reading | Oct 15, 2021 | Deep LearningLip Reading | —Unverified | 0 |
| Sub-word Level Lip Reading With Visual Attention | Oct 14, 2021 | Audio-Visual Active Speaker DetectionAutomatic Speech Recognition | —Unverified | 0 |
| Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks | Oct 13, 2021 | Lip Readingspeech-recognition | —Unverified | 0 |
| Audio-Visual Speech Recognition is Worth 32328 Voxels | Sep 20, 2021 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| LRWR: Large-Scale Benchmark for Lip Reading in Russian language | Sep 14, 2021 | LipreadingLip Reading | —Unverified | 0 |
| Large-vocabulary Audio-visual Speech Recognition in Noisy Environments | Sep 10, 2021 | Audio-Visual Speech RecognitionLipreading | —Unverified | 0 |
| Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading | Aug 7, 2021 | Audio-Visual Speech RecognitionKnowledge Distillation | —Unverified | 0 |
| Interactive decoding of words from visual speech recognition models | Jul 1, 2021 | Positionspeech-recognition | —Unverified | 0 |
| Fusing information streams in end-to-end audio-visual speech recognition | Apr 19, 2021 | Audio-Visual Speech RecognitionLip Reading | —Unverified | 0 |
| End-to-end Audio-visual Speech Recognition with Conformers | Feb 12, 2021 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Part-based Lipreading for Audio-Visual Speech Recognition | Dec 14, 2020 | Audio-Visual Speech RecognitionLipreading | —Unverified | 0 |
| AV Taris: Online Audio-Visual Speech Recognition | Dec 14, 2020 | Action DetectionActivity Detection | CodeCode Available | 1 |
| Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection | Dec 14, 2020 | DeepFake DetectionLipreading | CodeCode Available | 1 |
| Learn an Effective Lip Reading Model without Pains | Nov 15, 2020 | LipreadingLip Reading | CodeCode Available | 1 |
| Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion | Oct 25, 2020 | Audio-Visual Speech RecognitionLandmark-based Lipreading | —Unverified | 0 |
| "Notic My Speech" -- Blending Speech Patterns With Multimedia | Jun 12, 2020 | speech-recognitionSpeech Recognition | —Unverified | 0 |
| Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech Recognition | May 19, 2020 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition | Apr 17, 2020 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | Mar 6, 2020 | LipreadingLip Reading | CodeCode Available | 1 |
| Audio-visual Recognition of Overlapped speech for the LRS2 dataset | Jan 6, 2020 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Detecting Adversarial Attacks On Audiovisual Speech Recognition | Dec 18, 2019 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 |
| Continuous Speech Recognition using EEG and Video | Dec 16, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASR is all you need: cross-modal distillation for lip reading | Nov 28, 2019 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Recurrent Neural Network Transducer for Audio-Visual Speech Recognition | Nov 8, 2019 | Audio-Visual Speech RecognitionLipreading | CodeCode Available | 0 |
| Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition | Jun 5, 2019 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 |
| MobiVSR: A Visual Speech Recognition Solution for Mobile Devices | May 10, 2019 | Lip ReadingQuantization | —Unverified | 0 |
| End-to-End Visual Speech Recognition for Small-Scale Datasets | Apr 2, 2019 | General Classificationspeech-recognition | —Unverified | 0 |
| Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech Recognition | Jan 29, 2019 | speech-recognitionSpeech Recognition | CodeCode Available | 0 |
| Modality Attention for End-to-End Audio-visual Speech Recognition | Nov 13, 2018 | Audio-Visual Speech RecognitionRobust Speech Recognition | —Unverified | 0 |
| LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild | Oct 16, 2018 | LipreadingLip Reading | CodeCode Available | 0 |
| 3D Feature Pyramid Attention Module for Robust Visual Speech Recognition | Oct 15, 2018 | LipreadingSentence | —Unverified | 0 |
| Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture | Sep 28, 2018 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Perfect match: Improved cross-modal embeddings for audio-visual synchronisation | Sep 21, 2018 | Binary ClassificationCross-Modal Retrieval | —Unverified | 0 |
| Deep Audio-Visual Speech Recognition | Sep 6, 2018 | Audio-Visual Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| LRS3-TED: a large-scale dataset for visual speech recognition | Sep 3, 2018 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Zero-shot keyword spotting for visual speech recognition in-the-wild | Jul 23, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |