Speech Driven Video Editing via an Audio-Conditioned Diffusion Model Jan 10, 2023 Denoising Face Model
— Unverified 0Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-visual video face hallucination with frequency supervision and cross modality support by speech based lip reading loss Nov 20, 2022 Face Hallucination Generative Adversarial Network
— Unverified 0Lip Sync Matters: A Novel Multimodal Forgery Detector Nov 7, 2022 DeepFake Detection Face Swapping
Code Code Available 0Streaming Audio-Visual Speech Recognition with Alignment Regularization Nov 3, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Novel Frame Structure for Cloud-Based Audio-Visual Speech Enhancement in Multimodal Hearing-aids Oct 24, 2022 Lip Reading Speech Enhancement
— Unverified 0Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation Oct 24, 2022 Action Recognition Lip Reading
— Unverified 0VCSE: Time-Domain Visual-Contextual Speaker Extraction Network Oct 9, 2022 Lip Reading
— Unverified 0Relaxed Attention for Transformer Models Sep 20, 2022 Decoder Image Classification
Code Code Available 0Training Strategies for Improved Lip-reading Sep 3, 2022 Data Augmentation Lipreading
Code Code Available 2Visual Speech Recognition in a Driver Assistance System Aug 29, 2022 Data Augmentation Lipreading
— Unverified 0Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale Aug 21, 2022 Lipreading Lip Reading
— Unverified 0Speaker-adaptive Lip Reading with User-dependent Padding Aug 9, 2022 Lip Reading speech-recognition
Code Code Available 0Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models Jun 5, 2022 Knowledge Distillation Lipreading
— Unverified 0Learning Speaker-specific Lip-to-Speech Generation Jun 4, 2022 Decoder Lip Reading
— Unverified 0RUSAVIC Corpus: Russian Audio-Visual Speech in Cars Jun 1, 2022 Audio-Visual Speech Recognition Lip Reading
— Unverified 0Expression-preserving face frontalization improves visually assisted speech processing Apr 6, 2022 Face Model Lip Reading
— Unverified 0Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading Apr 4, 2022 Lipreading Lip Reading
Code Code Available 1Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video Apr 4, 2022 Lip Reading
Code Code Available 1A Multimodal German Dataset for Automatic Lip Reading Systems and Transfer Learning Feb 27, 2022 Lip Reading Transfer Learning
— Unverified 0Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Multi-Grained Spatio-Temporal Features Perceived Network for Event-Based Lip-Reading Jan 1, 2022 Action Recognition Lip Reading
— Unverified 0LipSound2: Self-Supervised Pre-Training for Lip-to-Speech Reconstruction and Lip Reading Dec 9, 2021 Decoder Lip Reading
— Unverified 0Audio-Visual Synchronisation in the wild Dec 8, 2021 Lip Reading
— Unverified 0Contrastive Learning of Global and Local Video Representations Dec 1, 2021 Classification Contrastive Learning
— Unverified 0Leveraging Uni-Modal Self-Supervised Learning for Multimodal Audio-visual Speech Recognition Nov 16, 2021 Audio-Visual Speech Recognition Language Modelling
— Unverified 0Visual Keyword Spotting with Attention Oct 29, 2021 Lip Reading Visual Keyword Spotting
Code Code Available 1Advances and Challenges in Deep Lip Reading Oct 15, 2021 Deep Learning Lip Reading
— Unverified 0Sub-word Level Lip Reading With Visual Attention Oct 14, 2021 Audio-Visual Active Speaker Detection Automatic Speech Recognition
— Unverified 0Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks Oct 13, 2021 Lip Reading speech-recognition
— Unverified 0Audio-Visual Speech Recognition is Worth 32328 Voxels Sep 20, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0LRWR: Large-Scale Benchmark for Lip Reading in Russian language Sep 14, 2021 Lipreading Lip Reading
— Unverified 0SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory Aug 31, 2021 Lip Reading
— Unverified 0Adaptive Semantic-Spatio-Temporal Graph Convolutional Network for Lip Reading Aug 16, 2021 Landmark-based Lipreading Lip Reading
— Unverified 0Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading Aug 7, 2021 Audio-Visual Speech Recognition Knowledge Distillation
— Unverified 0Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations Jul 26, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Learning From the Master: Distilling Cross-Modal Advanced Knowledge for Lip Reading Jun 19, 2021 Lip Reading Sentence
— Unverified 0LiRA: Learning Visual Speech Representations from Audio through Self-supervision Jun 16, 2021 Lip Reading Self-Supervised Learning
— Unverified 0Selective Listening by Synchronizing Speech with Lips Jun 14, 2021 Lip Reading Target Speaker Extraction
Code Code Available 1Multi-Perspective LSTM for Joint Visual Representation Learning May 6, 2021 Face Recognition Lip Reading
Code Code Available 0End-to-End Video-To-Speech Synthesis using Generative Adversarial Networks Apr 27, 2021 Lip Reading Speech Synthesis
— Unverified 0Fusing information streams in end-to-end audio-visual speech recognition Apr 19, 2021 Audio-Visual Speech Recognition Lip Reading
— Unverified 0Lip reading using external viseme decoding Apr 10, 2021 Lip Reading
— Unverified 0Contrastive Learning of Global-Local Video Representations Apr 7, 2021 Classification Contrastive Learning
Code Code Available 1End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Contrastive Self-Supervised Learning of Global-Local Audio-Visual Representations Jan 1, 2021 Classification DeepFake Detection
— Unverified 0Lip-reading with Hierarchical Pyramidal Convolution and Self-Attention Dec 28, 2020 Lip Reading
— Unverified 0AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements Dec 4, 2020 Benchmarking Lip password classification
Code Code Available 0