Joint Masked CPC and CTC Training for ASR Oct 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Kaleidoscope: An Efficient, Learnable Representation For All Structured Linear Maps Dec 29, 2020 All image-classification
Code Code Available 1CB-Conformer: Contextual biasing Conformer for biased word recognition Apr 19, 2023 Automatic Speech Recognition Language Modeling
Code Code Available 1Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification Aug 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Kosp2e: Korean Speech to English Translation Corpus Jul 6, 2021 speech-recognition Speech Recognition
Code Code Available 1KoSpeech: Open-Source Toolkit for End-to-End Korean Speech Recognition Sep 7, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command Recognition Jun 1, 2022 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings Jun 6, 2021 Machine Translation speech-recognition
Code Code Available 1CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition May 27, 2019 Decoder Language Modelling
Code Code Available 1Late reverberation suppression using U-nets Oct 5, 2021 Decoder Speech Dereverberation
Code Code Available 1Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Layer-wise Analysis of a Self-supervised Speech Representation Model Jul 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures Jul 27, 2023 Automatic Speech Recognition Contrastive Learning
Code Code Available 1Learning to Detect Noisy Labels Using Model-Based Features Dec 28, 2022 Meta-Learning speech-recognition
Code Code Available 1Learning to Rank Microphones for Distant Speech Recognition Apr 6, 2021 channel selection Decoder
Code Code Available 1Less Peaky and More Accurate CTC Forced Alignment by Label Priors Apr 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Calibrating Transformers via Sparse Gaussian Processes Mar 4, 2023 Bayesian Inference Gaussian Processes
Code Code Available 1Low-Latency Speech Separation Guided Diarization for Telephone Conversations Apr 5, 2022 Action Detection Activity Detection
Code Code Available 1Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition Feb 24, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can Contextual Biasing Remain Effective with Whisper and GPT-2? Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT Mar 29, 2022 All Automatic Speech Recognition
Code Code Available 1Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
Code Code Available 1Bridging the Granularity Gap for Acoustic Modeling May 27, 2023 speech-recognition Speech Recognition
Code Code Available 1Byakto Speech: Real-time long speech synthesis with convolutional neural network: Transfer learning from English to Bangla May 31, 2021 Deep Learning speech-recognition
Code Code Available 1Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech Recognition Mar 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 1Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing Nov 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Computer-Generated Music for Tabletop Role-Playing Games Aug 16, 2020 speech-recognition Speech Recognition
Code Code Available 1Deep Compressive Offloading: Speeding Up Neural Network Inference by Trading Edge Computation for Network Latency Nov 16, 2020 Compressive Sensing Edge-computing
Code Code Available 1MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MediaSpeech: Multilanguage ASR Benchmark and Dataset Mar 30, 2021 speech-recognition Speech Recognition
Code Code Available 1MelHuBERT: A simplified HuBERT on Mel spectrograms Nov 17, 2022 Automatic Speech Recognition Self-Supervised Learning
Code Code Available 1BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing Sep 2, 2023 speech-recognition Speech Recognition
Code Code Available 1Meta-Transfer Learning for Code-Switched Speech Recognition Apr 29, 2020 Language Modeling Language Modelling
Code Code Available 1Minimum Word Error Rate Training for Attention-based Sequence-to-Sequence Models Dec 5, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition Jun 18, 2023 Audio-Visual Speech Recognition Representation Learning
Code Code Available 1BrainBERT: Self-supervised representation learning for intracranial recordings Feb 28, 2023 Language Modeling Language Modelling
Code Code Available 1Monotonic Chunkwise Attention Dec 14, 2017 Document Summarization speech-recognition
Code Code Available 1BIG-C: a Multimodal Multi-Purpose Dataset for Bemba May 26, 2023 Machine Translation speech-recognition
Code Code Available 1Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement Jun 6, 2024 Diversity Speech Enhancement
Code Code Available 1BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data Jan 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation Nov 17, 2024 Action Recognition backdoor defense
Code Code Available 1