HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models Sep 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Memory-augmented conformer for improved end-to-end long-form ASR Sep 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1HypR: A comprehensive study for ASR hypothesis revising with a reference corpus Sep 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation Aug 8, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus Jul 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1NoRefER: a Referenceless Quality Metric for Automatic Speech Recognition via Semi-Supervised Language Model Fine-Tuning with Contrastive Learning Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization Jun 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can Contextual Biasing Remain Effective with Whisper and GPT-2? Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation May 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition May 16, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Back Translation for Speech-to-text Translation Without Transcripts May 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker One Feb 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Feb 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Towards Voice Reconstruction from EEG during Imagined Speech Jan 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Skit-S2I: An Indian Accented Speech to Intent dataset Dec 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels Dec 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora Nov 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Towards Improved Room Impulse Response Estimation for Speech Recognition Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Multi-blank Transducers for Speech Recognition Nov 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing Nov 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student training setup Nov 2, 2022 Automatic Speech Recognition (ASR) Language Modeling
Code Code Available 1Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1There is more than one kind of robustness: Fooling Whisper with adversarial examples Oct 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition Oct 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
Code Code Available 1Towards Relation Extraction From Speech Oct 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT Oct 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations Oct 5, 2022 Automatic Speech Recognition (ASR) Clustering
Code Code Available 1TVLT: Textless Vision-Language Transformer Sep 28, 2022 Automatic Speech Recognition (ASR) Image Retrieval
Code Code Available 1Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM Sep 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Deep Sparse Conformer for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages Aug 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MM-ALT: A Multimodal Automatic Lyric Transcription System Jul 13, 2022 Action Detection Activity Detection
Code Code Available 1Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1