SOTAVerified

Automatic Speech Recognition

Papers

Showing 1001-1050 of 3174 papers

Title | Status | Hype
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge | - | 0
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages | Code | 0
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation | - | 0
Quantifying the Dialect Gap and its Correlates Across Languages | - | 0
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features | - | 0
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Code | 0
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation | - | 0
Intelligibility prediction with a pretrained noise-robust automatic speech recognition model | - | 0
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System | - | 0
Unintended Memorization in Large ASR Models, and How to Mitigate It | - | 0
Zipformer: A faster and better encoder for automatic speech recognition | - | 0
Generative error correction for code-switching speech recognition using large language models | - | 0
VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System | - | 0
Correction Focused Language Model Training for Speech Recognition | - | 0
Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition | - | 0
Advanced accent/dialect identification and accentedness assessment with multi-embedding models and automatic speech recognition | - | 0
Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model | - | 0
End-to-end Multichannel Speaker-Attributed ASR: Speaker Guided Decoder and Input Feature Analysis | - | 0
Personalization of CTC-based End-to-End Speech Recognition Using Pronunciation-Driven Subword Tokenization | - | 0
Large Vocabulary Spontaneous Speech Recognition for Tigrigna | - | 0
Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring | - | 0
SALM: Speech-augmented Language Model with In-context Learning for Speech Recognition and Translation | - | 0
On the Relevance of Phoneme Duration Variability of Synthesized Training Data for Automatic Speech Recognition | - | 0
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text | - | 0
Adapting the adapters for code-switching in multilingual ASR | Code | 0
No Pitch Left Behind: Addressing Gender Unbalance in Automatic Speech Recognition through Pitch Manipulation | - | 0
Discriminative Speech Recognition Rescoring with Pre-trained Language Models | - | 0
Acoustic Model Fusion for End-to-end Speech Recognition | - | 0
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis | - | 0
ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correction | Code | 0
Spike-Triggered Contextual Biasing for End-to-End Mandarin Speech Recognition | - | 0
HuBERTopic: Enhancing Semantic Representation of HuBERT through Self-supervision Utilizing Topic Model | - | 0
A privacy-preserving method using secret key for convolutional neural network-based speech classification | - | 0
Neural Language Model Pruning for Automatic Speech Recognition | - | 0
An Integrated Algorithm for Robust and Imperceptible Audio Adversarial Examples | - | 0
UniverSLU: Universal Spoken Language Understanding for Diverse Tasks with Natural Language Instructions | - | 0
One model to rule them all ? Towards End-to-End Joint Speaker Diarization and Speech Recognition | - | 0
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR | - | 0
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition | - | 0
The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning | - | 0
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping | - | 0
Wiki-En-ASR-Adapt: Large-scale synthetic dataset for English ASR Customization | - | 0
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm | - | 0
Enhancing Code-switching Speech Recognition with Interactive Language Biases | - | 0
SSHR: Leveraging Self-supervised Hierarchical Representations for Multilingual Automatic Speech Recognition | - | 0
PP-MeT: a Real-world Personalized Prompt based Meeting Transcription System | - | 0
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR | - | 0
Does Single-channel Speech Enhancement Improve Keyword Spotting Accuracy? A Case Study | - | 0
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study | - | 0
Segment-Level Vectorized Beam Search Based on Partially Autoregressive Inference | - | 0

No leaderboard results yet.