SOTAVerified

Automatic Speech Recognition

Papers

Showing 11511200 of 3174 papers

TitleStatusHype
A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC VideosCode0
Model Adaptation for ASR in low-resource Indian Languages0
Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices0
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications0
Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition0
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study0
Speech Diarization and ASR with GMM0
Writer adaptation for offline text recognition: An exploration of neural network-based methodsCode0
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments0
Online Hybrid CTC/Attention End-to-End Automatic Speech Recognition Architecture0
Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework0
Knowledge-Aware Audio-Grounded Generative Slot Filling for Limited Annotated Data0
Boosting Norwegian Automatic Speech Recognition0
Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos0
Multilingual Contextual Adapters To Improve Custom Word Recognition In Low-resource Languages0
Don't Stop Self-Supervision: Accent Adaptation of Speech Representations via Residual Adapters0
Conformer LLMs -- Convolution Augmented Large Language Models0
Automatic Speech Recognition of Non-Native Child Speech for Language Learning Applications0
Accelerating Transducers through Adjacent Token Merging0
Master-ASR: Achieving Multilingual Scalability and Low-Resource Adaptation in ASR with Modular Learning0
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios0
Learning When to Trust Which Teacher for Weakly Supervised ASR0
Federated Self-Learning with Weak Supervision for Speech Recognition0
Mixture Encoder for Joint Speech Separation and Recognition0
Strategies in Transfer Learning for Low-Resource Speech Synthesis: Phone Mapping, Features Input, and Source Language Selection0
Exploring the Role of Audio in Video Captioning0
Rehearsal-Free Online Continual Learning for Automatic Speech RecognitionCode0
MobileASR: A resource-aware on-device learning framework for user voice personalization applications on mobile phones0
Lexical Speaker Error Correction: Leveraging Language Models for Speaker Diarization Error Correction0
Unified model for code-switching speech recognition and language identification based on a concatenated tokenizer0
Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation0
DCTX-Conformer: Dynamic context carry-over for low latency unified streaming and non-streaming Conformer ASR0
Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition0
On the N-gram Approximation of Pre-trained Language Models0
Multimodal Audio-textual Architecture for Robust Spoken Language Understanding0
Multi-View Frequency-Attention Alternative to CNN Frontends for Automatic Speech Recognition0
Impact of Experiencing Misrecognition by Teachable Agents on Learning and Rapport0
Adversarial Training For Low-Resource Disfluency CorrectionCode0
What Can an Accent Identifier Learn? Probing Phonetic and Prosodic Information in a Wav2vec2-based Accent Identification Model0
A Theory of Unsupervised Speech RecognitionCode0
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition0
Improving Language Model Integration for Neural Machine Translation0
An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders0
FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator0
Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization0
A study on the impact of Self-Supervised Learning on automatic dysarthric speech assessment0
Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based AugmentationCode0
Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics0
Alzheimer Disease Classification through ASR-based Transcriptions: Exploring the Impact of Punctuation and Pauses0
Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering0
Show:102550
← PrevPage 24 of 64Next →

No leaderboard results yet.