SOTAVerified

Automatic Speech Recognition

Papers

Showing 2951-3000 of 3174 papers

Title | Status | Hype
Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies | | 0
Automatic Spoken Language Identification using a Time-Delay Neural Network | | 0
Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | | 0
Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language | | 0
Automatic Viseme Vocabulary Construction to Enhance Continuous Lip-reading | | 0
Automating speech reception threshold measurements using automatic speech recognition | | 0
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | | 0
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition | | 0
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading | | 0
AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning | | 0
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition | | 0
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | | 0
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers | | 0
Back-Translation-Style Data Augmentation for End-to-End ASR | | 0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | | 0
BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech | | 0
Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models | | 0
BART based semantic correction for Mandarin automatic speech recognition system | | 0
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR | | 0
BAT: Boundary aware transducer for memory-efficient and low-latency ASR | | 0
Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for Speech Recognition | | 0
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | | 0
BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition | | 0
BBS-KWS:The Mandarin Keyword Spotting System Won the Video Keyword Wakeup Challenge | | 0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge | | 0
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | | 0
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | | 0
'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube | | 0
BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder | | 0
Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | | 0
Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | | 0
Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech | | 0
Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | | 0
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR | | 0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition | | 0
Bengali Common Voice Speech Dataset for Automatic Speech Recognition | | 0
Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection | | 0
Best of Both Worlds: Robust Accented Speech Recognition with Adversarial Transfer Learning | | 0
Best Practices for Noise-Based Augmentation to Improve the Performance of Deployable Speech-Based Emotion Recognition Systems | | 0
Better Pseudo-labeling with Multi-ASR Fusion and Error Correction by SpeechLLM | | 0
Better Transcription of UK Supreme Court Hearings | | 0
Beyond Binary: Multiclass Paraphasia Detection with Generative Pretrained Transformers and End-to-End Models | | 0
Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection | | 0
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition | | 0
Bi-APC: Bidirectional Autoregressive Predictive Coding for Unsupervised Pre-training and Its Application to Children's ASR | | 0
Biased Self-supervised learning for ASR | | 0
Bi-directional Context-Enhanced Speech Large Language Models for Multilingual Conversational ASR | | 0
Bidirectional Representations for Low Resource Spoken Language Understanding | | 0
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition | | 0
Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition | | 0
Page 60 of 64

No leaderboard results yet.