SOTAVerified

Automatic Speech Recognition

Papers

Showing 10511100 of 3174 papers

TitleStatusHype
Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection0
Building competitive direct acoustics-to-word models for English conversational speech recognition0
Building a Unified Code-Switching ASR System for South African Languages0
A Probabilistic Framework for Representing Dialog Systems and Entropy-Based Dialog Management through Dynamic Stochastic State Evolution0
Building a Public Domain Voice Database for Odia0
Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale0
A privacy-preserving method using secret key for convolutional neural network-based speech classification0
Adversarial Attacks on ASR Systems: An Overview0
Building a Noisy Audio Dataset to Evaluate Machine Learning Approaches for Automatic Speech Recognition Systems0
Building and Evaluation of a Real Room Impulse Response Dataset0
A Preliminary Study on Automated Speaking Assessment of English as a Second Language (ESL) Students0
Building an ASR Error Robust Spoken Virtual Patient System in a Highly Class-Imbalanced Scenario Without Speech Data0
Building a great multi-lingual teacher with sparsely-gated mixture of experts for speech recognition0
A practical two-stage training strategy for multi-stream end-to-end speech recognition0
Adversarial Attacks and Defenses for Speech Recognition Systems0
A Conformer Based Acoustic Model for Robust Automatic Speech Recognition0
Building Accurate Low Latency ASR for Streaming Voice Search0
BUCEADOR, a multi-language search engine for digital libraries0
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
BSTC: A Large-Scale Chinese-English Speech Translation Dataset0
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging0
A Configurable Multilingual Model is All You Need to Recognize All Languages0
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators0
Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over0
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition0
Application-Agnostic Language Modeling for On-Device ASR0
Accelerating Transducers through Adjacent Token Merging0
1SPU: 1-step Speech Processing Unit0
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence0
Bridging the Modality Gap: Softly Discretizing Audio Representation for LLM-based Automatic Speech Recognition0
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models0
App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review0
Bridging the Gap Between Monaural Speech Enhancement and Recognition with Distortion-Independent Acoustic Modeling0
Bridging the Gap Between Clean Data Training and Real-World Inference for Spoken Language Understanding0
Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts0
Advancing Speech Recognition With No Speech Or With Noisy Speech0
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR0
Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs0
BridgeNets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and its Application to Distant Speech Recognition0
Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm0
A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let's Go Bus Information System0
A Comprehensive Study of the Current State-of-the-Art in Nepali Automatic Speech Recognition Systems0
A Comprehensive Study of Deep Bidirectional LSTM RNNs for Acoustic Modeling in Speech Recognition0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts0
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition0
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages0
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training0
``Oh, I've Heard That Before'': Modelling Own-Dialect Bias After Perceptual Learning by Weighting Training Data0
Show:102550
← PrevPage 22 of 64Next →

No leaderboard results yet.