SOTAVerified

Automatic Speech Recognition

Papers

Showing 601650 of 3174 papers

TitleStatusHype
ADIMA: Abuse Detection In Multilingual AudioCode0
Confusion2vec 2.0: Enriching Ambiguous Spoken Language Representations with SubwordsCode0
Conditional independence for pretext task selection in Self-supervised speech representation learningCode0
How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia DetectionCode0
Human Transcription Quality ImprovementCode0
Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning FusionCode0
BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation SystemCode0
Comparison and Analysis of New Curriculum Criteria for End-to-End ASRCode0
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics ProcessingCode0
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural NetworksCode0
Arabic Dysarthric Speech Recognition Using Adversarial and Signal-Based AugmentationCode0
Improving CTC-based speech recognition via knowledge transferring from pre-trained language modelsCode0
Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNNCode0
Collecting Resources in Sub-Saharan African Languages for Automatic Speech Recognition: a Case Study of WolofCode0
Character-Level Neural Translation for Multilingual Media Monitoring in the SUMMA ProjectCode0
CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASRCode0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech RecognitionCode0
Detecting Adversarial Examples for Speech Recognition via Uncertainty QuantificationCode0
Enhancing Quantised End-to-End ASR Models via PersonalisationCode0
Realizing Petabyte Scale Acoustic ModelingCode0
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR0
BART based semantic correction for Mandarin automatic speech recognition system0
An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems0
Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models0
BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech0
An Effective Training Framework for Light-Weight Automatic Speech Recognition Models0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
Back-Translation-Style Data Augmentation for End-to-End ASR0
An Effective, Performant Named Entity Recognition System for Noisy Business Telephone Conversation Transcripts0
An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement0
A Deep Generative Acoustic Model for Compositional Automatic Speech Recognition0
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
An Effective End-to-End Modeling Approach for Mispronunciation Detection0
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR0
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition0
An Effective Contextual Language Modeling Framework for Speech Summarization with Augmented Features0
AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning0
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading0
An Effective Context-Balanced Adaptation Approach for Long-Tailed Speech Recognition0
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution0
A bandit approach to curriculum generation for automatic speech recognition0
Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition0
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data0
Automating speech reception threshold measurements using automatic speech recognition0
Automatic Viseme Vocabulary Construction to Enhance Continuous Lip-reading0
An Audio-enriched BERT-based Framework for Spoken Multiple-choice Question Answering0
Adaptive Sparse and Monotonic Attention for Transformer-based Automatic Speech Recognition0
Automatic Transcription Challenges for Inuktitut, a Low-Resource Polysynthetic Language0
Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing0
Anatomy of Industrial Scale Multilingual ASR0
Show:102550
← PrevPage 13 of 64Next →

No leaderboard results yet.