SOTAVerified

Automatic Speech Recognition

Papers

Showing 1451-1475 of 3174 papers

| Title | Status | Hype |
|---|---|---|
| Leveraging Domain Features for Detecting Adversarial Attacks Against Deep Speech Recognition in Noise | | 0 |
| Phonetic-assisted Multi-Target Units Modeling for Improving Conformer-Transducer ASR system | | 0 |
| Streaming Audio-Visual Speech Recognition with Alignment Regularization | | 0 |
| H_eval: A new hybrid evaluation metric for automatic speech recognition tasks | | 0 |
| Probing Statistical Representations For End-To-End ASR | | 0 |
| InterMPL: Momentum Pseudo-Labeling with Intermediate CTC Loss | | 0 |
| Towards Zero-Shot Code-Switched Speech Recognition | | 0 |
| More Speaking or More Speakers? | | 0 |
| Monolingual Recognizers Fusion for Code-switching Speech Recognition | | 0 |
| BECTRA: Transducer-based End-to-End ASR with BERT-Enhanced Encoder | | 0 |
| A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings | | 0 |
| Adapting self-supervised models to multi-talker speech recognition using speaker embeddings | | 0 |
| Unified End-to-End Speech Recognition and Endpointing for Fast and Efficient Speech Systems | | 0 |
| Mandarin-English Code-Switching Speech Recognition System for Specific Domain | | 0 |
| A Preliminary Study on Automated Speaking Assessment of English as a Second Language (ESL) Students | | 0 |
| An analysis of degenerating speech due to progressive dysarthria on ASR performance | | 0 |
| DiaCorrect: End-to-end error correction for speaker diarization | Code | 0 |
| Structured State Space Decoder for Speech Recognition and Synthesis | | 0 |
| Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings | | 0 |
| Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation | | 0 |
| FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition | | 0 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Code | 0 |
| DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set | | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | | 0 |
| Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition | | 0 |

No leaderboard results yet.