SOTAVerified

Automatic Speech Recognition

Papers

Showing 11011150 of 3174 papers

TitleStatusHype
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training0
``Oh, I've Heard That Before'': Modelling Own-Dialect Bias After Perceptual Learning by Weighting Training Data0
Advancing Multi-talker ASR Performance with Large Language Models0
Brain Signals to Rescue Aphasia, Apraxia and Dysarthria Speech Recognition0
An Ultra-low Power RNN Classifier for Always-On Voice Wake-Up Detection Robust to Real-World Scenarios0
Bootstrap an end-to-end ASR system by multilingual training, transfer learning, text-to-text mapping and synthetic audio0
Boosting the Transferability of Audio Adversarial Examples with Acoustic Representation Optimization0
Anti-spoofing Methods for Automatic SpeakerVerification System0
Advancing Momentum Pseudo-Labeling with Conformer and Initialization Strategy0
A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions0
Evaluating ASR Confidence Scores for Automated Error Detection in User-Assisted Correction Interfaces0
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric0
Boosting Punctuation Restoration with Data Generation and Reinforcement Learning0
Boosting Norwegian Automatic Speech Recognition0
A Novel Self-training Approach for Low-resource Speech Recognition0
Boosting Noise Robustness of Acoustic Model via Deep Adversarial Training0
A Novel End-to-End CAPT System for L2 Children Learners0
Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis0
EURO: ESPnet Unsupervised ASR Open-source Toolkit0
An Online Attention-based Model for Speech Recognition0
Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM0
Advancing CTC-CRF Based End-to-End Speech Recognition with Wordpieces and Conformers0
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism0
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition0
A non-expert Kaldi recipe for Vietnamese Speech Recognition System0
A Benchmark of French ASR Systems Based on Error Severity0
Euronews: a multilingual speech corpus for ASR0
Enhancing CTC-Based Visual Speech Recognition0
Enhancing CTC-based speech recognition with diverse modeling units0
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation0
Enhancing Code-switching Speech Recognition with Interactive Language Biases0
Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection0
Blind Signal Dereverberation for Machine Speech Recognition0
A Non-autoregressive Model for Joint STT and TTS0
Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning0
Enhancing Documentation of Hupa with Automatic Speech Recognition0
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities0
Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words0
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap0
Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss0
Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis0
Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling0
Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA0
Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints0
Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders0
Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space0
Enhancing Unsupervised Speech Recognition with Diffusion GANs0
Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization0
Enriching ASR Lattices with POS Tags for Dependency Parsing0
Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation0
Show:102550
← PrevPage 23 of 64Next →

No leaderboard results yet.