SOTAVerified

Automatic Speech Recognition

Papers

Showing 1126–1150 of 3174 papers

| Title | Status | Hype |
|---|---|---|
| A Benchmark of French ASR Systems Based on Error Severity | | 0 |
| Euronews: a multilingual speech corpus for ASR | | 0 |
| Enhancing CTC-Based Visual Speech Recognition | | 0 |
| Enhancing CTC-based speech recognition with diverse modeling units | | 0 |
| Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation | | 0 |
| Enhancing Code-switching Speech Recognition with Interactive Language Biases | | 0 |
| Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection | | 0 |
| Blind Signal Dereverberation for Machine Speech Recognition | | 0 |
| A Non-autoregressive Model for Joint STT and TTS | | 0 |
| Enhancing Dialogue Speech Recognition with Robust Contextual Awareness via Noise Representation Learning | | 0 |
| Enhancing Documentation of Hupa with Automatic Speech Recognition | | 0 |
| Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities | | 0 |
| Enhancing Large Language Model-based Speech Recognition by Contextualization for Rare and Ambiguous Words | | 0 |
| Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap | | 0 |
| Enhancing Lyrics Transcription on Music Mixtures with Consistency Loss | | 0 |
| Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis | | 0 |
| Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling | | 0 |
| Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA | | 0 |
| Boosting End-to-End Multilingual Phoneme Recognition through Exploiting Universal Speech Attributes Constraints | | 0 |
| Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders | | 0 |
| Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space | | 0 |
| Enhancing Unsupervised Speech Recognition with Diffusion GANs | | 0 |
| Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | | 0 |
| Enriching ASR Lattices with POS Tags for Dependency Parsing | | 0 |
| Blind and neural network-guided convolutional beamformer for joint denoising, dereverberation, and source separation | | 0 |
Page 46 of 127

No leaderboard results yet.