SOTAVerified

Automatic Speech Recognition

Papers

Showing 876900 of 3174 papers

TitleStatusHype
FunCodec: A Fundamental, Reproducible and Integrable Open-source Toolkit for Neural Speech CodecCode2
Enhancing Child Vocalization Classification with Phonetically-Tuned Embeddings for Assisting Autism Diagnosis0
Open-vocabulary Keyword-spotting with Adaptive Instance Normalization0
Can Whisper perform speech-based in-context learning?0
Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method0
Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults0
Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning FusionCode0
Leveraging Large Language Models for Exploiting ASR Uncertainty0
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR EvaluationCode0
Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems0
LanSER: Language-Model Supported Speech Emotion Recognition0
TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models0
Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition0
AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning0
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation0
Contextual Biasing of Named-Entities with Large Language Models0
Learning Speech Representation From Contrastive Token-Acoustic Pretraining0
Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer0
ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers0
Adapting Text-based Dialogue State Tracker for Spoken Dialogues0
Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition0
Neural approaches to spoken content embedding0
Decoupled Structure for Improved Adaptability of End-to-End Models0
A Small and Fast BERT for Chinese Medical Punctuation RestorationCode0
SeamlessM4T: Massively Multilingual & Multimodal Machine TranslationCode2
Show:102550
← PrevPage 36 of 127Next →

No leaderboard results yet.