SOTAVerified

Automatic Speech Recognition

Papers

Showing 401425 of 3174 papers

TitleStatusHype
Improving LSTM-CTC based ASR performance in domains with limited training dataCode0
Human Transcription Quality ImprovementCode0
HuBERT-EE: Early Exiting HuBERT for Efficient Speech RecognitionCode0
Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning FusionCode0
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn'tCode0
Language Identification Using Deep Convolutional Recurrent Neural NetworksCode0
How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia DetectionCode0
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanismCode0
Guiding Frame-Level CTC Alignments Using Self-knowledge DistillationCode0
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of SpeechCode0
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations GenerationCode0
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASRCode0
How Phonotactics Affect Multilingual and Zero-shot ASR PerformanceCode0
Hybrid phonetic-neural model for correction in speech recognition systemsCode0
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech RecognitionCode0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language UnderstandingCode0
AI-Generated Song Detection via Lyrics TranscriptsCode0
FLEURS: Few-shot Learning Evaluation of Universal Representations of SpeechCode0
A Simplified Fully Quantized Transformer for End-to-end Speech RecognitionCode0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
Fine-Grained Grounding for Multimodal Speech RecognitionCode0
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative StudyCode0
Assessing the Use of Prosody in Constituency Parsing of Imperfect TranscriptsCode0
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous ClientsCode0
Finnish Parliament ASR corpus - Analysis, benchmarks and statisticsCode0
Show:102550
← PrevPage 17 of 127Next →

No leaderboard results yet.