SOTAVerified

Automatic Speech Recognition

Papers

Showing 11261150 of 3174 papers

TitleStatusHype
Complex-Valued Time-Frequency Self-Attention for Speech Dereverberation0
ASR is all you need: cross-modal distillation for lip reading0
A Hierarchical Reasoning Graph Neural Network for The Automatic Scoring of Answer Transcriptions in Video Job Interviews0
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation0
Accent-Robust Automatic Speech Recognition Using Supervised and Unsupervised Wav2vec Embeddings0
Improving noisy student training for low-resource languages in End-to-End ASR using CycleGAN and inter-domain losses0
Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR0
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch0
ASR in German: A Detailed Error Analysis0
Comparison of Lattice-Free and Lattice-Based Sequence Discriminative Training Criteria for LVCSR0
Comparison of Grapheme-to-Phoneme Conversion Methods on a Myanmar Pronunciation Dictionary0
ASR-GLUE: A New Multi-task Benchmark for ASR-Robust Natural Language Understanding0
A Hierarchical Context-aware Modeling Approach for Multi-aspect and Multi-granular Pronunciation Assessment0
Comparing the Benefit of Synthetic Training Data for Various Automatic Speech Recognition Architectures0
ASR for Non-standardised Languages with Dialectal Variation: the case of Swiss German0
Comparing Discrete and Continuous Space LLMs for Speech Recognition0
ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems0
A Hardware-Oriented and Memory-Efficient Method for CTC Decoding0
Acoustic Model Optimization Based On Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition0
Comparing CTC and LFMMI for out-of-domain adaptation of wav2vec 2.0 acoustic model0
Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task0
ASR error management for improving spoken language understanding0
Comparative Analysis of the wav2vec 2.0 Feature Extractor0
Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks0
ASR Error Detection via Audio-Transcript entailment0
Show:102550
← PrevPage 46 of 127Next →

No leaderboard results yet.