SOTAVerified

Automatic Speech Recognition

Papers

Showing 1276–1300 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Can Whisper perform speech-based in-context learning? | | 0 |
| Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text | | 0 |
| Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations | | 0 |
| Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos | | 0 |
| On Architectures and Training for Raw Waveform Feature Extraction in ASR | | 0 |
| Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition | | 0 |
| Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping | | 0 |
| Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models | | 0 |
| Federated Representation Learning for Automatic Speech Recognition | | 0 |
| Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition | | 0 |
| Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling | | 0 |
| Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study | | 0 |
| Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction | | 0 |
| Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | | 0 |
| Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition | | 0 |
| FinAudio: A Benchmark for Audio Large Language Models in Financial Applications | | 0 |
| Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge | | 0 |
| Findings of the Shared Task on Speech Recognition for Vulnerable Individuals in Tamil | | 0 |
| Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model | | 0 |
| Accented Speech Recognition: A Survey | | 0 |
| Extracting Biomedical Entities from Noisy Audio Transcripts | | 0 |
| Fine-tuning convergence model in Bengali speech recognition | | 0 |
| Fine-tuning pre-trained models for Automatic Speech Recognition, experiments on a fieldwork corpus of Japhug (Trans-Himalayan family) | | 0 |
| Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent? | | 0 |
| Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin | | 0 |

No leaderboard results yet.