SOTAVerified

Automatic Speech Recognition

Papers

Showing 12511300 of 3174 papers

TitleStatusHype
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass0
Blending LSTMs into CNNs0
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation0
FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition0
FairLENS: Assessing Fairness in Law Enforcement Speech Recognition0
Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech0
Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans0
Capitalization and Punctuation Restoration: a Survey0
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging0
Fast and Accurate OOV Decoder on High-Level Features0
Fast and Robust Unsupervised Contextual Biasing for Speech Recognition0
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition0
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition0
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition0
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition0
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition0
Advancing Arabic Speech Recognition Through Large-Scale Weakly Supervised Learning0
Cascaded CNN-resBiLSTM-CTC: An End-to-End Acoustic Model For Speech Recognition0
Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-To-End Automatic Speech Recognition0
FastInject: Injecting Unpaired Text Data into CTC-based ASR training0
Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO0
Fast offline Transformer-based end-to-end automatic speech recognition for real-world applications0
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation0
Enhancements in statistical spoken language translation by de-normalization of ASR results0
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 20240
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text0
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence0
Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization0
Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi0
Feature selection using Fisher's ratio technique for automatic speech recognition0
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
Federated Representation Learning for Automatic Speech Recognition0
Federated Self-Learning with Weak Supervision for Speech Recognition0
Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors0
Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction0
Blank-regularized CTC for Frame Skipping in Neural Transducer0
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition0
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications0
Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge0
Findings of the Shared Task on Speech Recognition for Vulnerable Individuals in Tamil0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model0
Chain of Correction for Full-text Speech Recognition with Large Language Models0
Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility0
Fine-tuning convergence model in Bengali speech recognition0
Fine-tuning pre-trained models for Automatic Speech Recognition, experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)0
Challenges and Opportunities in Multi-device Speech Processing0
An Investigative Study of Multi-Modal Cross-Lingual Retrieval0
Show:102550
← PrevPage 26 of 64Next →

No leaderboard results yet.