SOTAVerified

Automatic Speech Recognition

Papers

Showing 1351-1400 of 3174 papers

Title | Status | Hype
Cantonese Automatic Speech Recognition Using Transfer Learning from Mandarin | | 0
Generative AI and Large Language Models in Language Preservation: Opportunities and Challenges | | 0
Generative Context-aware Fine-tuning of Self-supervised Speech Models | | 0
Generative error correction for code-switching speech recognition using large language models | | 0
German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis | | 0
Gesture-Aware Zero-Shot Speech Recognition for Patients with Language Disorders | | 0
Exploring WavLM Back-ends for Speech Spoofing and Deepfake Detection | | 0
Exploring Transfer Learning For End-to-End Spoken Language Understanding | | 0
A Wav2vec2-Based Experimental Study on Self-Supervised Learning Methods to Improve Child Speech Recognition | | 0
Exploring the Role of Audio in Video Captioning | | 0
Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages | | 0
Gradient-Adjusted Neuron Activation Profiles for Comprehensive Introspection of Convolutional Speech Recognition Models | | 0
Gradient Norm-based Fine-Tuning for Backdoor Defense in Automatic Speech Recognition | | 0
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation | | 0
Grammar Based Speaker Role Identification for Air Traffic Control Speech Recognition | | 0
Graph based manifold regularized deep neural networks for automatic speech recognition | | 0
Are disentangled representations all you need to build speaker anonymization systems? | | 0
Adversarial Speaker Adaptation | | 0
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study | | 0
Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection | | 0
Guided contrastive self-supervised pre-training for automatic speech recognition | | 0
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | | 0
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation | | 0
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages | | 0
Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language | | 0
Hallucination of speech recognition errors with sequence to sequence learning | | 0
Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models | | 0
Halving transcription time: A fast, user-friendly and GDPR-compliant workflow to create AI-assisted transcripts for content analysis | | 0
Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition | | 0
Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM | | 0
Calibration of Phone Likelihoods in Automatic Speech Recognition | | 0
Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects | | 0
HASP: A High-Performance Adaptive Mobile Security Enhancement Against Malicious Speech Recognition | | 0
Head-synchronous Decoding for Transformer-based Streaming ASR | | 0
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems | | 0
Hear No Evil: Towards Adversarial Robustness of Automatic Speech Recognition via Multi-Task Learning | | 0
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing | | 0
HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation | | 0
HESITA(te) in Portuguese | | 0
Heterogeneous Language Model Optimization in Automatic Speech Recognition | | 0
Heterogeneous Reservoir Computing Models for Persian Speech Recognition | | 0
Hey ASR System! Why Aren't You More Inclusive? Automatic Speech Recognition Systems' Bias and Proposed Bias Mitigation Techniques. A Literature Review | | 0
A Recorded Debating Dataset | | 0
Hierarchical Cross-Modality Knowledge Transfer with Sinkhorn Attention for CTC-based ASR | | 0
Hierarchical Multi Task Learning With CTC | | 0
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models | | 0
Exploring Textual and Speech information in Dialogue Act Classification with Speaker Domain Adaptation | | 0
Hierarchical Summarization for Longform Spoken Dialog | | 0
Hierarchical Transformer-based Large-Context End-to-end ASR with Large-Context Knowledge Distillation | | 0
Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models | | 0
Page 28 of 64

No leaderboard results yet.