SOTAVerified

Automatic Speech Recognition

Papers

Showing 12511300 of 3174 papers

TitleStatusHype
Articulatory and bottleneck features for speaker-independent ASR of dysarthric speech0
Adversarial Training for Multilingual Acoustic Modeling0
Fast and Robust Unsupervised Contextual Biasing for Speech Recognition0
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification0
Fast and Accurate OOV Decoder on High-Level Features0
Fast and Accurate Capitalization and Punctuation for Automatic Speech Recognition Using Transformer and Chunk Merging0
Capturing Multi-Resolution Context by Dilated Self-Attention0
A review of on-device fully neural end-to-end automatic speech recognition algorithms0
Falling silent, lost for words ... Tracing personal involvement in interviews with Dutch war veterans0
Capitalization and Punctuation Restoration: a Survey0
Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech0
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition0
FairLENS: Assessing Fairness in Law Enforcement Speech Recognition0
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition0
FastCorrect 2: Fast Error Correction on Multiple Candidates for Automatic Speech Recognition0
FastCorrect: Fast Error Correction with Edit Alignment for Automatic Speech Recognition0
Can You Hear It? Backdoor Attacks via Ultrasonic Triggers0
A Review of Deep Learning Techniques for Speech Processing0
Adversarial synthesis based data-augmentation for code-switched spoken language identification0
FastInject: Injecting Unpaired Text Data into CTC-based ASR training0
FairASR: Fair Audio Contrastive Learning for Automatic Speech Recognition0
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation0
Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation0
Factual Consistency Oriented Speech Recognition0
Fast Streaming Transducer ASR Prototyping via Knowledge Distillation with Whisper0
Can Whisper perform speech-based in-context learning?0
Fast Word Error Rate Estimation Using Self-Supervised Representations for Speech and Text0
Facetron: A Multi-speaker Face-to-Speech Model based on Cross-modal Latent Representations0
Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos0
On Architectures and Training for Raw Waveform Feature Extraction in ASR0
Can We Trust Explainable AI Methods on ASR? An Evaluation on Phoneme Recognition0
Enabling Differentially Private Federated Learning for Speech Recognition: Benchmarks, Adaptive Optimizers and Gradient Clipping0
Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models0
Federated Representation Learning for Automatic Speech Recognition0
Extracting Domain Invariant Features by Unsupervised Learning for Robust Automatic Speech Recognition0
Can We Train a Language Model Inside an End-to-End ASR Model? - Investigating Effective Implicit Language Modeling0
Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study0
Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion0
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition0
FinAudio: A Benchmark for Audio Large Language Models in Financial Applications0
Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge0
Findings of the Shared Task on Speech Recognition for Vulnerable Individuals in Tamil0
Finding Task-specific Subnetworks in Multi-task Spoken Language Understanding Model0
Accented Speech Recognition: A Survey0
Extracting Biomedical Entities from Noisy Audio Transcripts0
Fine-tuning convergence model in Bengali speech recognition0
Fine-tuning pre-trained models for Automatic Speech Recognition, experiments on a fieldwork corpus of Japhug (Trans-Himalayan family)0
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?0
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin0
Show:102550
← PrevPage 26 of 64Next →

No leaderboard results yet.