SOTAVerified

Automatic Speech Recognition

Papers

Showing 1251–1300 of 3174 papers

Title | Status | Hype
SE-Bridge: Speech Enhancement with Consistent Brownian Bridge | - | 0
Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | - | 0
Personalized Predictive ASR for Latency Reduction in Voice Assistants | - | 0
Text Generation with Speech Synthesis for ASR Data Augmentation | - | 0
Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test | - | 0
GNCformer Enhanced Self-attention for Automatic Speech Recognition | - | 0
On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition | - | 0
Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction | - | 0
Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems | - | 0
CASA-ASR: Context-Aware Speaker-Attributed ASR | - | 0
VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages | - | 0
Self-supervised representations in speech-based depression detection | - | 0
Blank-regularized CTC for Frame Skipping in Neural Transducer | - | 0
Unsupervised ASR via Cross-Lingual Pseudo-Labeling | - | 0
BAT: Boundary aware transducer for memory-efficient and low-latency ASR | - | 0
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks | - | 0
A Lexical-aware Non-autoregressive Transformer-based ASR Model | - | 0
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark | - | 0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | - | 0
Application-Agnostic Language Modeling for On-Device ASR | - | 0
Critical Appraisal of Artificial Intelligence-Mediated Communication | - | 0
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking | - | 0
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations | - | 0
Continual Learning for End-to-End ASR by Averaging Domain Experts | - | 0
Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes | - | 0
Masked Audio Text Encoders are Effective Multi-Modal Rescorers | - | 0
Quran Recitation Recognition using End-to-End Deep Learning | - | 0
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes | - | 0
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition | - | 0
Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models | - | 0
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition | - | 0
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition | - | 0
Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers | - | 0
Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks | Code | 0
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks | - | 0
End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders | - | 0
Employing Hybrid Deep Neural Networks on Dari Speech | - | 0
Considerations for Ethical Speech Recognition Datasets | - | 0
A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge | - | 0
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding | - | 0
A Review of Deep Learning Techniques for Speech Processing | - | 0
Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale | - | 0
Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR | Code | 0
Understanding Shared Speech-Text Representations | - | 0
Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization | - | 0
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition | - | 0
Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding | - | 0
OLISIA: a Cascade System for Spoken Dialogue State Tracking | Code | 0
Towards the Universal Defense for Query-Based Audio Adversarial Attacks | - | 0
Security and Privacy Problems in Voice Assistant Applications: A Survey | - | 0

No leaderboard results yet.