SOTAVerified

Automatic Speech Recognition

Papers

Showing 451500 of 3174 papers

TitleStatusHype
FastEmit: Low-latency Streaming ASR with Sequence-level Emission RegularizationCode0
On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASRCode0
Fine-Grained Grounding for Multimodal Speech RecognitionCode0
FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech DataCode0
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event LocalizationCode0
Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR EvaluationCode0
Analyzing Hidden Representations in End-to-End Automatic Speech Recognition SystemsCode0
Exploring Generative Error Correction for Dysarthric Speech RecognitionCode0
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative StudyCode0
Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification TasksCode0
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech RecognitionCode0
Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst TasksCode0
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech ToolkitCode0
ASR Benchmarking: Need for a More Representative Conversational DatasetCode0
Enhancing Quantised End-to-End ASR Models via PersonalisationCode0
Error-preserving Automatic Speech Recognition of Young English Learners' LanguageCode0
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation NetworkCode0
End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive EnvelopesCode0
Explainability of Speech Recognition Transformers via Gradient-based Attention VisualizationCode0
Finnish Parliament ASR corpus - Analysis, benchmarks and statisticsCode0
End-to-End Learning of Speech 2D Feature-Trajectory for Prosthetic HandsCode0
End-to-End Open Vocabulary Keyword Search With Multilingual Neural RepresentationsCode0
End to End ASR System with Automatic Punctuation InsertionCode0
A Small and Fast BERT for Chinese Medical Punctuation RestorationCode0
AfriHuBERT: A self-supervised speech representation model for African languagesCode0
ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language ModelsCode0
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language DiarizationCode0
Efficient Adaptation of Multilingual Models for Japanese ASRCode0
ASDF: A Differential Testing Framework for Automatic Speech Recognition SystemsCode0
AequeVox: Automated Fairness Testing of Speech Recognition SystemsCode0
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced LanguagesCode0
Efficient Ensemble for Multimodal Punctuation Restoration using Time-Delay Neural NetworkCode0
ed-cec: improving rare word recognition using asr postprocessing based on error detection and context-aware error correctionCode0
EESEN: End-to-End Speech Recognition using Deep RNN Models and WFST-based DecodingCode0
Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasksCode0
Intrinsic evaluation of language models for code-switchingCode0
Segmentation-Free Streaming Machine TranslationCode0
Selective Attention Merging for low resource tasks: A case study of Child ASRCode0
Adversarial Training For Low-Resource Disfluency CorrectionCode0
Self-supervised Speech Representations Still Struggle with African American Vernacular EnglishCode0
Semantically Meaningful Metrics for Norwegian ASR SystemsCode0
Semantic Mask for Transformer based End-to-End Speech RecognitionCode0
An Automatic Speech Recognition System for Bengali Language based on Wav2Vec2 and Transfer LearningCode0
Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 ChallengeCode0
DoCIA: An Online Document-Level Context Incorporation Agent for Speech TranslationCode0
Does Joint Training Really Help Cascaded Speech Translation?Code0
A Comparative Study on Transformer vs RNN in Speech ApplicationsCode0
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical StudyCode0
Discrete Speech Unit Extraction via Independent Component AnalysisCode0
BERT Attends the Conversation: Improving Low-Resource Conversational ASRCode0
Show:102550
← PrevPage 10 of 64Next →

No leaderboard results yet.