SOTAVerified

Automatic Speech Recognition

Papers

Showing 3051–3100 of 3174 papers

Title | Status | Hype
Star Temporal Classification: Sequence Classification with Partially Labeled Data | Code | 0
Rehearsal-Free Online Continual Learning for Automatic Speech Recognition | Code | 0
Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition | Code | 0
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors | Code | 0
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | Code | 0
Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain | Code | 0
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech Recognition | Code | 0
A Simplified Fully Quantized Transformer for End-to-end Speech Recognition | Code | 0
A Unified Speaker Adaptation Approach for ASR | Code | 0
Written Term Detection Improves Spoken Term Detection | Code | 0
Transcription free filler word detection with Neural semi-CRFs | Code | 0
Efficient Adaptation of Multilingual Models for Japanese ASR | Code | 0
Adversarial Training For Low-Resource Disfluency Correction | Code | 0
Unsupervised Submodular Rank Aggregation on Score-based Permutations | Code | 0
MaSS: A Large and Clean Multilingual Corpus of Sentence-aligned Spoken Utterances Extracted from the Bible | Code | 0
Data augmentation using prosody and false starts to recognize non-native children's speech | Code | 0
A Small and Fast BERT for Chinese Medical Punctuation Restoration | Code | 0
Massively Multilingual Neural Grapheme-to-Phoneme Conversion | Code | 0
Cross-domain Speech Recognition with Unsupervised Character-level Distribution Matching | Code | 0
On-the-Fly Aligned Data Augmentation for Sequence-to-Sequence ASR | Code | 0
On the Impact of Speech Recognition Errors in Passage Retrieval for Spoken Question Answering | Code | 0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Code | 0
Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Code | 0
Stochastic Attention Head Removal: A simple and effective method for improving Transformer Based ASR Models | Code | 0
Measuring the Accuracy of Automatic Speech Recognition Solutions | Code | 0
Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Code | 0
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It | Code | 0
SkinAugment: Auto-Encoding Speaker Conversions for Automatic Speech Translation | Code | 0
Unsupervised Uncertainty Measures of Automatic Speech Recognition for Non-intrusive Speech Intelligibility Prediction | Code | 0
CAT: CRF-based ASR Toolkit | Code | 0
Vietnamese Capitalization and Punctuation Recovery Models | Code | 0
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | Code | 0
Finnish Parliament ASR corpus - Analysis, benchmarks and statistics | Code | 0
AI-Generated Song Detection via Lyrics Transcripts | Code | 0
SlothSpeech: Denial-of-service Attack Against Speech Recognition Models | Code | 0
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences | Code | 0
Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR | Code | 0
Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition | Code | 0
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative Study | Code | 0
Cascaded Cross-Modal Transformer for Audio-Textual Classification | Code | 0
Rethinking Evaluation in ASR: Are Our Models Robust Enough? | Code | 0
Advancing Topic Segmentation of Broadcasted Speech with Multilingual Semantic Embeddings | Code | 0
The Far Side of Failure: Investigating the Impact of Speech Recognition Errors on Subsequent Dementia Classification | Code | 0
Effects of Layer Freezing on Transferring a Speech Recognition System to Under-resourced Languages | Code | 0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | Code | 0
Bringing NURC/SP to Digital Life: the Role of Open-source Automatic Speech Recognition Models | Code | 0
ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems | Code | 0
Fine-Grained Grounding for Multimodal Speech Recognition | Code | 0
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | Code | 0
Revealing and Protecting Labels in Distributed Training | Code | 0
Page 62 of 64

No leaderboard results yet.