SOTAVerified

Automatic Speech Recognition

Papers

Showing 426450 of 3174 papers

TitleStatusHype
Generative Adversarial Training Data Adaptation for Very Low-resource Automatic Speech RecognitionCode0
A Simplified Fully Quantized Transformer for End-to-end Speech RecognitionCode0
AI-Generated Song Detection via Lyrics TranscriptsCode0
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASRCode0
A Unified Speaker Adaptation Approach for ASRCode0
FLEURS: Few-shot Learning Evaluation of Universal Representations of SpeechCode0
Assessing the Use of Prosody in Constituency Parsing of Imperfect TranscriptsCode0
Measuring the Accuracy of Automatic Speech Recognition SolutionsCode0
Finnish Parliament ASR corpus - Analysis, benchmarks and statisticsCode0
Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language UnderstandingCode0
Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous ClientsCode0
Fine-Grained Grounding for Multimodal Speech RecognitionCode0
Multi-Stage Speaker Diarization for Noisy ClassroomsCode0
Fine-tuning Strategies for Faster Inference using Speech Self-Supervised Models: A Comparative StudyCode0
Guiding Frame-Level CTC Alignments Using Self-knowledge DistillationCode0
Improving Voice Separation by Incorporating End-to-end Speech RecognitionCode0
Exploring Generative Error Correction for Dysarthric Speech RecognitionCode0
Explainability of Speech Recognition Transformers via Gradient-based Attention VisualizationCode0
Exploiting Attention-based Sequence-to-Sequence Architectures for Sound Event LocalizationCode0
FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech DataCode0
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech RepresentationCode0
Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst TasksCode0
Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification TasksCode0
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech ToolkitCode0
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation NetworkCode0
Show:102550
← PrevPage 18 of 127Next →

No leaderboard results yet.