SOTAVerified

Automatic Speech Recognition

Papers

Showing 176200 of 3174 papers

TitleStatusHype
AVATAR: Unconstrained Audiovisual Speech RecognitionCode1
LAE: Language-Aware Encoder for Monolingual and Multilingual ASRCode1
Language Models with Image Descriptors are Strong Few-Shot Video-Language LearnersCode1
Vietnamese Automatic Speech Recognition using Wav2vec 2.0Code1
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation AssessmentCode1
Speaker Recognition in the WildCode1
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo LanguagesCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
Large-Scale Streaming End-to-End Speech Translation with Neural TransducersCode1
PriMock57: A Dataset Of Primary Care Mock ConsultationsCode1
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control CommunicationsCode1
indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languagesCode1
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
Streaming Speaker-Attributed ASR with Token-Level Speaker EmbeddingsCode1
Unsupervised Text-to-Speech Synthesis by Unsupervised Automatic Speech RecognitionCode1
Integrating Lattice-Free MMI into End-to-End Speech RecognitionCode1
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERTCode1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionCode1
Earnings-22: A Practical Benchmark for Accents in the WildCode1
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASRCode1
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
Listen, Adapt, Better WER: Source-free Single-utterance Test-time Adaptation for Automatic Speech RecognitionCode1
Automatic Speech Recognition for Speech Assessment of Persian Preschool ChildrenCode1
Neural Predictor for Black-Box Adversarial Attacks on Speech RecognitionCode1
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question AnsweringCode1
Show:102550
← PrevPage 8 of 127Next →

No leaderboard results yet.