SOTAVerified

Automatic Speech Recognition

Papers

Showing 151200 of 3174 papers

TitleStatusHype
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementCode1
Espresso: A Fast End-to-end Neural Speech Recognition ToolkitCode1
Factorized Neural Transducer for Efficient Language Model AdaptationCode1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language TextCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
FAST-RIR: Fast neural diffuse room impulse response generatorCode1
Advancing Test-Time Adaptation in Wild Acoustic Test SettingsCode1
A transfer learning based approach for pronunciation scoringCode1
AV Taris: Online Audio-Visual Speech RecognitionCode1
Automatic Speech Recognition Benchmark for Air-Traffic CommunicationsCode1
Evolutionary Prompt Design for LLM-Based Post-ASR Error CorrectionCode1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNetCode1
Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling InsightsCode1
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and RecognitionCode1
BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithmCode1
Back Translation for Speech-to-text Translation Without TranscriptsCode1
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech RecognitionCode1
How2: A Large-scale Dataset for Multimodal Language UnderstandingCode1
BembaSpeech: A Speech Recognition Corpus for the Bemba LanguageCode1
BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG dataCode1
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
HypR: A comprehensive study for ASR hypothesis revising with a reference corpusCode1
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
ASR Error Correction with Constrained Decoding on Operation PredictionCode1
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
Can Contextual Biasing Remain Effective with Whisper and GPT-2?Code1
Brazilian Portuguese Speech Recognition Using Wav2vec 2.0Code1
Enhancing Monotonic Multihead Attention for Streaming ASRCode1
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languagesCode1
Integrating Lattice-Free MMI into End-to-End Speech RecognitionCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationCode1
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-AttentionCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic FeaturesCode1
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact CentersCode1
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of KaldiCode1
Distilling Knowledge from Ensembles of Acoustic Models for Joint CTC-Attention End-to-End Speech RecognitionCode1
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech RecognitionCode1
ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global ContextCode1
K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and SyllablesCode1
FlanEC: Exploring Flan-T5 for Post-ASR Error CorrectionCode1
Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMICode1
End-to-End Automatic Speech Recognition for GujaratiCode1
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMsCode1
Show:102550
← PrevPage 4 of 64Next →

No leaderboard results yet.