SOTAVerified

Automatic Speech Recognition

Papers

Showing 101150 of 3174 papers

TitleStatusHype
BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control CommunicationsCode1
Evolutionary Prompt Design for LLM-Based Post-ASR Error CorrectionCode1
Automatic Speech Recognition for Speech Assessment of Persian Preschool ChildrenCode1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNetCode1
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and RecognitionCode1
Adaptation of Whisper models to child speech recognitionCode1
EnCodecMAE: Leveraging neural codecs for universal audio representation learningCode1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task LearningCode1
Adapting End-to-End Speech Recognition for Readable SubtitlesCode1
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
Automatic Disfluency Detection from Untranscribed SpeechCode1
Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognitionCode1
Automatic Speech Recognition Benchmark for Air-Traffic CommunicationsCode1
CB-Conformer: Contextual biasing Conformer for biased word recognitionCode1
AVATAR: Unconstrained Audiovisual Speech RecognitionCode1
Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through GradientsCode1
End-to-End Automatic Speech Recognition for GujaratiCode1
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact CentersCode1
Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for PolishCode1
Combining Frame-Synchronous and Label-Synchronous Systems for Speech RecognitionCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
HypR: A comprehensive study for ASR hypothesis revising with a reference corpusCode1
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech RecognitionCode1
Improved Noisy Student Training for Automatic Speech RecognitionCode1
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
Improving Mandarin Speech Recogntion with Block-augmented TransformerCode1
Audio-Visual Efficient Conformer for Robust Speech RecognitionCode1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation ModelsCode1
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming CapabilitiesCode1
Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language TextCode1
Dual-Path Style Learning for End-to-End Noise-Robust Speech RecognitionCode1
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question AnsweringCode1
Earnings-22: A Practical Benchmark for Accents in the WildCode1
Attention-based Contextual Language Model Adaptation for Speech RecognitionCode1
Accented Speech Recognition With Accent-specific CodebooksCode1
Dompteur: Taming Audio Adversarial ExamplesCode1
Attention-based Audio-Visual Fusion for Robust Automatic Speech RecognitionCode1
AVLnet: Learning Audio-Visual Language Representations from Instructional VideosCode1
Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech TranslationCode1
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech RecognitionCode1
End-to-end Named Entity Recognition from English SpeechCode1
A convolutional neural-network model of human cochlear mechanics and filter tuning for real-time applicationsCode1
A context-aware knowledge transferring strategy for CTC-based ASRCode1
A Systematic Comparison of Phonetic Aware Techniques for Speech EnhancementCode1
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and EnglishCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control CommunicationsCode1
Distilling the Knowledge of BERT for Sequence-to-Sequence ASRCode1
ASR data augmentation in low-resource settings using cross-lingual multi-speaker TTS and cross-lingual voice conversionCode1
A Sidecar Separator Can Convert a Single-Talker Speech Recognition System to a Multi-Talker OneCode1
Show:102550
← PrevPage 3 of 64Next →

No leaderboard results yet.