SOTAVerified

Automatic Speech Recognition

Papers

Showing 1801-1850 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| VADOI: Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition | | 0 |
| Improving CTC-based speech recognition via knowledge transferring from pre-trained language models | Code | 0 |
| Korean Tokenization for Beam Search Rescoring in Speech Recognition | | 0 |
| r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation | | 0 |
| Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition | | 0 |
| SemEval 2022 Task 12: Symlink- Linking Mathematical Symbols to their Descriptions | | 0 |
| Domain Adaptation of low-resource Target-Domain models using well-trained ASR Conformer Models | | 0 |
| Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition | | 0 |
| 'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube | | 0 |
| MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition | | 0 |
| Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers | | 0 |
| ADIMA: Abuse Detection In Multilingual Audio | Code | 0 |
| Conversational Speech Recognition By Learning Conversation-level Characteristics | | 0 |
| Multi-style Training for South African Call Centre Audio | | 0 |
| Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme | | 0 |
| Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings | | 0 |
| Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding | Code | 0 |
| A two-step approach to leverage contextual data: speech recognition in air-traffic communications | | 0 |
| Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass | | 0 |
| The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge | | 0 |
| Polyphonic pitch detection with convolutional recurrent neural networks | | 0 |
| Joint Speech Recognition and Audio Captioning | | 0 |
| The RoyalFlush System of Speech Recognition for M2MeT Challenge | | 0 |
| Error Correction in ASR using Sequence-to-Sequence Models | | 0 |
| ASR-Aware End-to-end Neural Diarization | | 0 |
| RescoreBERT: Discriminative Speech Recognition Rescoring with BERT | | 0 |
| Visualizing Automatic Speech Recognition -- Means for a Better Understanding? | | 0 |
| BEA-Base: A Benchmark for ASR of Spontaneous Hungarian | | 0 |
| Language Dependencies in Adversarial Attacks on Speech Recognition Systems | | 0 |
| Reducing language context confusion for end-to-end code-switching automatic speech recognition | | 0 |
| Star Temporal Classification: Sequence Classification with Partially Labeled Data | Code | 0 |
| Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition | | 0 |
| Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition | | 0 |
| On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR | | 0 |
| Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition | Code | 0 |
| The Norwegian Parliamentary Speech Corpus | | 0 |
| Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models | | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | | 0 |
| Improving the fusion of acoustic and text representations in RNN-T | | 0 |
| Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR | | 0 |
| A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition | | 0 |
| How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR | | 0 |
| Human and Automatic Speech Recognition Performance on German Oral History Interviews | | 0 |
| RED-ACE: Robust Error Detection for ASR using Confidence Embeddings | | 0 |
| DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning | | 0 |
| Recent Progress in the CUHK Dysarthric Speech Recognition System | | 0 |
| Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition | | 0 |
| Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection | | 0 |
| A Likelihood Ratio based Domain Adaptation Method for E2E Models | | 0 |
| Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks | Code | 0 |
