SOTAVerified

Automatic Speech Recognition

Papers

Showing 17011750 of 3174 papers

TitleStatusHype
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
Joint Speech Recognition and Audio Captioning0
The RoyalFlush System of Speech Recognition for M2MeT Challenge0
Streaming Multi-Talker ASR with Token-Level Serialized Output TrainingCode1
ASR-Aware End-to-end Neural Diarization0
Error Correction in ASR using Sequence-to-Sequence Models0
RescoreBERT: Discriminative Speech Recognition Rescoring with BERT0
Language Dependencies in Adversarial Attacks on Speech Recognition Systems0
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian0
Visualizing Automatic Speech Recognition -- Means for a Better Understanding?0
Reducing language context confusion for end-to-end code-switching automatic speech recognition0
Star Temporal Classification: Sequence Classification with Partially Labeled DataCode0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
Sentiment-Aware Automatic Speech Recognition pre-training for enhanced Speech Emotion Recognition0
Discovering Phonetic Inventories with Crosslingual Automatic Speech RecognitionCode0
On the Effectiveness of Pinyin-Character Dual-Decoding for End-to-End Mandarin Chinese ASR0
The Norwegian Parliamentary Speech Corpus0
Improving the fusion of acoustic and text representations in RNN-T0
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models0
Run-and-back stitch search: novel block synchronous decoding for streaming encoder-decoder ASR0
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video0
Unified Multimodal Punctuation Restoration Framework for Mixed-Modality CorpusCode1
A Noise-Robust Self-supervised Pre-training Model Based Speech Representation Learning for Automatic Speech Recognition0
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR0
Human and Automatic Speech Recognition Performance on German Oral History Interviews0
DUAL: Textless Spoken Question Answering with Speech Discrete Unit Adaptive Learning0
RED-ACE: Robust Error Detection for ASR using Confidence Embeddings0
Recent Progress in the CUHK Dysarthric Speech Recognition System0
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition0
A Likelihood Ratio based Domain Adaptation Method for E2E Models0
Cross-Modal ASR Post-Processing System for Error Correction and Utterance Rejection0
Neural Architecture Search For LF-MMI Trained Time Delay Neural NetworksCode0
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset0
Improving Mandarin End-to-End Speech Recognition with Word N-gram Language ModelCode1
Robust Self-Supervised Audio-Visual Speech RecognitionCode2
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster PredictionCode2
Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question0
Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation0
Multi-Dialect Arabic Speech Recognition0
Multi-Variant Consistency based Self-supervised Learning for Robust Automatic Speech Recognition0
Regularizing End-to-End Speech Translation with Triangular Decomposition AgreementCode1
Voice Quality and Pitch Features in Transformer-Based Speech Recognition0
Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching0
Multi-turn RNN-T for streaming recognition of multi-party speech0
Continual Learning for Monolingual End-to-End Automatic Speech RecognitionCode0
Prompt Tuning GPT-2 language model for parameter-efficient domain adaptation of ASR systems0
Real-Time Neural Voice Camouflage0
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model0
Robustifying automatic speech recognition by extracting slowly varying features0
PM-MMUT: Boosted Phone-Mask Data Augmentation using Multi-Modeling Unit Training for Phonetic-Reduction-Robust E2E Speech Recognition0
Show:102550
← PrevPage 35 of 64Next →

No leaderboard results yet.