SOTAVerified

Automatic Speech Recognition

Papers

Showing 1301–1325 of 3174 papers

Title | Status | Hype
DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set | - | 0
Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | - | 0
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition | - | 0
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition | - | 0
Contextual-Utterance Training for Automatic Speech Recognition | - | 0
Simulating realistic speech overlaps improves multi-talker ASR | - | 0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech | - | 0
TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection | - | 0
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning | Code | 1
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning | Code | 1
SAN: a robust end-to-end ASR model architecture | - | 0
On Out-of-Distribution Detection for Audio with Deep Nearest Neighbors | Code | 0
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition | - | 0
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance | - | 0
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation | Code | 0
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition | - | 0
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation | - | 0
V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization | - | 0
End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English | - | 0
There is more than one kind of robustness: Fooling Whisper with adversarial examples | Code | 1
Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead | - | 0
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language Diarization | Code | 0
UFO2: A unified pre-training framework for online and offline speech recognition | - | 0
Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition | - | 0
Efficient Utilization of Large Pre-Trained Models for Low Resource ASR | - | 0
Page 53 of 127

No leaderboard results yet.