SOTAVerified

Automatic Speech Recognition

Papers

Showing 13011350 of 3174 papers

TitleStatusHype
DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set0
Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili0
Random Utterance Concatenation Based Data Augmentation for Improving Short-video Speech Recognition0
Filter and evolve: progressive pseudo label refining for semi-supervised automatic speech recognition0
Contextual-Utterance Training for Automatic Speech Recognition0
Simulating realistic speech overlaps improves multi-talker ASR0
Weight Averaging: A Simple Yet Effective Method to Overcome Catastrophic Forgetting in Automatic Speech Recognition0
V-Cloak: Intelligibility-, Naturalness- & Timbre-Preserving Real-Time Voice Anonymization0
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task LearningCode1
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptationCode0
Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation0
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive LearningCode1
SAN: a robust end-to-end ASR model architecture0
TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection0
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition0
Streaming Voice Conversion Via Intermediate Bottleneck Features And Non-streaming Teacher Guidance0
Virtuoso: Massive Multilingual Speech-Text Joint Semi-Supervised Learning for Text-To-Speech0
On Out-of-Distribution Detection for Audio with Deep Nearest NeighborsCode0
End-to-End Speech to Intent Prediction to improve E-commerce Customer Support Voicebot in Hindi and English0
There is more than one kind of robustness: Fooling Whisper with adversarial examplesCode1
Reducing Language confusion for Code-switching Speech Recognition with Token-level Language DiarizationCode0
Smart Speech Segmentation using Acousto-Linguistic Features with look-ahead0
UFO2: A unified pre-training framework for online and offline speech recognition0
Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition0
Efficient Utilization of Large Pre-Trained Models for Low Resource ASR0
Monotonic segmental attention for automatic speech recognition0
Linguistic-Enhanced Transformer with CTC Embedding for Speech Recognition0
Does Joint Training Really Help Cascaded Speech Translation?Code0
Time-Domain Speech Enhancement for Robust Automatic Speech Recognition0
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimationCode1
Investigating self-supervised, weakly supervised and fully supervised training approaches for multi-domain automatic speech recognition: a study on Bangladeshi Bangla0
ESB: A Benchmark For Multi-Domain End-to-End Speech RecognitionCode1
Guided contrastive self-supervised pre-training for automatic speech recognition0
Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation0
Can Visual Context Improve Automatic Speech Recognition for an Embodied Agent?0
Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses0
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR0
Discrete Cross-Modal Alignment Enables Zero-Shot Speech TranslationCode0
Continuous Pseudo-Labeling from the Start0
Language-agnostic Code-Switching in Sequence-To-Sequence Speech Recognition0
Towards Relation Extraction From SpeechCode1
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach0
Bringing NURC/SP to Digital Life: the Role of Open-source Automatic Speech Recognition ModelsCode0
LeVoice ASR Systems for the ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge0
Learning to Jointly Transcribe and Subtitle for End-to-End Spontaneous Speech Recognition0
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning0
Can we use Common Voice to train a Multi-Speaker TTS system?Code1
Summary on the ISCSLP 2022 Chinese-English Code-Switching ASR Challenge0
A context-aware knowledge transferring strategy for CTC-based ASRCode1
Streaming Punctuation for Long-form Dictation with Transformers0
Show:102550
← PrevPage 27 of 64Next →

No leaderboard results yet.