SOTAVerified

Automatic Speech Recognition

Papers

Showing 501550 of 3174 papers

TitleStatusHype
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers0
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement0
2-bit Conformer quantization for automatic speech recognition0
Audio Adversarial Examples for Robust Hybrid CTC/Attention Speech Recognition0
Back-Translation-Style Data Augmentation for End-to-End ASR0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech0
Bangla-Wave: Improving Bangla Automatic Speech Recognition Utilizing N-gram Language Models0
BART based semantic correction for Mandarin automatic speech recognition system0
BA-SOT: Boundary-Aware Serialized Output Training for Multi-Talker ASR0
A two-step approach to leverage contextual data: speech recognition in air-traffic communications0
Almost Unsupervised Text to Speech and Automatic Speech Recognition0
A two-stage transliteration approach to improve performance of a multilingual ASR0
All-neural online source separation, counting, and diarization for meeting analysis0
A Cycle-GAN Approach to Model Natural Perturbations in Speech for ASR Applications0
Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain0
Bridging Speech and Textual Pre-trained Models with Unsupervised ASR0
All-neural beamformer for continuous speech separation0
Attentive Adversarial Learning for Domain-Invariant Training0
A Curriculum Learning Method for Improved Noise Robustness in Automatic Speech Recognition0
Attention Enhanced Citrinet for Speech Recognition0
Attention-based Wav2Text with Feature Transfer Learning0
A Likelihood Ratio based Domain Adaptation Method for E2E Models0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts0
Attention based on-device streaming speech recognition with large speech corpus0
Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation0
Activity focused Speech Recognition of Preschool Children in Early Childhood Classrooms0
Attention based end to end Speech Recognition for Voice Search in Hindi and English0
Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework0
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection0
Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm0
Attention-based ASR with Lightweight and Dynamic Convolutions0
Alignment Restricted Streaming Recurrent Neural Network Transducer0
Alignment Knowledge Distillation for Online Streaming Attention-based Speech Recognition0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
A CTC Triggered Siamese Network with Spatial-Temporal Dropout for Speech Recognition0
LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors0
A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition0
Alignment-Free Training for Transducer-based Multi-Talker ASR0
A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR0
Alignment Entropy Regularization0
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition0
A CLARIN Transcription Portal for Interview Data0
Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition0
BridgeNets: Student-Teacher Transfer Learning Based on Recursive Neural Networks and its Application to Distant Speech Recognition0
Bridging the Gap Between Clean Data Training and Real-World Inference for Spoken Language Understanding0
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators0
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments0
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis0
ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems0
Show:102550
← PrevPage 11 of 64Next →

No leaderboard results yet.