SOTAVerified

Automatic Speech Recognition

Papers

Showing 1001-1050 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dynamic Acoustic Unit Augmentation With BPE-Dropout for Low-Resource End-to-End Speech Recognition | | 0 |
| Dynamic ASR Pathways: An Adaptive Masking Approach Towards Efficient Pruning of A Multilingual ASR Model | | 0 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | | 0 |
| Dynamic Data Pruning for Automatic Speech Recognition | | 0 |
| Contribution à l'étude de la variabilité de la voix des personnes âgées en reconnaissance automatique de la parole (Contribution to the study of elderly people's voice variability in automatic speech recognition) [in French] | | 0 |
| Dynamic Masking for Improved Stability in Spoken Language Translation | | 0 |
| A Transfer Learning Method for Speech Emotion Recognition from Automatic Speech Recognition | | 0 |
| Contrastive Semi-supervised Learning for ASR | | 0 |
| Learning Video Representations using Contrastive Bidirectional Transformer | | 0 |
| Alignment-Free Training for Transducer-based Multi-Talker ASR | | 0 |
| EasyASR: A Distributed Machine Learning Platform for End-to-end Automatic Speech Recognition | | 0 |
| E-Branchformer: Branchformer with Enhanced merging for speech recognition | | 0 |
| Echo State Speech Recognition | | 0 |
| Echotune: A Modular Extractor Leveraging the Variable-Length Nature of Speech in ASR Tasks | | 0 |
| A Toolkit for Joint Speaker Diarization and Identification with Application to Speaker-Attributed ASR | | 0 |
| Continuous Speech Recognition using EEG and Video | | 0 |
| EdgeCRNN: an edgecomputing oriented model of acoustic feature enhancement for keyword spotting | | 0 |
| EEG based Continuous Speech Recognition using Transformers | | 0 |
| Continuous Pseudo-Labeling from the Start | | 0 |
| Effective Decoder Masking for Transformer Based End-to-End Speech Recognition | | 0 |
| Effectively pretraining a speech translation decoder with Machine Translation data | | 0 |
| Alignment Entropy Regularization | | 0 |
| A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition | | 0 |
| A CLARIN Transcription Portal for Interview Data | | 0 |
| LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors | | 0 |
| Towards interfacing large language models with ASR systems using confidence measures and prompting | | 0 |
| Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR | | 0 |
| Listening while Speaking and Visualizing: Improving ASR through Multimodal Chain | | 0 |
| Continuous Modeling of the Denoising Process for Speech Enhancement Based on Deep Learning | | 0 |
| Continuously Learning New Words in Automatic Speech Recognition | | 0 |
| A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis | | 0 |
| Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence | | 0 |
| Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings | | 0 |
| ATCSpeechNet: A multilingual end-to-end speech recognition framework for air traffic control systems | | 0 |
| Aligning Speech to Languages to Enhance Code-switching Speech Recognition | | 0 |
| An Adapter Based Pre-Training for Efficient and Scalable Self-Supervised Speech Representation Learning | | 0 |
| Continual learning using lattice-free MMI for speech recognition | | 0 |
| Continual Learning in Machine Speech Chain Using Gradient Episodic Memory | | 0 |
| ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment | | 0 |
| Aligning Pre-trained Models for Spoken Language Translation | | 0 |
| Continual Learning for On-Device Speech Recognition using Disentangled Conformers | | 0 |
| ATC-ANNO: Semantic Annotation for Air Traffic Control with Assistive Auto-Annotation | | 0 |
| Continual Learning for End-to-End ASR by Averaging Domain Experts | | 0 |
| Contextual-Utterance Training for Automatic Speech Recognition | | 0 |
| Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers | | 0 |
| Contextual Speech Recognition with Difficult Negative Training Examples | | 0 |
| Contextual Semi-Supervised Learning: An Approach To Leverage Air-Surveillance and Untranscribed ATC Data in ASR Systems | | 0 |
| Asynchronous Tool Usage for Real-Time Agents | | 0 |
| Contextual RNN-T For Open Domain ASR | | 0 |
| Contextual Language Model Adaptation for Conversational Agents | | 0 |

No leaderboard results yet.