SOTAVerified

Automatic Speech Recognition

Papers

Showing 17511800 of 3174 papers

TitleStatusHype
Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding0
Analyzing the factors affecting usefulness of Self-Supervised Pre-trained Representations for Speech Recognition0
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset0
A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings0
Memory-Efficient Training of RNN-Transducer with Sampled Softmax0
Importance of Different Temporal Modulations of Speech: A Tale of Two Perspectives0
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data0
Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition0
Improving Speech Recognition for Indic Languages using Language Model0
Using Adapters to Overcome Catastrophic Forgetting in End-to-End Automatic Speech RecognitionCode0
Code Switched and Code Mixed Speech Recognition for Indic languages0
Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?0
Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing0
Dynamic Latency for CTC-Based Streaming Automatic Speech Recognition With Emformer0
Analysis of EEG frequency bands for Envisioned Speech RecognitionCode0
Frequency-Directional Attention Model for Multilingual Automatic Speech Recognition0
Short-Term Word-Learning in a Dynamically Changing Environment0
Mel Frequency Spectral Domain Defenses against Adversarial Attacks on Speech Recognition Systems0
Shallow Fusion of Weighted Finite-State Transducer and Language Model for Text NormalizationCode0
Finnish Parliament ASR corpus - Analysis, benchmarks and statisticsCode0
A Dataset for Speech Emotion Recognition in Greek Theatrical PlaysCode0
A Speech Representation Anonymization Framework via Selective Noise PerturbationCode0
Impact of Dataset on Acoustic Models for Automatic Speech Recognition0
Speech-enhanced and Noise-aware Networks for Robust Speech RecognitionCode0
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion0
Computing Optimal Location of Microphone for Improved Speech Recognition0
Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks0
Pseudo Label Is Better Than Human Label0
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis0
Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis0
Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition0
Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition0
Prediction of speech intelligibility with DNN-based performance measures0
Whither the Priors for (Vocal) Interactivity?0
RED-ACE: Robust Error Detection for ASR using Confidence EmbeddingsCode0
Spectral Modification Based Data Augmentation For Improving End-to-End ASR For Children's Speech0
Transformer-based Streaming ASR with Cumulative Attention0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling0
Which French speech recognition system for assistant robots?0
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition SystemsCode0
A Conformer Based Acoustic Model for Robust Automatic Speech Recognition0
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training0
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR0
Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models0
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR0
A Survey of Multilingual Models for Automatic Speech Recognition0
Towards Better Meta-Initialization with Task Augmentation for Kindergarten-aged Speech Recognition0
Ask2Mask: Guided Data Selection for Masked Speech Modeling0
Differentially Private Speaker Anonymization0
Show:102550
← PrevPage 36 of 64Next →

No leaderboard results yet.