SOTAVerified

Automatic Speech Recognition

Papers

Showing 16511700 of 3174 papers

TitleStatusHype
Automatic Speech Recognition for Speech Assessment of Persian Preschool ChildrenCode1
Lahjoita puhetta -- a large-scale corpus of spoken Finnish with some benchmarks0
Pseudo Label Is Better Than Human Label0
Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis0
A Text-to-Speech Pipeline, Evaluation Methodology, and Initial Fine-Tuning Results for Child Speech Synthesis0
Exploiting Cross Domain Acoustic-to-articulatory Inverted Features For Disordered Speech Recognition0
Neural Predictor for Black-Box Adversarial Attacks on Speech RecognitionCode1
Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition0
Prediction of speech intelligibility with DNN-based performance measures0
Whither the Priors for (Vocal) Interactivity?0
RED-ACE: Robust Error Detection for ASR using Confidence EmbeddingsCode0
Spectral Modification Based Data Augmentation For Improving End-to-End ASR For Children's Speech0
Transformer-based Streaming ASR with Cumulative Attention0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question AnsweringCode1
A practical framework for multi-domain speech recognition and an instance sampling method to neural language modeling0
Which French speech recognition system for assistant robots?0
Towards Contextual Spelling Correction for Customization of End-to-end Speech Recognition SystemsCode0
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR ErrorsCode1
Extended Graph Temporal Classification for Multi-Speaker End-to-End ASR0
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training0
A Conformer Based Acoustic Model for Robust Automatic Speech Recognition0
Integrating Text Inputs For Training and Adapting RNN Transducer ASR Models0
A Survey of Multilingual Models for Automatic Speech Recognition0
Language technology practitioners as language managers: arbitrating data bias and predictive bias in ASR0
Ask2Mask: Guided Data Selection for Masked Speech Modeling0
Towards Better Meta-Initialization with Task Augmentation for Kindergarten-aged Speech Recognition0
Differentially Private Speaker Anonymization0
Improving CTC-based speech recognition via knowledge transferring from pre-trained language modelsCode0
Korean Tokenization for Beam Search Rescoring in Speech Recognition0
VADOI:Voice-Activity-Detection Overlapping Inference For End-to-end Long-form Speech Recognition0
r-G2P: Evaluating and Enhancing Robustness of Grapheme to Phoneme Conversion by Controlled noise introducing and Contextual information incorporation0
Speaker Adaptation Using Spectro-Temporal Deep Features for Dysarthric and Elderly Speech Recognition0
SemEval 2022 Task 12: Symlink- Linking Mathematical Symbols to their Descriptions0
Domain Adaptation of low-resource Target-Domain models using well-trained ASR Conformer Models0
'Beach' to 'Bitch': Inadvertent Unsafe Transcription of Kids' Content on YouTube0
MLP-ASR: Sequence-length agnostic all-MLP architectures for speech recognition0
Mitigating Closed-model Adversarial Examples with Bayesian Neural Modeling for Enhanced End-to-End Speech Recognition0
AISHELL-NER: Named Entity Recognition from Chinese SpeechCode1
ADIMA: Abuse Detection In Multilingual AudioCode0
Conversational Speech Recognition By Learning Conversation-level Characteristics0
Knowledge Transfer from Large-scale Pretrained Language Models to End-to-end Speech Recognizers0
Multi-style Training for South African Call Centre Audio0
Saving RNN Computations with a Neuron-Level Fuzzy Memoization Scheme0
Multimodal Depression Classification Using Articulatory Coordination Features And Hierarchical Attention Based Text Embeddings0
Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model DecodingCode0
Enhancing ASR for Stuttered Speech with Limited Data Using Detect and Pass0
A two-step approach to leverage contextual data: speech recognition in air-traffic communications0
Efficient Adapter Transfer of Self-Supervised Speech Models for Automatic Speech RecognitionCode1
Polyphonic pitch detection with convolutional recurrent neural networks0
Show:102550
← PrevPage 34 of 64Next →

No leaderboard results yet.