SOTAVerified

Automatic Speech Recognition

Papers

Showing 401450 of 3174 papers

TitleStatusHype
AMPS: ASR with Multimodal Paraphrase Supervision0
Amortized Neural Networks for Low-Latency Speech Recognition0
Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax0
3-D Feature and Acoustic Modeling for Far-Field Speech Recognition0
A Mixture of Expert Based Deep Neural Network for Improved ASR0
On the Problem of Text-To-Speech Model Selection for Synthetic Data Generation in Automatic Speech Recognition0
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR0
Automatic Speech Recognition on a Firefighter TETRA Broadcast Channel0
Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey0
Audio-visual Multi-channel Recognition of Overlapped Speech0
Audio-visual Multi-channel Integration and Recognition of Overlapped Speech0
Amharic-English Speech Translation in Tourism Domain0
Audio-visual fine-tuning of audio-only ASR models0
Adam^+: A Stochastic Method with Adaptive Variance Reduction0
Audio-visual multi-channel speech separation, dereverberation and recognition0
AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup0
AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition0
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings0
Audio-Visual Speech Enhancement with Score-Based Generative Models0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
Audio Visual Speech Recognition using Deep Recurrent Neural Networks0
Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition0
Augmenting Automatic Speech Recognition Models with Disfluency Detection0
Augmenting Bottleneck Features of Deep Neural Network Employing Motor State for Speech Recognition at Humanoid Robots0
Augmenting Images for ASR and TTS through Single-loop and Dual-loop Multimodal Chain Framework0
A Multimodal Approach to Device-Directed Speech Detection with Large Language Models0
A meta learning scheme for fast accent domain expansion in Mandarin speech recognition0
A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes0
A Unified Neural Architecture for Joint Dialog Act Segmentation and Recognition in Spoken Dialog System0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
AudioFool: Fast, Universal and synchronization-free Cross-Domain Attack on Speech Recognition0
A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation0
A user study to compare two conversational assistants designed for people with hearing impairments0
ADAGIO: Interactive Experimentation with Adversarial Attack and Defense for Audio0
Automated Cross-language Intelligibility Analysis of Parkinson's Disease Patients Using Speech Recognition Technologies0
Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?0
Automated speech tools for helping communities process restricted-access corpora for language revival efforts0
Automated speech-unit delimitation in spoken learner English0
Audio Enhancement for Computer Audition -- An Iterative Training Paradigm Using Sample Importance0
Automatic Assessment of Oral Reading Accuracy for Reading Diagnostics0
Automatic assessment of spoken language proficiency of non-native children0
Audio De-identification - a New Entity Recognition Task0
Automatic Documentation of ICD Codes with Far-Field Speech Recognition0
Automatic Estimation of Intelligibility Measure for Consonants in Speech0
Automatic language identity tagging on word and sentence-level in multilingual text sources: a case-study on Luxembourgish0
Automatic Learning of Subword Dependent Model Scales0
Automatic Quality Estimation for ASR System Combination0
Automatic recognition and detection of aphasic natural speech0
Automatic recognition of child speech for robotic applications in noisy environments0
A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network0
Show:102550
← PrevPage 9 of 64Next →

No leaderboard results yet.