SOTAVerified

Automatic Speech Recognition

Papers

Showing 13011350 of 3174 papers

TitleStatusHype
Challenges and Opportunities of Speech Recognition for Bengali Language0
Enhancements in statistical spoken language translation by de-normalization of ASR results0
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 20240
Challenges of Applying Automatic Speech Recognition for Transcribing EU Parliament Committee Meetings: A Pilot Study0
Challenges of Computational Processing of Code-Switching0
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend0
Flexi-Transducer: Optimizing Latency, Accuracy and Compute forMulti-Domain On-Device Scenarios0
FluentNet: End-to-End Detection of Speech Disfluency with Deep Learning0
Focused Discriminative Training For Streaming CTC-Trained Automatic Speech Recognition Models0
FOOCTTS: Generating Arabic Speech with Acoustic Environment for Football Commentator0
Enhancement and Recognition of Reverberant and Noisy Speech by Extending Its Coherence0
Fotheidil: an Automatic Transcription System for the Irish Language0
Four-in-One: A Joint Approach to Inverse Text Normalization, Punctuation, Capitalization, and Disfluency for Automatic Speech Recognition0
Free English and Czech telephone speech corpus shared under the CC-BY-SA 3.0 license0
Enhanced Robot Speech Recognition Using Biomimetic Binaural Sound Source Localization0
Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition0
From Audio to Semantics: Approaches to end-to-end spoken language understanding0
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition0
Annotated Speech Corpus for Low Resource Indian Languages: Awadhi, Bhojpuri, Braj and Magahi0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language0
Enhanced CORILGA: Introducing the Automatic Phonetic Alignment Tool for Continuous Speech0
From Voice to Safety: Language AI Powered Pilot-ATC Communication Understanding for Airport Surface Movement Collision Risk Assessment0
From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data0
FT Speech: Danish Parliament Speech Corpus0
Full-text Error Correction for Chinese Speech Recognition with Large Language Model0
Fully Convolutional ASR for Less-Resourced Endangered Languages0
Fully Learnable Front-End for Multi-Channel Acoustic Modeling using Semi-Supervised Learning0
Fully Neural Network Based Speech Recognition on Mobile and Embedded Devices0
Cleanformer: A multichannel array configuration-invariant neural enhancement frontend for ASR in smart speakers0
Blank-regularized CTC for Frame Skipping in Neural Transducer0
An Investigative Study of Multi-Modal Cross-Lingual Retrieval0
Fusing ASR Outputs in Joint Training for Speech Emotion Recognition0
Efficiently Fusing Pretrained Acoustic and Linguistic Encoders for Low-resource Speech Recognition0
FusionFormer: Fusing Operations in Transformer for Efficient Streaming Speech Recognition0
Fusion Models for Improved Visual Captioning0
English Broadcast News Speech Recognition by Humans and Machines0
Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices0
Gated Recurrent Fusion with Joint Training Framework for Robust End-to-End Speech Recognition0
G-Augment: Searching for the Meta-Structure of Data Augmentation Policies for ASR0
A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition0
GEC-RAG: Improving Generative Error Correction via Retrieval-Augmented Generation for Automatic Speech Recognition Systems0
Improving sequence-to-sequence speech recognition training with on-the-fly data augmentation0
Gender Representation in French Broadcast Corpora and Its Impact on ASR Performance0
Guiding CTC Posterior Spike Timings for Improved Posterior Fusion and Knowledge Distillation0
Generating Human Readable Transcript for Automatic Speech Recognition with Pre-trained Language Model0
Generating Robust Audio Adversarial Examples using Iterative Proportional Clipping0
Generating Synthetic Audio Data for Attention-Based Speech Recognition Systems0
Generating Synthetic Clinical Speech Data through Simulated ASR Deletion Error0
Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language0
Show:102550
← PrevPage 27 of 64Next →

No leaderboard results yet.