SOTAVerified

Automatic Speech Recognition

Papers

Showing 14011450 of 3174 papers

TitleStatusHype
Highland Puebla Nahuatl Speech Translation Corpus for Endangered Language Documentation0
High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR0
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings0
Hindi-English Code-Switching Speech Corpus0
homeService: Voice-enabled assistive technology in the home using cloud-based automatic speech recognition0
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation0
Exploring the Role of Audio in Video Captioning0
Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models0
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation0
Are disentangled representations all you need to build speaker anonymization systems?0
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR0
How does end-to-end speech recognition training impact speech enhancement artifacts?0
Adversarial Speaker Adaptation0
How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena0
How Might We Create Better Benchmarks for Speech Recognition?0
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study0
Exploring the Integration of E2E ASR and Pronunciation Modeling for English Mispronunciation Detection0
How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not0
How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario0
How to Scale Up Kernel Methods to Be As Good As Deep Neural Nets0
Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down0
Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages0
HTEC: Human Transcription Error Correction0
Exploring the Effect of Dialect Mismatched Language Models in Telugu Automatic Speech Recognition0
Calibration of Phone Likelihoods in Automatic Speech Recognition0
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning0
A Recorded Debating Dataset0
Human-Informed Speakers and Interpreters Analysis in the WAW Corpus and an Automatic Method for Calculating Interpreters' D\'ecalage0
Exploring Textual and Speech information in Dialogue Act Classification with Speaker Domain Adaptation0
Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models0
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru forSpeech Recognition0
Exploring SSL Discrete Tokens for Multilingual ASR0
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study0
Hybrid Autoregressive Transducer (hat)0
Hybrid CTC-Attention based End-to-End Speech Recognition using Subword Units0
CAFE A Novel Code switching Dataset for Algerian Dialect French and English0
Hybridized Feature Extraction and Acoustic Modelling Approach for Dysarthric Speech Recognition0
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition0
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy0
Exploring Speech Enhancement with Generative Adversarial Networks for Robust Speech Recognition0
Exploring Speech Enhancement for Low-resource Speech Synthesis0
Byte Pair Encoding Is All You Need For Automatic Bengali Speech Recognition0
Exploring RNN-Transducer for Chinese Speech Recognition0
Hypothesis Stitcher for End-to-End Speaker-attributed ASR on Long-form Multi-talker Recordings0
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition0
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts0
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge0
Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text0
Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts0
Show:102550
← PrevPage 29 of 64Next →

No leaderboard results yet.