SOTAVerified

Automatic Speech Recognition

Papers

Showing 15011550 of 3174 papers

TitleStatusHype
Improving Data Driven Inverse Text Normalization using Data Augmentation0
Improving Distinction between ASR Errors and Speech Disfluencies with Feature Space Interpolation0
Exploring RNN-Transducer for Chinese Speech Recognition0
Improving Dysarthric Speech Intelligibility Using Cycle-consistent Adversarial Training0
Improving EEG based Continuous Speech Recognition0
Improving End-to-End Bangla Speech Recognition with Semi-supervised Training0
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis0
Improving Fairness and Robustness in End-to-End Speech Recognition through unsupervised clustering0
Improving Fast-slow Encoder based Transducer with Streaming Deliberation0
Improving Frame-level Classifier for Word Timings with Non-peaky CTC in End-to-End Automatic Speech Recognition0
Exploring Pre-training with Alignments for RNN Transducer based End-to-End Speech Recognition0
Improving Hybrid CTC/Attention End-to-end Speech Recognition with Pretrained Acoustic and Language Model0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech0
Improving Language Model Adaptation using Automatic Data Selection and Neural Network0
Bypass Temporal Classification: Weakly Supervised Automatic Speech Recognition with Imperfect Transcripts0
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer0
Improving low-resource ASR performance with untranscribed out-of-domain data0
Improving Low Resource Code-switched ASR using Augmented Code-switched TTS0
Arabic Language WEKA-Based Dialect Classifier for Arabic Automatic Speech Recognition Transcripts0
Exploring Methods for the Automatic Detection of Errors in Manual Transcription0
Exploring Machine Speech Chain for Domain Adaptation and Few-Shot Speaker Adaptation0
BUT System for the MLC-SLM Challenge0
Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training0
Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model0
BUT Opensat 2019 Speech Recognition System0
Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking0
Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC0
Improving Multimodal Speech Recognition by Data Augmentation and Speech Representations0
Improving Named Entity Recognition in Spoken Dialog Systems by Context and Speech Pattern Modeling0
Improving Named Entity Transcription with Contextual LLM-based Revision0
Improving Neural Biasing for Contextual Speech Recognition by Early Context Injection and Text Perturbation0
Improving noise robust automatic speech recognition with single-channel time-domain enhancement network0
Improving Noise Robustness of an End-to-End Neural Model for Automatic Speech Recognition0
Improving noise robustness of automatic speech recognition via parallel data and teacher-student learning0
Adversarial Joint Training with Self-Attention Mechanism for Robust End-to-End Speech Recognition0
Improving Noisy Student Training on Non-target Domain Data for Automatic Speech Recognition0
Improving non-autoregressive end-to-end speech recognition with pre-trained acoustic and language models0
Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios0
Exploring Gender Disparities in Automatic Speech Recognition Technology0
Improving Punctuation Restoration for Speech Transcripts via External Data0
Exploring Effective Distillation of Self-Supervised Speech Models for Automatic Speech Recognition0
Improving Readability for Automatic Speech Recognition Transcription0
Building state-of-the-art distant speech recognition using the CHiME-4 challenge with a setup of speech enhancement baseline0
Improving RNN-T ASR Performance with Date-Time and Location Awareness0
Exploring data augmentation in bias mitigation against non-native-accented speech0
Exploring CTC Based End-to-End Techniques for Myanmar Speech Recognition0
Improving RNN transducer with normalized jointer network0
Improving Robustness of Neural Inverse Text Normalization via Data-Augmentation, Semi-Supervised Learning, and Post-Aligning Method0
Improving Scheduled Sampling for Neural Transducer-based ASR0
Building Robust Spoken Language Understanding by Cross Attention between Phoneme Sequence and ASR Hypothesis0
Show:102550
← PrevPage 31 of 64Next →

No leaderboard results yet.