Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities Oct 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models Oct 10, 2024 speech-recognition Speech Recognition
— Unverified 0A two-stage transliteration approach to improve performance of a multilingual ASR Oct 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advocating Character Error Rate for Multilingual ASR Evaluation Oct 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The USTC-NERCSLIP Systems for the CHiME-8 MMCSG Challenge Oct 8, 2024 speech-recognition Speech Recognition
— Unverified 0Incorporating Talker Identity Aids With Improving Speech Recognition in Adversarial Environments Oct 7, 2024 Speaker Identification speech-recognition
— Unverified 0Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges Oct 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CR-CTC: Consistency regularization on CTC for improved speech recognition Oct 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Punctuation Prediction for Polish Texts using Transformers Oct 6, 2024 Prediction Reading Comprehension
— Unverified 0Casablanca: Data and Models for Multidialectal Arabic Speech Recognition Oct 6, 2024 Arabic Speech Recognition speech-recognition
— Unverified 0The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities Oct 5, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The OCON model: an old but gold solution for distributable supervised classification Oct 5, 2024 Automatic Speech Recognition Classification
Code Code Available 0Enhancement of Dysarthric Speech Reconstruction by Contrastive Learning Oct 5, 2024 Contrastive Learning speech-recognition
— Unverified 0Reverb: Open-Source ASR and Diarization from Rev Oct 4, 2024 speech-recognition Speech Recognition
— Unverified 0Self-Powered LLM Modality Expansion for Large Speech-Text Models Oct 4, 2024 Automatic Speech Recognition Instruction Following
Code Code Available 0Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges Oct 4, 2024 Dialect Identification Diversity
Code Code Available 0Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques Oct 4, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR Oct 3, 2024 speech-recognition Speech Recognition
— Unverified 0Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems Oct 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings Oct 3, 2024 speech-recognition Speech Recognition
— Unverified 0Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition Oct 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Spoken Grammar Assessment Using LLM Oct 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Streaming LLM for Speech Recognition Oct 2, 2024 Decoder speech-recognition
— Unverified 0Automatic Speech Recognition for the Ika Language Oct 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-End Speech Recognition with Pre-trained Masked Language Model Oct 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Alignment-Free Training for Transducer-based Multi-Talker ASR Sep 30, 2024 All Automatic Speech Recognition
— Unverified 0Predictive Speech Recognition and End-of-Utterance Detection Towards Spoken Dialog Systems Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AfriHuBERT: A self-supervised speech representation model for African languages Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Boosting Hybrid Autoregressive Transducer-based ASR with Internal Acoustic Model Training and Dual Blank Thresholding Sep 30, 2024 speech-recognition Speech Recognition
— Unverified 0Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective Sep 29, 2024 Audio-Visual Speech Recognition Lip Reading
— Unverified 0Fine-Tuning Automatic Speech Recognition for People with Parkinson's: An Effective Strategy for Enhancing Speech Technology Accessibility Sep 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CoT-ST: Enhancing LLM-based Speech Translation with Multimodal Chain-of-Thought Sep 29, 2024 speech-recognition Speech Recognition
— Unverified 0Efficient Long-Form Speech Recognition for General Speech In-Context Learning Sep 29, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Advanced Clustering Techniques for Speech Signal Enhancement: A Review and Metanalysis of Fuzzy C-Means, K-Means, and Kernel Fuzzy C-Means Methods Sep 28, 2024 Clustering Speech Enhancement
— Unverified 0Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking Sep 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models Sep 27, 2024 Automatic Speech Recognition Mamba
— Unverified 0A GEN AI Framework for Medical Note Generation Sep 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unveiling the Role of Pretraining in Direct Speech Translation Sep 26, 2024 Automatic Speech Recognition Decoder
— Unverified 0Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition Sep 26, 2024 Decoder Robust Speech Recognition
— Unverified 0Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study Sep 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Deep CLAS: Deep Contextual Listen, Attend and Spell Sep 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events Sep 25, 2024 Audio Tagging Automatic Speech Recognition
— Unverified 0Speech Recognition Rescoring with Large Speech-Text Foundation Models Sep 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not Sep 25, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling Sep 25, 2024 Automatic Speech Recognition Emotion Recognition
Code Code Available 0Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition Sep 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Hypothesis Clustering and Merging: Novel MultiTalker Speech Recognition with Speaker Tokens Sep 24, 2024 Clustering Decoder
— Unverified 0Revisiting Acoustic Features for Robust ASR Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM Sep 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0