Title | Date | Tasks | Code status
Text-Based Detection of On-Hold Scripts in Contact Center Calls | Jul 13, 2024 | Automatic Speech Recognition, speech-recognition | Code Available
Tamil Language Computing: the Present and the Future | Jul 11, 2024 | Language Modelling, Machine Translation | Unverified
Explaining Spectrograms in Machine Learning: A Study on Neural Networks for Speech Classification | Jul 10, 2024 | Classification, speech-recognition | Code Available
Dynamic Encoder Size Based on Data-Driven Layer-wise Pruning for Speech Recognition | Jul 10, 2024 | speech-recognition, Speech Recognition | Unverified
Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks | Jul 10, 2024 | Language Modeling, Language Modelling | Unverified
HebDB: a Weakly Supervised Dataset for Hebrew Speech Processing | Jul 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
A voice and speech corpus of patients who underwent upper airway surgery in pre- and post-operative states | Jul 9, 2024 | Articles, Classification | Code Available
Analyzing Speech Unit Selection for Textless Speech-to-Speech Translation | Jul 8, 2024 | Automatic Speech Recognition, Emotion Recognition | Unverified
Homogeneous Speaker Features for On-the-Fly Dysarthric and Elderly Speaker Adaptation | Jul 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Morse Code-Enabled Speech Recognition for Individuals with Visual and Hearing Impairments | Jul 7, 2024 | speech-recognition, Speech Recognition | Unverified
Multitaper mel-spectrograms for keyword spotting | Jul 5, 2024 | Keyword Spotting, speech-recognition | Unverified
LearnerVoice: A Dataset of Non-Native English Learners' Spontaneous Speech | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Speculative Speech Recognition by Audio-Prefixed Low-Rank Adaptation of Language Models | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Romanization Encoding For Multilingual ASR | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Written Term Detection Improves Spoken Term Detection | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
Performance Analysis of Speech Encoders for Low-Resource SLU and ASR in Tunisian Dialect | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
XLSR-Transducer: Streaming ASR for Self-Supervised Pretrained Models | Jul 5, 2024 | Automatic Speech Recognition, speech-recognition | Unverified
Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition | Jul 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis | Jul 4, 2024 | Accented Speech Recognition, Automatic Speech Recognition | Unverified
Serialized Output Training by Learned Dominance | Jul 4, 2024 | Decoder, speech-recognition | Unverified
Multi-Convformer: Extending Conformer with Multiple Convolution Kernels | Jul 4, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Finetuning End-to-End Models for Estonian Conversational Spoken Language Translation | Jul 4, 2024 | Machine Translation, speech-recognition | Unverified
Qifusion-Net: Layer-adapted Stream/Non-stream Model for End-to-End Multi-Accent Speech Recognition | Jul 3, 2024 | speech-recognition, Speech Recognition | Unverified
Self-supervised ASR Models and Features For Dysarthric and Elderly Speech Recognition | Jul 3, 2024 | Alzheimer's Disease Detection, Self-Supervised Learning | Unverified
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations | Jul 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Advanced Framework for Animal Sound Classification With Features Optimization | Jul 3, 2024 | Classification, Diversity | Unverified
The USTC-NERCSLIP Systems for The ICMC-ASR Challenge | Jul 2, 2024 | Automatic Speech Recognition, Pseudo Label | Unverified
Towards the Next Frontier in Speech Representation Learning Using Disentanglement | Jul 2, 2024 | Disentanglement, Representation Learning | Unverified
Cross-Lingual Transfer Learning for Speech Translation | Jul 1, 2024 | Cross-Lingual Transfer, Decoder | Unverified
Toward Automated Detection of Biased Social Signals from the Content of Clinical Conversations | Jul 1, 2024 | Fairness, speech-recognition | Unverified
Less Forgetting for Better Generalization: Exploring Continual-learning Fine-tuning Methods for Speech Self-supervised Representations | Jun 30, 2024 | Continual Learning, Domain Generalization | Unverified
Open-Source Conversational AI with SpeechBrain 1.0 | Jun 29, 2024 | Language Modeling, Language Modelling | Unverified
Error Correction by Paying Attention to Both Acoustic and Confidence References for Automatic Speech Recognition | Jun 29, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data | Jun 28, 2024 | Decoder, Machine Translation | Unverified
Enhanced ASR Robustness to Packet Loss with a Front-End Adaptation Network | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
Tradition or Innovation: A Comparison of Modern ASR Methods for Forced Alignment | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Applying LLMs for Rescoring N-best ASR Hypotheses of Casual Conversations: Effects of Domain Adaptation and Context Carry-over | Jun 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects | Jun 27, 2024 | Automatic Speech Recognition, Machine Translation | Code Available
Automatic Speech Recognition for Hindi | Jun 26, 2024 | Action Detection, Activity Detection | Unverified
MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
SC-MoE: Switch Conformer Mixture of Experts for Unified Streaming and Non-streaming Code-Switching ASR | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Dynamic Data Pruning for Automatic Speech Recognition | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Sequential Editing for Lifelong Training of Speech Recognition Models | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization | Jun 25, 2024 | Audio-Visual Speech Recognition, speech-recognition | Unverified
FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
A Comprehensive Solution to Connect Speech Encoder and Large Language Model for ASR | Jun 25, 2024 | Language Modeling, Language Modelling | Unverified
Investigating Confidence Estimation Measures for Speaker Diarization | Jun 24, 2024 | speaker-diarization, Speaker Diarization | Unverified
Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | Jun 24, 2024 | Action Detection, Activity Detection | Unverified
Decoder-only Architecture for Streaming End-to-end Speech Recognition | Jun 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified