Tiny-Align: Bridging Automatic Speech Recognition and Large Language Model on the Edge Nov 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Hard-Synth: Synthesizing Diverse Hard Samples for ASR using Zero-Shot TTS and LLM Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CAFE A Novel Code switching Dataset for Algerian Dialect French and English Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications Nov 20, 2024 Emotion Recognition Speaker Identification
— Unverified 0From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language Nov 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Whisper Finetuning on Nepali Language Nov 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Novel Speech Analysis and Correction Tool for Arabic-Speaking Children Nov 18, 2024 Diagnostic speech-recognition
— Unverified 0Inter-linguistic Phonetic Composition (IPC): A Theoretical and Computational Approach to Enhance Second Language Pronunciation Nov 17, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0BanglaDialecto: An End-to-End AI-Powered Regional Speech Standardization Nov 16, 2024 Machine Translation speech-recognition
Code Code Available 0Systolic Arrays and Structured Pruning Co-design for Efficient Transformers in Edge Systems Nov 15, 2024 Machine Translation Quantization
— Unverified 0DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization Nov 15, 2024 DeepFake Detection Face Swapping
Code Code Available 0Everyone deserves their voice to be heard: Analyzing Predictive Gender Bias in ASR Models Applied to Dutch Speech Data Nov 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transferable Adversarial Attacks against ASR Nov 14, 2024 Action Detection Activity Detection
— Unverified 0DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions Nov 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CTC-Assisted LLM-Based Contextual ASR Nov 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages Nov 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SPES: Spectrogram Perturbation for Explainable Speech-to-Text Generation Nov 3, 2024 speech-recognition Speech Recognition
— Unverified 0Optimizing Contextual Speech Recognition Using Vector Quantization for Efficient Retrieval Nov 1, 2024 Quantization Retrieval
— Unverified 0Enhancing AAC Software for Dysarthric Speakers in e-Health Settings: An Evaluation Using TORGO Nov 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech is More Than Words: Do Speech-to-Text Translation Systems Leverage Prosody? Oct 31, 2024 Rhythm speech-recognition
— Unverified 0Augmenting Polish Automatic Speech Recognition System With Synthetic Data Oct 30, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising Oct 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Joint Beamforming and Speaker-Attributed ASR for Real Distant-Microphone Meeting Transcription Oct 29, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Multilingual Standalone Trustworthy Voice-Based Social Network for Disaster Situations Oct 28, 2024 speech-recognition Speech Recognition
— Unverified 0Asynchronous Tool Usage for Real-Time Agents Oct 28, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs Oct 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Contextual Biasing to Improve Domain-specific Custom Vocabulary Audio Transcription without Explicit Fine-Tuning of Whisper Model Oct 24, 2024 speech-recognition Speech Recognition
— Unverified 0A Survey on Speech Large Language Models Oct 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluating and Improving Automatic Speech Recognition Systems for Korean Meteorological Experts Oct 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0kNN For Whisper And Its Effect On Bias And Speaker Adaptation Oct 24, 2024 Machine Translation speech-recognition
— Unverified 0ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams Oct 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DENOASR: Debiasing ASRs through Selective Denoising Oct 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap Oct 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Automatic Speech Recognition with Decoder-Centric Regularisation in Encoder-Decoder Models Oct 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation Oct 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding Oct 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-End Transformer-based Automatic Speech Recognition for Northern Kurdish: A Pioneering Approach Oct 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup Oct 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Roadmap towards Superhuman Speech Understanding using Large Language Models Oct 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR Oct 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation Oct 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Computational Approaches to Arabic-English Code-Switching Oct 17, 2024 Data Augmentation Language Identification
— Unverified 0Investigation of Speaker Representation for Target-Speaker Speech Processing Oct 15, 2024 Action Detection Activity Detection
— Unverified 0A Framework for Adapting Human-Robot Interaction to Diverse User Groups Oct 15, 2024 Action Detection Activity Detection
Code Code Available 0Character-aware audio-visual subtitling in context Oct 14, 2024 Language Modelling Large Language Model
— Unverified 0In-Materia Speech Recognition Oct 14, 2024 Autonomous Driving speech-recognition
— Unverified 0State of NLP in Kenya: A Survey Oct 13, 2024 Information Retrieval Machine Translation
— Unverified 0Automatic Speech Recognition with BERT and CTC Transformers: A Review Oct 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs Oct 12, 2024 AudioCaps Audio captioning
— Unverified 0UniGlyph: A Seven-Segment Script for Universal Language Representation Oct 11, 2024 Diversity speech-recognition
— Unverified 0