Trading Devil RL: Backdoor attack via Stock market, Bayesian Optimization and Reinforcement Learning Dec 23, 2024 Backdoor Attack Bayesian Optimization
— Unverified 0Deep Learning in Proteomics Informatics: Applications, Challenges, and Future Directions Dec 23, 2024 Deep Learning speech-recognition
— Unverified 0Investigating Prosodic Signatures via Speech Pre-Trained Models for Audio Deepfake Source Attribution Dec 23, 2024 Audio Deepfake Detection DeepFake Detection
— Unverified 0UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition Dec 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Uncovering the Visual Contribution in Audio-Visual Speech Recognition Dec 22, 2024 Audio-Visual Speech Recognition Informativeness
— Unverified 0Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Retrieval-Augmented Generation without Automatic Speech Recognition Dec 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration Dec 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MERaLiON-SpeechEncoder: Towards a Speech Foundation Model for Singapore and Beyond Dec 16, 2024 Language Modeling Language Modelling
— Unverified 0Speak & Improve Challenge 2025: Tasks and Baseline Systems Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback Dec 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition Dec 15, 2024 Automatic Speech Recognition Domain Adaptation
— Unverified 0Robust Persian Digit Recognition in Noisy Environments Using Hybrid CNN-BiGRU Model Dec 14, 2024 speech-recognition Speech Recognition
— Unverified 0Efficient Adaptation of Multilingual Models for Japanese ASR Dec 14, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0MERaLiON-AudioLLM: Bridging Audio and Language with Large Language Models Dec 13, 2024 speech-recognition Speech Recognition
— Unverified 0Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation Dec 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition Dec 11, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Style-agnostic evaluation of ASR using multiple reference transcripts Dec 10, 2024 speech-recognition Speech Recognition
— Unverified 0Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Ensemble Machine Learning Model for Inner Speech Recognition: A Subject-Specific Investigation Dec 9, 2024 EEG feature selection
— Unverified 0Not All Errors Are Equal: Investigation of Speech Recognition Errors in Alzheimer's Disease Detection Dec 9, 2024 All Alzheimer's Disease Detection
— Unverified 0Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection Dec 9, 2024 Alzheimer's Disease Detection Automatic Speech Recognition
— Unverified 0Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects Dec 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SQ-Whisper: Speaker-Querying based Whisper Model for Target-Speaker ASR Dec 7, 2024 Automatic Speech Recognition Data Augmentation
Code Code Available 0Adaptive Dropout for Pruning Conformers Dec 6, 2024 speech-recognition Speech Recognition
— Unverified 0Comprehensive Audio Query Handling System with Integrated Expert Models and Contextual Understanding Dec 5, 2024 Audio Generation Automatic Speech Recognition
— Unverified 0Speech Recognition-based Feature Extraction for Enhanced Automatic Severity Classification in Dysarthric Speech Dec 5, 2024 severity prediction speech-recognition
— Unverified 0ASR-EC Benchmark: Evaluating Large Language Models on Chinese ASR Error Correction Dec 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automating Feedback Analysis in Surgical Training: Detection, Categorization, and Assessment Dec 1, 2024 Action Detection Activity Detection
Code Code Available 0Late fusion ensembles for speech recognition on diverse input audio representations Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario Dec 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Empowering the Deaf and Hard of Hearing Community: Enhancing Video Captions Using Large Language Models Nov 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ArEEG_Words: Dataset for Envisioned Speech Recognition using EEG for Arabic Words Nov 28, 2024 Brain Computer Interface EEG
— Unverified 0How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models Nov 27, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0AMPS: ASR with Multimodal Paraphrase Supervision Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Aligning Pre-trained Models for Spoken Language Translation Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Continual Learning in Machine Speech Chain Using Gradient Episodic Memory Nov 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation Nov 27, 2024 Question Answering Speech Enhancement
— Unverified 0Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Maximum Likelihood Training for Transducer-based Streaming Speech Recognition Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation Nov 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0High-precision medical speech recognition through synthetic data and semantic correction: UNITED-MEDASR Nov 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TSkips: Efficiency Through Explicit Temporal Delay Connections in Spiking Neural Networks Nov 22, 2024 Gesture Recognition Hand Gesture Recognition
— Unverified 0Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering Nov 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0