FairLENS: Assessing Fairness in Law Enforcement Speech Recognition | May 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Non-autoregressive real-time Accent Conversion model with voice cloning | May 21, 2024 | Speech Enhancement, speech-recognition | Unverified | 0
Could a Computer Architect Understand our Brain? | May 21, 2024 | Descriptive, ERP | Unverified | 0
Continuous Sign Language Recognition with Adapted Conformer via Unsupervised Pretraining | May 20, 2024 | Sign Language Recognition, speech-recognition | Unverified | 0
Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models | May 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
No More Mumbles: Enhancing Robot Intelligibility through Speech Adaptation | May 15, 2024 | speech-recognition, Speech Recognition | Code Available | 0
Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings | May 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer | May 15, 2024 | Adversarial Attack, Automatic Speech Recognition | Unverified | 0
Investigating the 'Autoencoder Behavior' in Speech Self-Supervised Models: a focus on HuBERT's Pretraining | May 14, 2024 | Self-Supervised Learning, speech-recognition | Unverified | 0
Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | May 14, 2024 | Automatic Speech Recognition, Diversity | Unverified | 0
SpeechVerse: A Large-scale Generalizable Audio Language Model | May 14, 2024 | Automatic Speech Recognition, Benchmarking | Unverified | 0
Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases | May 13, 2024 | Audio Classification, Diagnostic | Code Available | 0
Large Language Models for Education: A Survey | May 12, 2024 | Autonomous Driving, speech-recognition | Unverified | 0
Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech | May 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
DP-DyLoRA: Fine-Tuning Transformer-Based Models On-Device under Differentially Private Federated Learning using Dynamic Low-Rank Adaptation | May 10, 2024 | Federated Learning, Natural Language Understanding | Unverified | 0
Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | May 9, 2024 | Audio-Visual Speech Recognition, Lipreading | Code Available | 0
Open Implementation and Study of BEST-RQ for Speech Processing | May 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Whispy: Adapting STT Whisper Models to Real-Time Environments | May 6, 2024 | Action Detection, Activity Detection | Unverified | 0
MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition | May 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Mixat: A Data Set of Bilingual Emirati-English Speech | May 4, 2024 | speech-recognition, Speech Recognition | Code Available | 0
Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition | May 3, 2024 | Active Learning, Automatic Speech Recognition | Unverified | 0
Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment | May 2, 2024 | GPU, NVIDIA Jetson Orin Nano | Code Available | 0
Efficient Compression of Multitask Multilingual Speech Models | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Low-resource speech recognition and dialect identification of Irish in a multi-task framework | May 2, 2024 | Decoder, Dialect Identification | Unverified | 0
Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Sequence-to-sequence models in peer-to-peer learning: A practical application | May 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Efficient Sample-Specific Encoder Perturbations | May 1, 2024 | Attribute, Decoder | Unverified | 0
Active Learning with Task Adaptation Pre-training for Speech Emotion Recognition | May 1, 2024 | Active Learning, Emotion Recognition | Code Available | 0
Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation | Apr 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Towards Dog Bark Decoding: Leveraging Human Speech Processing for Automated Bark Classification | Apr 29, 2024 | Classification, Gender Classification | Unverified | 0
A cost minimization approach to fix the vocabulary size in a tokenizer for an End-to-End ASR system | Apr 29, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Child Speech Recognition in Human-Robot Interaction: Problem Solved? | Apr 26, 2024 | GPU, speech-recognition | Unverified | 0
Developing Acoustic Models for Automatic Speech Recognition in Swedish | Apr 25, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Automatic Speech Recognition System-Independent Word Error Rate Estimation | Apr 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF | Apr 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices | Apr 24, 2024 | Automatic Speech Recognition, CPU | Unverified | 0
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Exploring neural oscillations during speech perception via surrogate gradient spiking neural networks | Apr 22, 2024 | speech-recognition, Speech Recognition | Code Available | 0
Semantically Corrected Amharic Automatic Speech Recognition | Apr 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Efficient infusion of self-supervised representations in Automatic Speech Recognition | Apr 19, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0
Learn2Talk: 3D Talking Face Learns from 2D Talking Face | Apr 19, 2024 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0
Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech | Apr 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Teaching a Multilingual Large Language Model to Understand Multilingual Speech via Multi-Instructional Training | Apr 16, 2024 | Language Modeling, Language Modelling | Code Available | 0
Anatomy of Industrial Scale Multilingual ASR | Apr 15, 2024 | Anatomy, Automatic Speech Recognition | Unverified | 0
Resilience of Large Language Models for Noisy Instructions | Apr 15, 2024 | Automatic Speech Recognition, Optical Character Recognition | Unverified | 0
Automatic Speech Recognition Advancements for Indigenous Languages of the Americas | Apr 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | Apr 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution | Apr 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0