Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping Apr 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0An inclusive review on deep learning techniques and their scope in handwriting recognition Apr 10, 2024 Deep Learning Handwriting Recognition
— Unverified 0The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge Apr 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0VietMed: A Dataset and Benchmark for Automatic Speech Recognition of Vietnamese in the Medical Domain Apr 8, 2024 Language Modelling Speech Recognition
— Unverified 0Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition Apr 4, 2024 Automatic Speech Recognition Decoder
— Unverified 0Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian Apr 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Noise Masking Attacks and Defenses for Pretrained Speech Models Apr 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transfer Learning from Whisper for Microscopic Intelligibility Prediction Apr 2, 2024 Automatic Speech Recognition Deep Learning
— Unverified 0Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models Mar 31, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models Mar 29, 2024 Automatic Speech Recognition speech-recognition
Code Code Available 0Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition Mar 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LV-CTC: Non-autoregressive ASR with CTC and latent variable models Mar 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus Mar 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Extracting Biomedical Entities from Noisy Audio Transcripts Mar 26, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition Mar 26, 2024 Automatic Speech Recognition Language Modelling
— Unverified 0Grammatical vs Spelling Error Correction: An Investigation into the Responsiveness of Transformer-based Language Models using BART and MarianMT Mar 25, 2024 Optical Character Recognition (OCR) speech-recognition
— Unverified 0Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models Mar 25, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Privacy-Preserving End-to-End Spoken Language Understanding Mar 22, 2024 Privacy Preserving speech-recognition
— Unverified 0M^3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset Mar 21, 2024 Diversity Script Generation
— Unverified 0A Multimodal Approach to Device-Directed Speech Detection with Large Language Models Mar 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception Mar 21, 2024 Audio-Visual Speech Recognition Representation Learning
— Unverified 0BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech Mar 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Open Access NAO (OAN): a ROS2-based software framework for HRI applications with the NAO robot Mar 20, 2024 speech-recognition Speech Recognition
— Unverified 0Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning Mar 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition Mar 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Artificial Intelligence for Cochlear Implants: Review of Strategies, Challenges, and Perspectives Mar 17, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Energy-Based Models with Applications to Speech and Language Processing Mar 16, 2024 Language Modeling Language Modelling
— Unverified 0Initial Decoding with Minimally Augmented Language Model for Improved Lattice Rescoring in Low Resource ASR Mar 16, 2024 Language Modeling Language Modelling
— Unverified 0Hearing-Loss Compensation Using Deep Neural Networks: A Framework and Results From a Listening Test Mar 15, 2024 Music Classification Speaker Identification
— Unverified 0More than words: Advancements and challenges in speech recognition for singing Mar 14, 2024 Keyword Spotting Language Identification
— Unverified 0SpokeN-100: A Cross-Lingual Benchmarking Dataset for The Classification of Spoken Numbers in Different Languages Mar 14, 2024 Benchmarking Dimensionality Reduction
Code Code Available 0Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer Mar 14, 2024 Audio-Visual Speech Recognition Robust Speech Recognition
— Unverified 0Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children Mar 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition Mar 13, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language Mar 12, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Beyond the Labels: Unveiling Text-Dependency in Paralinguistic Speech Recognition Datasets Mar 12, 2024 speech-recognition Speech Recognition
— Unverified 0The evaluation of a code-switched Sepedi-English automatic speech recognition system Mar 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations Mar 10, 2024 Automatic Speech Recognition Data Augmentation
Code Code Available 0Aligning Speech to Languages to Enhance Code-switching Speech Recognition Mar 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition Mar 7, 2024 Audio-Visual Speech Recognition Knowledge Distillation
Code Code Available 0Classist Tools: Social Class Correlates with Performance in NLP Mar 7, 2024 Automatic Speech Recognition Language Modelling
— Unverified 0A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain Mar 7, 2024 Arabic Speech Recognition Automatic Speech Recognition
— Unverified 0RADIA -- Radio Advertisement Detection with Intelligent Analytics Mar 6, 2024 Marketing speech-recognition
— Unverified 0Non-verbal information in spontaneous speech -- towards a new framework of analysis Mar 6, 2024 speech-recognition Speech Recognition
— Unverified 0AIx Speed: Playback Speed Optimization Using Listening Comprehension of Speech Recognition Models Mar 5, 2024 speech-recognition Speech Recognition
— Unverified 0What has LeBenchmark Learnt about French Syntax? Mar 4, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition Mar 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement Mar 3, 2024 Automatic Speech Recognition Keyword Spotting
— Unverified 0Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview Mar 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0