Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers | Jan 22, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Using Large Language Model for End-to-End Chinese ASR and NER | Jan 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search | Jan 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition | Jan 19, 2024 | Language Modeling, Language Modelling | Unverified
SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | Jan 18, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified
Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Two-pass Endpoint Detection for Speech Recognition | Jan 17, 2024 | speech-recognition, Speech Recognition | Unverified
On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement | Jan 17, 2024 | Automatic Speech Recognition, Speech Enhancement | Unverified
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription | Jan 16, 2024 | Automatic Speech Recognition, Benchmarking | Unverified
Improving ASR Contextual Biasing with Guided Attention | Jan 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | Jan 16, 2024 | Action Detection, Activity Detection | Unverified
Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective | Jan 16, 2024 | Representation Learning, Self-Supervised Learning | Unverified
SeMaScore : a new evaluation metric for automatic speech recognition tasks | Jan 15, 2024 | Automatic Speech Recognition, speech-recognition | Unverified
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models | Jan 15, 2024 | Data Compression, image-classification | Code Available
Cascaded Cross-Modal Transformer for Audio-Textual Classification | Jan 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
Promptformer: Prompted Conformer Transducer for ASR | Jan 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization | Jan 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Transcending Controlled Environments: Assessing the Transferability of ASR-Robust NLU Models to Real-World Applications | Jan 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
XLS-R Deep Learning Model for Multilingual ASR on Low-Resource Languages: Indonesian, Javanese, and Sundanese | Jan 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints | Jan 12, 2024 | Decoder, Language Modeling | Unverified
End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 | Jan 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction | Jan 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Towards Online Continuous Sign Language Recognition and Translation | Jan 10, 2024 | Sign Language Recognition, speech-recognition | Unverified
Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification? | Jan 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Continuously Learning New Words in Automatic Speech Recognition | Jan 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
LUPET: Incorporating Hierarchical Information Path into Multilingual ASR | Jan 8, 2024 | Acoustic Unit Discovery, Automatic Speech Recognition | Unverified
Exploratory Evaluation of Speech Content Masking | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
High-precision Voice Search Query Correction via Retrievable Speech-text Embedings | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge | Jan 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified
Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available
Part-of-Speech Tagger for Bodo Language using Deep Learning approach | Jan 6, 2024 | Language Modeling, Language Modelling | Unverified
TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR | Jan 6, 2024 | Active Learning, Automatic Speech Recognition | Code Available
A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model | Jan 5, 2024 | Speech Enhancement, speech-recognition | Unverified
Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks | Jan 5, 2024 | In-Context Learning, intent-classification | Unverified
Nonlinear functional regression by functional deep neural network with kernel embedding | Jan 5, 2024 | Dimensionality Reduction, Image Classification | Unverified
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition | Jan 4, 2024 | Knowledge Distillation, speech-recognition | Unverified
Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Jan 4, 2024 | Attribute, Automatic Speech Recognition | Code Available
Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models | Jan 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
The Art of Deception: Robust Backdoor Attack using Dynamic Stacking of Triggers | Jan 3, 2024 | Backdoor Attack, speech-recognition | Unverified
ES3: Evolving Self-Supervised Learning of Robust Audio-Visual Speech Representations | Jan 1, 2024 | Audio-Visual Speech Recognition, Lipreading | Unverified
Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition | Dec 27, 2023 | Automatic Speech Recognition, Decoder | Unverified
Towards Probing Contact Center Large Language Models | Dec 26, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge | Dec 26, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified
Exploring data augmentation in bias mitigation against non-native-accented speech | Dec 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
BANSpEmo: A Bangla Emotional Speech Recognition Dataset | Dec 21, 2023 | speech-recognition, Speech Recognition | Unverified