MF-AED-AEC: Speech Emotion Recognition by Leveraging Multimodal Fusion, Asr Error Detection, and Asr Error Correction Jan 24, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CNN architecture extraction on edge GPU Jan 24, 2024 GPU image-classification
— Unverified 0SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering Jan 24, 2024 Passage Retrieval Question Answering
— Unverified 0Locality enhanced dynamic biasing and sampling strategies for contextual ASR Jan 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multilingual and Fully Non-Autoregressive ASR with Large Language Model Fusion: A Comprehensive Study Jan 23, 2024 Language Modeling Language Modelling
— Unverified 0Consistency Based Unsupervised Self-training For ASR Personalisation Jan 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Keep Decoding Parallel with Effective Knowledge Distillation from Language Models to End-to-end Speech Recognisers Jan 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Using Large Language Model for End-to-End Chinese ASR and NER Jan 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric Jan 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Contextualized Automatic Speech Recognition with Attention-Based Bias Phrase Boosted Beam Search Jan 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition Jan 19, 2024 Language Modeling Language Modelling
— Unverified 0Large Language Models are Efficient Learners of Noise-Robust Speech Recognition Jan 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks Jan 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition Jan 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Efficient Training for Multilingual Visual Speech Recognition: Pre-training with Discretized Visual Speech Representation Jan 18, 2024 Sentence speech-recognition
Code Code Available 1SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition Jan 18, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement Jan 17, 2024 Automatic Speech Recognition Speech Enhancement
— Unverified 0Two-pass Endpoint Detection for Speech Recognition Jan 17, 2024 speech-recognition Speech Recognition
— Unverified 0Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization Jan 16, 2024 Action Detection Activity Detection
— Unverified 0Improving ASR Contextual Biasing with Guided Attention Jan 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Revisiting Self-supervised Learning of Speech Representation from a Mutual Information Perspective Jan 16, 2024 Representation Learning Self-Supervised Learning
— Unverified 0NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription Jan 16, 2024 Automatic Speech Recognition Benchmarking
— Unverified 0SeMaScore : a new evaluation metric for automatic speech recognition tasks Jan 15, 2024 Automatic Speech Recognition speech-recognition
— Unverified 0Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models Jan 15, 2024 Data Compression image-classification
Code Code Available 0Cascaded Cross-Modal Transformer for Audio-Textual Classification Jan 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Promptformer: Prompted Conformer Transducer for ASR Jan 14, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization Jan 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Transcending Controlled Environments Assessing the Transferability of ASRRobust NLU Models to Real-World Applications Jan 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0XLS-R Deep Learning Model for Multilingual ASR on Low- Resource Languages: Indonesian, Javanese, and Sundanese Jan 12, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dynamic Behaviour of Connectionist Speech Recognition with Strong Latency Constraints Jan 12, 2024 Decoder Language Modeling
— Unverified 0End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 Jan 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction Jan 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification? Jan 10, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Online Continuous Sign Language Recognition and Translation Jan 10, 2024 Sign Language Recognition speech-recognition
— Unverified 0Continuously Learning New Words in Automatic Speech Recognition Jan 9, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0LUPET: Incorporating Hierarchical Information Path into Multilingual ASR Jan 8, 2024 Acoustic Unit Discovery Automatic Speech Recognition
— Unverified 0High-precision Voice Search Query Correction via Retrievable Speech-text Embedings Jan 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Cross-Speaker Encoding Network for Multi-Talker Speech Recognition Jan 8, 2024 Decoder speech-recognition
Code Code Available 1BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators Jan 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploratory Evaluation of Speech Content Masking Jan 8, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0The NPU-ASLP-LiAuto System Description for Visual Speech Recognition in CNVSRC 2023 Jan 7, 2024 Decoder speech-recognition
Code Code Available 1Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation Jan 7, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 0ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge Jan 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DiarizationLM: Speaker Diarization Post-Processing with Large Language Models Jan 7, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 3MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition Jan 7, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Part-of-Speech Tagger for Bodo Language using Deep Learning approach Jan 6, 2024 Language Modeling Language Modelling
— Unverified 0TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR Jan 6, 2024 Active Learning Automatic Speech Recognition
Code Code Available 0A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model Jan 5, 2024 Speech Enhancement speech-recognition
— Unverified 0Nonlinear functional regression by functional deep neural network with kernel embedding Jan 5, 2024 Dimensionality Reduction Image Classification
— Unverified 0Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks Jan 5, 2024 In-Context Learning intent-classification
— Unverified 0