A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Multilingual self-supervised speech representations improve the speech recognition of low-resource African languages with codeswitching | Nov 25, 2023 | Language Modeling, Language Modelling | Unverified | 0
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR | Nov 24, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Analysis of Visual Features for Continuous Lipreading in Spanish | Nov 21, 2023 | Lipreading, speech-recognition | Unverified | 0
Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish | Nov 21, 2023 | speech-recognition, Speech Recognition | Unverified | 0
Soft Random Sampling: A Theoretical and Empirical Analysis | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection | Nov 21, 2023 | image-classification, Image Classification | Code Available | 0
LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0
How does end-to-end speech recognition training impact speech enhancement artifacts? | Nov 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review | Nov 20, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems | Nov 20, 2023 | Autonomous Driving, Autonomous Vehicles | Unverified | 0
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
GhostVec: A New Threat to Speaker Privacy of End-to-End Speech Recognition System | Nov 17, 2023 | Decoder, Privacy Preserving | Unverified | 0
Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Investigating the Emergent Audio Classification Ability of ASR Foundation Models | Nov 15, 2023 | Audio Classification, Decoder | Code Available | 0
Retrieve and Copy: Scaling ASR Personalization to Large Catalogs | Nov 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Enhanced Generative Adversarial Networks for Unseen Word Generation from EEG Signals | Nov 14, 2023 | Brain Computer Interface, Data Augmentation | Unverified | 0
On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition | Nov 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
ChatGPT in the context of precision agriculture data analytics | Nov 10, 2023 | Language Modelling, speech-recognition | Code Available | 0
Whisper in Focus: Enhancing Stuttered Speech Classification with Encoder Layer Optimization | Nov 9, 2023 | speech-recognition, Speech Recognition | Unverified | 0
Towards End-to-End Spoken Grammatical Error Correction | Nov 9, 2023 | Grammatical Error Correction, speech-recognition | Unverified | 0
1SPU: 1-step Speech Processing Unit | Nov 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition | Nov 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Fine-tuning convergence model in Bengali speech recognition | Nov 7, 2023 | Automatic Speech Recognition, model | Unverified | 0
Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition | Nov 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants | Nov 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios | Oct 31, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Combining Language Models For Specialized Domains: A Colorful Approach | Oct 30, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0
MUST: A Multilingual Student-Teacher Learning approach for low-resource speech recognition | Oct 29, 2023 | Knowledge Distillation, speech-recognition | Unverified | 0
MixRep: Hidden Representation Mixup for Low-Resource Speech Recognition | Oct 27, 2023 | Data Augmentation, speech-recognition | Code Available | 0
Unified Segment-to-Segment Framework for Simultaneous Sequence Generation | Oct 27, 2023 | Machine Translation, Multi-Task Learning | Unverified | 0
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge | Oct 26, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0
Back Transcription as a Method for Evaluating Robustness of Natural Language Understanding Models to Speech Recognition Errors | Oct 25, 2023 | en-US domain classification, en-US Intent Classification | Code Available | 0
UniX-Encoder: A Universal X-Channel Speech Encoder for Ad-Hoc Microphone Array Speech Processing | Oct 25, 2023 | speaker-diarization, Speaker Diarization | Unverified | 0
DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages | Oct 25, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
Quantifying the Dialect Gap and its Correlates Across Languages | Oct 23, 2023 | Automatic Speech Recognition, Machine Translation | Unverified | 0
Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation | Oct 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate | Oct 23, 2023 | Computational Efficiency, Gesture Recognition | Code Available | 0
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Oct 23, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0
Modality Dropout for Multimodal Device Directed Speech Detection using Verbal and Non-Verbal Features | Oct 23, 2023 | Automatic Speech Recognition, Binary Classification | Unverified | 0
Conversational Speech Recognition by Learning Audio-textual Cross-modal Contextual Representation | Oct 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Intelligibility prediction with a pretrained noise-robust automatic speech recognition model | Oct 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Unintended Memorization in Large ASR Models, and How to Mitigate It | Oct 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System | Oct 18, 2023 | Automatic Speech Recognition, speaker-diarization | Unverified | 0
Multi-stage Large Language Model Correction for Speech Recognition | Oct 17, 2023 | Language Modeling, Language Modelling | Unverified | 0
Audio-AdapterFusion: A Task-ID-free Approach for Efficient and Non-Destructive Multi-task Speech Recognition | Oct 17, 2023 | speech-recognition, Speech Recognition | Unverified | 0
VoxArabica: A Robust Dialect-Aware Arabic Speech Recognition System | Oct 17, 2023 | Arabic Speech Recognition, Automatic Speech Recognition | Unverified | 0