Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Distributed Learning of Deep Neural Networks using Independent Subnet Training Oct 4, 2019 BIG-bench Machine Learning Image Classification
Code Code Available 05 Discrete Speech Unit Extraction via Independent Component Analysis Jan 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Enhancing Quantised End-to-End ASR Models via Personalisation Sep 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Discovering Phonetic Inventories with Crosslingual Automatic Speech Recognition Jan 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 An Online Multilingual Hate speech Recognition System Nov 23, 2020 Hate Speech Detection speech-recognition
Code Code Available 05 Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation Oct 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Disentangling Speech and Non-Speech Components for Building Robust Acoustic Models from Found Data Sep 25, 2019 speech-recognition Speech Recognition
Code Code Available 05 BERT Attends the Conversation: Improving Low-Resource Conversational ASR Oct 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 DiMoDif: Discourse Modality-information Differentiation for Audio-visual Deepfake Detection and Localization Nov 15, 2024 DeepFake Detection Face Swapping
Code Code Available 05 Direct Segmentation Models for Streaming Speech Translation Nov 1, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks May 5, 2023 Automatic Speech Recognition Cultural Vocal Bursts Intensity Prediction
Code Code Available 05 DISCO: A Large Scale Human Annotated Corpus for Disfluency Correction in Indo-European Languages Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Do Prompts Really Prompt? Exploring the Prompt Understanding Capability of Whisper Jun 9, 2024 speech-recognition Speech Recognition
Code Code Available 05 Did you hear that? Adversarial Examples Against Automatic Speech Recognition Jan 2, 2018 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Calibrated Structured Prediction Dec 1, 2015 Medical Diagnosis Optical Character Recognition
Code Code Available 05 DEVI: Open-source Human-Robot Interface for Interactive Receptionist Systems Jan 2, 2021 Face Recognition Self-Learning
Code Code Available 05 Detecting and Defending Against Adversarial Attacks on Automatic Speech Recognition via Diffusion Models Sep 12, 2024 Adversarial Attack Adversarial Purification
Code Code Available 05 Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners Apr 8, 2022 Prediction Speech Enhancement
Code Code Available 05 DiaCorrect: End-to-end error correction for speaker diarization Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Dementia Assessment Using Mandarin Speech with an Attention-based Speech Recognition Encoder Oct 6, 2023 Alzheimer's Disease Detection speech-recognition
Code Code Available 05 Exploring spectro-temporal features in end-to-end convolutional neural networks Jan 1, 2019 speech-recognition Speech Recognition
Code Code Available 05 Detecting Adversarial Examples for Speech Recognition via Uncertainty Quantification May 24, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Differentiable Allophone Graphs for Language-Universal Speech Recognition Jul 24, 2021 speech-recognition Speech Recognition
Code Code Available 05 Deep word embeddings for visual speech recognition Oct 30, 2017 Lipreading speech-recognition
Code Code Available 05 Delayed Memory Unit: Modelling Temporal Dependency Through Delay Gate Oct 23, 2023 Computational Efficiency Gesture Recognition
Code Code Available 05 Deep Spiking Neural Networks for Large Vocabulary Automatic Speech Recognition Nov 19, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment May 2, 2024 GPU NVIDIA Jetson Orin Nano
Code Code Available 05 Boosting Cross-Domain Speech Recognition with Self-Supervision Jun 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Deep Learning using Linear Support Vector Machines Jun 2, 2013 Deep Learning General Classification
Code Code Available 05 Multi-Stage Speaker Diarization for Noisy Classrooms May 16, 2025 Action Detection Activity Detection
Code Code Available 05 Deep Learning for Audio Signal Processing Apr 30, 2019 Audio Signal Processing Automatic Speech Recognition
Code Code Available 05 DELTA: A DEep learning based Language Technology plAtform Aug 2, 2019 Abstractive Text Summarization Deep Learning
Code Code Available 05 End-to-End Speech Recognition With Joint Dereverberation Of Sub-Band Autoregressive Envelopes Aug 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 DeepEMO: Deep Learning for Speech Emotion Recognition Sep 9, 2021 Deep Learning Emotion Recognition
Code Code Available 05 DeepCover: Advancing RNN Test Coverage and Online Error Prediction using State Machine Extraction Feb 10, 2024 Decision Making speech-recognition
Code Code Available 05 Deep-FSMN for Large Vocabulary Continuous Speech Recognition Mar 4, 2018 Language Modeling Language Modelling
Code Code Available 05 Deep convolutional acoustic word embeddings using word-pair side information Oct 5, 2015 speech-recognition Speech Recognition
Code Code Available 05 DeepGD: A Multi-Objective Black-Box Test Selection Approach for Deep Neural Networks Mar 8, 2023 Fault Detection speech-recognition
Code Code Available 05 First-Pass Large Vocabulary Continuous Speech Recognition using Bi-Directional Recurrent DNNs Aug 12, 2014 Language Modeling Language Modelling
Code Code Available 05 Decoding P300 Variability using Convolutional Neural Networks Jun 14, 2019 EEG Eeg Decoding
Code Code Available 05 Anti-Transfer Learning for Task Invariance in Convolutional Neural Networks for Speech Processing Jun 11, 2020 Emotion Recognition speech-recognition
Code Code Available 05 DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers Oct 5, 2023 Decoder Logical Reasoning
Code Code Available 05 Deep Gradient Compression Reduce the Communication Bandwidth For distributed Traning Dec 5, 2017 Federated Learning image-classification
Code Code Available 05 Data Fusion for Audiovisual Speaker Localization: Extending Dynamic Stream Weights to the Spatial Domain Feb 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Cascaded Cross-Modal Transformer for Audio-Textual Classification Jan 15, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Data augmentation using prosody and false starts to recognize non-native children's speech Aug 29, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Data Quality Measures and Efficient Evaluation Algorithms for Large-Scale High-Dimensional Data Jan 5, 2021 BIG-bench Machine Learning speech-recognition
Code Code Available 05 Blank Collapse: Compressing CTC emission for the faster decoding Oct 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 05 Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training Dec 5, 2017 Federated Learning image-classification