Efficient Sequence Transduction by Jointly Predicting Tokens and Durations | Apr 13, 2023 | Intent Classification; Intent Classification and Slot Filling | Code available (15)
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Oct 17, 2024 | Representation Learning; Self-Supervised Learning | Code available (15)
Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition | Sep 5, 2018 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Emotionless: Privacy-Preserving Speech Analysis for Voice Assistants | Aug 9, 2019 | Emotion Recognition; Privacy Preserving | Code available (15)
Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System | Jul 13, 2024 | Decoder; speech-recognition | Code available (15)
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units | Jun 14, 2021 | Clustering; Language Modelling | Code available (15)
Quilt-1M: One Million Image-Text Pairs for Histopathology | Jun 20, 2023 | Automatic Speech Recognition; Cross-Modal Retrieval | Code available (15)
End-to-End Automatic Speech Recognition for Gujarati | Dec 1, 2020 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Real-Time Multimodal Cognitive Assistant for Emergency Medical Services | Mar 11, 2024 | Action Recognition; Edge-computing | Code available (15)
Attack on practical speaker verification system using universal adversarial perturbations | May 19, 2021 | Real-World Adversarial Attack; Room Impulse Response (RIR) | Code available (15)
End-to-end Audio-visual Speech Recognition with Conformers | Feb 12, 2021 | Audio-Visual Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
A transfer learning based approach for pronunciation scoring | Nov 1, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
End-to-End Speech Recognition from Federated Acoustic Models | Apr 29, 2021 | 2k; 4k | Code available (15)
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Translation | Nov 1, 2023 | Automatic Speech Recognition; speech-recognition | Code available (15)
End-to-End Speech Recognition and Disfluency Removal | Sep 22, 2020 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Attention-Based Models for Speech Recognition | Jun 24, 2015 | Machine Translation; Phoneme Recognition | Code available (15)
RETURNN as a Generic Flexible Neural Toolkit with Application to Translation and Speech Recognition | May 14, 2018 | Decoder; speech-recognition | Code available (15)
Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation | Jul 26, 2024 | Contrastive Learning; speech-recognition | Code available (15)
RNN Transducer Models For Spoken Language Understanding | Apr 8, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
HyPoradise: An Open Baseline for Generative Speech Recognition with Large Language Models | Sep 27, 2023 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Romanian Speech Recognition Experiments from the ROBIN Project | Nov 23, 2021 | Language Modelling; speech-recognition | Code available (15)
RWTH ASR Systems for LibriSpeech: Hybrid vs Attention -- w/o Data Augmentation | May 8, 2019 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition | Oct 24, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications | Nov 8, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Evaluating the visualization of what a Deep Neural Network has learned | Sep 21, 2015 | Classification; General Classification | Code available (15)
Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech | Oct 1, 2023 | speech-recognition; Speech Recognition | Code available (15)
Deep Discriminative Feature Learning for Accent Recognition | Nov 25, 2020 | Face Recognition; Speaker Identification | Code available (15)
How Much Context Does My Attention-Based ASR System Need? | Oct 24, 2023 | speech-recognition; Speech Recognition | Code available (15)
Self-Supervised Learning for speech recognition with Intermediate layer supervision | Dec 16, 2021 | Language Modeling; Language Modelling | Code available (15)
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations | Sep 28, 2021 | Benchmarking; Dialogue State Tracking | Code available (15)
ExKaldi-RT: A Real-Time Automatic Speech Recognition Extension Toolkit of Kaldi | Apr 3, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
How2: A Large-scale Dataset for Multimodal Language Understanding | Nov 1, 2018 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Sentiment Word Aware Multimodal Refinement for Multimodal Sentiment Analysis with ASR Errors | Mar 1, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy Minimization | Jun 3, 2023 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR | Mar 29, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Fast Development of ASR in African Languages using Self Supervised Speech Representation Learning | Mar 16, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
SLUE: New Benchmark Tasks for Spoken Language Understanding Evaluation on Natural Speech | Nov 19, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition | Oct 12, 2021 | Emotion Recognition; Speech Emotion Recognition | Code available (15)
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement | Jun 22, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications | Mar 31, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels | Dec 5, 2022 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition | Apr 17, 2020 | Audio-Visual Speech Recognition; speech-recognition | Code available (15)
HypR: A comprehensive study for ASR hypothesis revising with a reference corpus | Sep 18, 2023 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning | Sep 25, 2023 | Representation Learning; Self-Supervised Learning | Code available (15)
FAST-RIR: Fast neural diffuse room impulse response generator | Oct 7, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions | Sep 18, 2022 | object-detection; Object Detection | Code available (15)
FedScale: Benchmarking Model and System Performance of Federated Learning at Scale | May 24, 2021 | Benchmarking; Federated Learning | Code available (15)
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English | Aug 3, 2021 | Automatic Speech Recognition; Automatic Speech Recognition (ASR) | Code available (15)