Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition Dec 3, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis May 9, 2022 Deep Learning Semantic Communication
Code Code Available 1Convolutional Neural Network (CNN) to reduce construction loss in JPEG compression caused by Discrete Fourier Transform (DFT) Aug 26, 2022 Data Compression Image Compression
Code Code Available 1Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition Oct 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adaptation of Whisper models to child speech recognition Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Computer-Generated Music for Tabletop Role-Playing Games Aug 16, 2020 speech-recognition Speech Recognition
Code Code Available 1Discriminative Multi-modality Speech Recognition May 12, 2020 Audio-Visual Speech Recognition Lipreading
Code Code Available 1Disentangling Speakers in Multi-Talker Speech Recognition with Speaker-Aware CTC Sep 19, 2024 Disentanglement speech-recognition
Code Code Available 1Adapting End-to-End Speech Recognition for Readable Subtitles May 25, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks Oct 7, 2016 Anomaly Detection Automatic Speech Recognition
Code Code Available 1Compiling ONNX Neural Network Models Using MLIR Aug 19, 2020 speech-recognition Speech Recognition
Code Code Available 1A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 1DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 1Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Feb 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DOVER: A Method for Combining Diarization Outputs Sep 17, 2019 speech-recognition Speech Recognition
Code Code Available 1Do VSR Models Generalize Beyond LRS3? Nov 23, 2023 Lip Reading speech-recognition
Code Code Available 1Adapting Pretrained Transformer to Lattices for Spoken Language Understanding Nov 2, 2020 Natural Language Understanding speech-recognition
Code Code Available 1DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering Mar 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model Jun 18, 2023 Data Augmentation Decoder
Code Code Available 1Dynamic Language Group-Based MoE: Enhancing Code-Switching Speech Recognition with Hierarchical Routing Jul 26, 2024 Attribute Language Modelling
Code Code Available 1Efficiently Modeling Long Sequences with Structured State Spaces Oct 31, 2021 Data Augmentation Language Modeling
Code Code Available 1Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients Nov 11, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning Oct 17, 2024 Representation Learning Self-Supervised Learning
Code Code Available 1Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings Apr 8, 2021 Emotion Recognition Speech Emotion Recognition
Code Code Available 1Emotion Recognition in Audio and Video Using Deep Neural Networks Jun 15, 2020 Deep Learning Emotion Recognition
Code Code Available 1EnCodecMAE: Leveraging neural codecs for universal audio representation learning Sep 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1End-to-end Audio-visual Speech Recognition with Conformers Feb 12, 2021 Audio-Visual Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features Aug 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1End-to-End Speech Recognition and Disfluency Removal Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation Jul 26, 2024 Contrastive Learning speech-recognition
Code Code Available 1Enhancing Monotonic Multihead Attention for Streaming ASR May 19, 2020 All Automatic Speech Recognition
Code Code Available 1Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Sep 18, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Evaluating Speech Synthesis by Training Recognizers on Synthetic Speech Oct 1, 2023 speech-recognition Speech Recognition
Code Code Available 1A Comparison of Methods for OOV-word Recognition on a New Public Dataset Jul 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction Jul 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CopyNE: Better Contextual ASR by Copying Named Entities May 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning Sep 25, 2023 Representation Learning Self-Supervised Learning
Code Code Available 1Common Voice: A Massively-Multilingual Speech Corpus Dec 13, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Radically Old Way of Computing Spectra: Applications in End-to-End ASR Mar 25, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Fine-Tuning Self-Supervised Learning Models for End-to-End Pronunciation Scoring Sep 19, 2023 Feature Engineering Phone-level pronunciation scoring
Code Code Available 1CMULAB: An Open-Source Framework for Training and Deployment of Natural Language Processing Models Apr 3, 2024 Optical Character Recognition (OCR) speech-recognition
Code Code Available 1Communication-Efficient Learning of Deep Networks from Decentralized Data Feb 17, 2016 Federated Learning Speech Recognition
Code Code Available 1ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers Apr 20, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CLSRIL-23: Cross Lingual Speech Representations for Indic Languages Jul 15, 2021 Self-Supervised Learning speech-recognition
Code Code Available 1