End-to-End Speech Recognition and Disfluency Removal Sep 22, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1End-to-End Speech Recognition from Federated Acoustic Models Apr 29, 2021 2k 4k
Code Code Available 1A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Enhancing Monotonic Multihead Attention for Streaming ASR May 19, 2020 All Automatic Speech Recognition
Code Code Available 1Espresso: A Fast End-to-end Neural Speech Recognition Toolkit Sep 18, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1AI Accelerator Survey and Trends Sep 18, 2021 Benchmarking Computational Efficiency
Code Code Available 1Evaluating the visualization of what a Deep Neural Network has learned Sep 21, 2015 Classification General Classification
Code Code Available 1Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction Jul 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Comparative layer-wise analysis of self-supervised speech models Nov 8, 2022 speech-recognition Speech Recognition
Code Code Available 1CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings Jun 6, 2021 Machine Translation speech-recognition
Code Code Available 1Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Fast-HuBERT: An Efficient Training Framework for Self-Supervised Speech Representation Learning Sep 25, 2023 Representation Learning Self-Supervised Learning
Code Code Available 1CB-Conformer: Contextual biasing Conformer for biased word recognition Apr 19, 2023 Automatic Speech Recognition Language Modeling
Code Code Available 1Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1AISHELL-1: An Open-Source Mandarin Speech Corpus and A Speech Recognition Baseline Sep 16, 2017 speech-recognition Speech Recognition
Code Code Available 1A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Can Contextual Biasing Remain Effective with Whisper and GPT-2? Jun 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline Sep 22, 2020 speech-recognition Speech Recognition
Code Code Available 1FlowerFormer: Empowering Neural Architecture Encoding using a Flow-aware Graph Transformer Mar 19, 2024 Representation Learning speech-recognition
Code Code Available 1FlowMur: A Stealthy and Practical Audio Backdoor Attack with Limited Knowledge Dec 15, 2023 Backdoor Attack Data Poisoning
Code Code Available 1Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition Mar 6, 2020 Lipreading Lip Reading
Code Code Available 1CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition Jan 11, 2022 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Generalizing in the Real World with Representation Learning Oct 18, 2022 Drug Discovery Representation Learning
Code Code Available 1Generative Pre-Training for Speech with Autoregressive Predictive Coding Oct 23, 2019 Representation Learning Speaker Identification
Code Code Available 1GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio Jun 13, 2021 Sentence speech-recognition
Code Code Available 1Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
Code Code Available 1Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview Oct 14, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1GPU-accelerated Guided Source Separation for Meeting Transcription Dec 10, 2022 blind source separation CPU
Code Code Available 1Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Graph Convolutions Enrich the Self-Attention in Transformers! Dec 7, 2023 Clone Detection
Code Code Available 1A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1HiFi-VC: High Quality ASR-Based Voice Conversion Mar 31, 2022 speech-recognition Speech Recognition
Code Code Available 1Byakto Speech: Real-time long speech synthesis with convolutional neural network: Transfer learning from English to Bangla May 31, 2021 Deep Learning speech-recognition
Code Code Available 1HypR: A comprehensive study for ASR hypothesis revising with a reference corpus Sep 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition Sep 21, 2023 speech-recognition Speech Recognition
Code Code Available 1Improved acoustic word embeddings for zero-resource languages using multilingual transfer Jun 2, 2020 speech-recognition Speech Recognition
Code Code Available 1Improved Noisy Student Training for Automatic Speech Recognition May 19, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improved Open Source Automatic Subtitling for Lecture Videos Sep 1, 2022 Speech Recognition
Code Code Available 1Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder Aug 14, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection Jan 30, 2022 speech-recognition Speech Recognition
Code Code Available 1Improving Self-supervised Pre-training using Accent-Specific Codebooks Jul 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improving Transformer-based Speech Recognition Using Unsupervised Pre-training Oct 22, 2019 speech-recognition Speech Recognition
Code Code Available 1Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition Oct 21, 2024 cross-modal alignment speech-recognition
Code Code Available 1indic-punct: An automatic punctuation restoration and inverse text normalization framework for Indic languages Mar 31, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages Aug 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation May 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing Mar 18, 2022 Representation Learning Speaker Verification
Code Code Available 1Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural Networks Jul 21, 2021 Image Classification Natural Language Understanding
Code Code Available 1