CLSRIL-23: Cross Lingual Speech Representations for Indic Languages Jul 15, 2021 Self-Supervised Learning speech-recognition
Code Code Available 1Nanopore Base Calling on the Edge Nov 9, 2020 speech-recognition Speech Recognition
Code Code Available 1A Toolbox for Construction and Analysis of Speech Datasets Apr 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition Mar 18, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI Dec 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM Sep 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Deep Sparse Conformer for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset Jan 16, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 1Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 Jul 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition May 28, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural Networks Jul 21, 2021 Image Classification Natural Language Understanding
Code Code Available 1OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment Jun 10, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 1BLSP: Bootstrapping Language-Speech Pre-training via Behavior Alignment of Continuation Writing Sep 2, 2023 speech-recognition Speech Recognition
Code Code Available 1BrainBERT: Self-supervised representation learning for intracranial recordings Feb 28, 2023 Language Modeling Language Modelling
Code Code Available 1Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models Jul 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Bridging the Gaps of Both Modality and Language: Synchronous Bilingual CTC for Speech Translation and Speech Recognition Sep 21, 2023 speech-recognition Speech Recognition
Code Code Available 1Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement Jun 6, 2024 Diversity Speech Enhancement
Code Code Available 1PriMock57: A Dataset Of Primary Care Mock Consultations Apr 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Prompting Large Language Models with Audio for General-Purpose Speech Summarization Jun 10, 2024 speech-recognition Speech Recognition
Code Code Available 1Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization May 18, 2023 Audio-Visual Speech Recognition Prompt Engineering
Code Code Available 1PyChain: A Fully Parallelized PyTorch Implementation of LF-MMI for End-to-End ASR May 20, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Integer-only Zero-shot Quantization for Efficient Speech Recognition Mar 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 1BIG-C: a Multimodal Multi-Purpose Dataset for Bemba May 26, 2023 Machine Translation speech-recognition
Code Code Available 1A Comparison of Methods for OOV-word Recognition on a New Public Dataset Jul 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BembaSpeech: A Speech Recognition Corpus for the Bemba Language Feb 9, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement Dec 21, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition Jul 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BENDR: using transformers and a contrastive self-supervised learning task to learn from massive amounts of EEG data Jan 28, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Bridging the Granularity Gap for Acoustic Modeling May 27, 2023 speech-recognition Speech Recognition
Code Code Available 1Romanian Speech Recognition Experiments from the ROBIN Project Nov 23, 2021 Language Modelling speech-recognition
Code Code Available 1AdaScale SGD: A User-Friendly Algorithm for Distributed Training Jul 9, 2020 image-classification Image Classification
Code Code Available 1RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis Jun 15, 2021 speech-recognition Speech Recognition
Code Code Available 1AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Deep Discriminative Feature Learning for Accent Recognition Nov 25, 2020 Face Recognition Speaker Identification
Code Code Available 1AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Consecutive Decoding for Speech-to-text Translation Sep 21, 2020 Decoder Machine Translation
Code Code Available 1AV Taris: Online Audio-Visual Speech Recognition Dec 14, 2020 Action Detection Activity Detection
Code Code Available 1Self-supervised Learning with Random-projection Quantizer for Speech Recognition Feb 3, 2022 Self-Supervised Learning speech-recognition
Code Code Available 1Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition Aug 14, 2024 Automatic Speech Recognition Benchmarking
Code Code Available 1Shifted Chunk Encoder for Transformer Based Streaming End-to-End ASR Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Comprehensive Survey on Graph Neural Networks Jan 3, 2019 BIG-bench Machine Learning image-classification
Code Code Available 1A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1BackdoorMBTI: A Backdoor Learning Multimodal Benchmark Tool Kit for Backdoor Defense Evaluation Nov 17, 2024 Action Recognition backdoor defense
Code Code Available 1SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset May 12, 2024 Action Spotting Automatic Speech Recognition
Code Code Available 1SoftCTC -- Semi-Supervised Learning for Text Recognition using Soft Pseudo-Labels Dec 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Speaker Recognition in the Wild May 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Speech Recognition Benchmark for Air-Traffic Communications Jun 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1