Towards Improved Room Impulse Response Estimation for Speech Recognition Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ATCO2 corpus: A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications Nov 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Multi-blank Transducers for Speech Recognition Nov 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Speech Processing Nov 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Automatic Severity Classification of Dysarthric speech by using Self-supervised Model with Multi-task Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Robust Data2vec: Noise-robust Speech Representation Learning for ASR by Combining Regression and Improved Contrastive Learning Oct 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
There is more than one kind of robustness: Fooling Whisper with adversarial examples Oct 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation Oct 24, 2022 Action Detection Activity Detection
ESB: A Benchmark For Multi-Domain End-to-End Speech Recognition Oct 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Generalizing in the Real World with Representation Learning Oct 18, 2022 Drug Discovery Representation Learning
Towards Relation Extraction From Speech Oct 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
TransFusion: Transcribing Speech with Multinomial Diffusion Oct 14, 2022 Denoising Image Generation
Can we use Common Voice to train a Multi-Speaker TTS system? Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A context-aware knowledge transferring strategy for CTC-based ASR Oct 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Foundation Transformers Oct 12, 2022 Language Modeling Language Modelling
JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT Oct 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
HAPI: A Large-scale Longitudinal Dataset of Commercial ML API Predictions Sep 18, 2022 object-detection Object Detection
Non-autoregressive Error Correction for CTC-based ASR with Phone-conditioned Masked LM Sep 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Goodness of Pronunciation Pipelines for OOV Problem Sep 8, 2022 Accented Speech Recognition Speech Recognition
ASR2K: Speech Recognition for Around 2000 Languages without Audio Sep 6, 2022 Language Modeling Language Modelling
Deep Sparse Conformer for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improved Open Source Automatic Subtitling for Lecture Videos Sep 1, 2022 Speech Recognition
Convolutional Neural Network (CNN) to reduce construction loss in JPEG compression caused by Discrete Fourier Transform (DFT) Aug 26, 2022 Data Compression Image Compression
IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages Aug 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
MM-ALT: A Multimodal Automatic Lyric Transcription System Jul 13, 2022 Action Detection Activity Detection
Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition Jul 13, 2022 Audio-Visual Speech Recognition Decoder
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution Jul 4, 2022 Compiler Optimization image-classification
Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition Jun 29, 2022 speech-recognition Speech Recognition
Distilling a Pretrained Language Model to a Multilingual ASR Model Jun 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
AVATAR: Unconstrained Audiovisual Speech Recognition Jun 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Towards Understanding and Mitigating Audio Adversarial Examples for Speaker Recognition Jun 7, 2022 Speaker Recognition speech-recognition
Variable-rate hierarchical CPC leads to acoustic unit discovery in speech Jun 5, 2022 Acoustic Unit Discovery Disentanglement
LAE: Language-Aware Encoder for Monolingual and Multilingual ASR Jun 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition Jun 1, 2022 Audio-Visual Speech Recognition speech-recognition
Global Normalization for Streaming Speech Recognition in a Modular Framework May 26, 2022 speech-recognition Speech Recognition
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners May 22, 2022 Attribute Automatic Speech Recognition
Deep Learning Enabled Semantic Communications with Speech Recognition and Synthesis May 9, 2022 Deep Learning Semantic Communication
Wav2vec2 Base Vietnamese 160h May 8, 2022 Speech Recognition
Vietnamese Automatic Speech Recognition using Wav2vec 2.0 May 8, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment May 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Speaker Recognition in the Wild May 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages May 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
A Survey on Non-Autoregressive Generation for Neural Machine Translation and Beyond Apr 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Deep transfer operator learning for partial differential equations under conditional shift Apr 20, 2022 Domain Adaptation Operator learning