BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications Oct 12, 2021 Action Detection Activity Detection
Code Code Available 1UniSpeech-SAT: Universal Speech Representation Learning with Speaker Aware Pre-Training Oct 12, 2021 Data Augmentation Multi-Task Learning
Code Code Available 1K-Wav2vec 2.0: Automatic Speech Recognition based on Joint Decoding of Graphemes and Syllables Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition Oct 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Long Expressive Memory for Sequence Modeling Oct 10, 2021 Language Modeling Language Modelling
Code Code Available 1Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset Oct 9, 2021 Deep Learning Emotion Recognition
Code Code Available 1WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition Oct 7, 2021 Label Error Detection Optical Character Recognition
Code Code Available 1FAST-RIR: Fast neural diffuse room impulse response generator Oct 7, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Late reverberation suppression using U-nets Oct 5, 2021 Decoder Speech Dereverberation
Code Code Available 1"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations Sep 28, 2021 Benchmarking Dialogue State Tracking
Code Code Available 1Factorized Neural Transducer for Efficient Language Model Adaptation Sep 27, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SD-QA: Spoken Dialectal Question Answering for the Real World Sep 24, 2021 Fairness Question Answering
Code Code Available 1AI Accelerator Survey and Trends Sep 18, 2021 Benchmarking Computational Efficiency
Code Code Available 1Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition Sep 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Vietnamese end-to-end speech recognition using wav2vec 2.0 Sep 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1You Only Hear Once: A YOLO-like Algorithm for Audio Segmentation and Sound Event Detection Sep 1, 2021 Articles Classification
Code Code Available 1Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition Aug 31, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification Aug 5, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1The History of Speech Recognition to the Year 2030 Jul 30, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments Jul 30, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SVEva Fair: A Framework for Evaluating Fairness in Speaker Verification Jul 26, 2021 Fairness Speaker Verification
Code Code Available 1Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 Jul 23, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Bridging the Gap between Spatial and Spectral Domains: A Unified Framework for Graph Neural Networks Jul 21, 2021 Image Classification Natural Language Understanding
Code Code Available 1Token-Level Supervised Contrastive Learning for Punctuation Restoration Jul 19, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1STRODE: Stochastic Boundary Ordinary Differential Equation Jul 17, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Comparison of Methods for OOV-word Recognition on a New Public Dataset Jul 16, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CLSRIL-23: Cross Lingual Speech Representations for Indic Languages Jul 15, 2021 Self-Supervised Learning speech-recognition
Code Code Available 1Layer-wise Analysis of a Self-supervised Speech Representation Model Jul 10, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Kosp2e: Korean Speech to English Translation Corpus Jul 6, 2021 speech-recognition Speech Recognition
Code Code Available 1TENET: A Time-reversal Enhancement Network for Noise-robust ASR Jul 4, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition Jul 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition Jul 1, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods Jun 21, 2021 Distant Speech Recognition Room Impulse Response (RIR)
Code Code Available 1Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better Jun 16, 2021 Deep Learning Information Retrieval
Code Code Available 1RyanSpeech: A Corpus for Conversational Text-to-Speech Synthesis Jun 15, 2021 speech-recognition Speech Recognition
Code Code Available 1Learning Audio-Visual Dereverberation Jun 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units Jun 14, 2021 Clustering Language Modelling
Code Code Available 1GigaSpeech: An Evolving, Multi-domain ASR Corpus with 10,000 Hours of Transcribed Audio Jun 13, 2021 Sentence speech-recognition
Code Code Available 1Incorporating External POS Tagger for Punctuation Restoration Jun 12, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CAPE: Encoding Relative Positions with Continuous Augmented Positional Embeddings Jun 6, 2021 Machine Translation speech-recognition
Code Code Available 1Lightweight Adapter Tuning for Multilingual Speech Translation Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Speech Recognition in Sanskrit: A New Speech Corpus and Modelling Insights Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attention-based Contextual Language Model Adaptation for Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Byakto Speech: Real-time long speech synthesis with convolutional neural network: Transfer learning from English to Bangla May 31, 2021 Deep Learning speech-recognition
Code Code Available 1FedScale: Benchmarking Model and System Performance of Federated Learning at Scale May 24, 2021 Benchmarking Federated Learning
Code Code Available 1Attack on practical speaker verification system using universal adversarial perturbations May 19, 2021 Real-World Adversarial Attack Room Impulse Response (RIR)
Code Code Available 1Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation May 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SpeechMoE: Scaling to Large Acoustic Models with Dynamic Routing Mixture of Experts May 7, 2021 Diversity Mixture-of-Experts
Code Code Available 1Software Engineering for AI-Based Systems: A Survey May 5, 2021 Autonomous Driving software testing
Code Code Available 1