Mlphon: A Multifunctional Grapheme-Phoneme Conversion Tool Using Finite State Transducers Sep 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Distilling the Knowledge of BERT for CTC-based ASR Sep 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Towards Deep Learning-aided Wireless Channel Estimation and Channel State Information Feedback for 6G Sep 5, 2022 image-classification Image Classification
— Unverified 0A Review of Sparse Expert Models in Deep Learning Sep 4, 2022 Deep Learning Mixture-of-Experts
— Unverified 0Semantically Meaningful Metrics for Norwegian ASR Systems Sep 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Universal Fourier Attack for Time Series Sep 2, 2022 speech-recognition Speech Recognition
— Unverified 0DeepCon: An End-to-End Multilingual Toolkit for Automatic Minuting of Multi-Party Dialogues Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Robust Translation of French Live Speech Transcripts Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluation of Automatic Speech Recognition for Conversational Speech in Dutch, English and German: What Goes Missing? Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improved Open Source Automatic Subtitling for Lecture Videos Sep 1, 2022 Speech Recognition
Code Code Available 1A Wavelet Transform Based Scheme to Extract Speech Pitch and Formant Frequencies Sep 1, 2022 speech-recognition Speech Recognition
— Unverified 0Deep Sparse Conformer for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Attention Enhanced Citrinet for Speech Recognition Sep 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RecLight: A Recurrent Neural Network Accelerator with Integrated Silicon Photonics Aug 31, 2022 Activity Recognition Anomaly Detection
— Unverified 0Visual Speech Recognition in a Driver Assistance System Aug 29, 2022 Data Augmentation Lipreading
— Unverified 0A Language Agnostic Multilingual Streaming On-Device ASR System Aug 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Turn-Taking Prediction for Natural Conversational Speech Aug 29, 2022 Prediction speech-recognition
— Unverified 0Bayesian Neural Network Language Modeling for Speech Recognition Aug 28, 2022 Data Augmentation Language Modeling
Code Code Available 0Minimal Feature Analysis for Isolated Digit Recognition for varying encoding rates in noisy environments Aug 27, 2022 speech-recognition Speech Recognition
— Unverified 0Convolutional Neural Network (CNN) to reduce construction loss in JPEG compression caused by Discrete Fourier Transform (DFT) Aug 26, 2022 Data Compression Image Compression
Code Code Available 1Investigating data partitioning strategies for crosslinguistic low-resource ASR evaluation Aug 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages Aug 26, 2022 Diversity Optical Character Recognition (OCR)
— Unverified 0IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages Aug 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Low-Level Physiological Implications of End-to-End Learning of Speech Recognition Aug 22, 2022 speech-recognition Speech Recognition
— Unverified 0DualVoice: Speech Interaction that Discriminates between Normal and Whispered Voice Input Aug 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Are disentangled representations all you need to build speaker anonymization systems? Aug 22, 2022 All Automatic Speech Recognition
— Unverified 0Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition Aug 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Building a Public Domain Voice Database for Odia Aug 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Uconv-Conformer: High Reduction of Input Sequence Length for End-to-End Speech Recognition Aug 16, 2022 CPU GPU
— Unverified 0Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech Aug 10, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Comparison and Analysis of New Curriculum Criteria for End-to-End ASR Aug 10, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speaker-adaptive Lip Reading with User-dependent Padding Aug 9, 2022 Lip Reading speech-recognition
Code Code Available 0Thai Wav2Vec2.0 with CommonVoice V8 Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0ASR Error Correction with Constrained Decoding on Operation Prediction Aug 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Model Blending for Text Classification Aug 5, 2022 Classification Machine Translation
— Unverified 0Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning Aug 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Automatic Speech Recognition in German: A Detailed Error Analysis Aug 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adversarial Attacks on ASR Systems: An Overview Aug 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multiclass ASMA vs Targeted PGD Attack in Image Segmentation Aug 3, 2022 Adversarial Attack Classification
— Unverified 0DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Global Performance Disparities Between English-Language Accents in Automatic Speech Recognition Aug 1, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Domain Specific Wav2vec 2.0 Fine-tuning For The SE&R 2022 Challenge Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pronunciation-aware unique character encoding for RNN Transducer-based Mandarin speech recognition Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Thutmose Tagger: Single-pass neural model for Inverse Text Normalization Jul 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Extending RNN-T-based speech recognition systems with emotion and language classification Jul 28, 2022 Emotion Classification Emotion Recognition
— Unverified 0Knowledge-driven Subword Grammar Modeling for Automatic Speech Recognition in Tamil and Kannada Jul 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Subword Dictionary Learning and Segmentation Techniques for Automatic Speech Recognition in Tamil and Kannada Jul 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SoundChoice: Grapheme-to-Phoneme Models with Semantic Disambiguation Jul 27, 2022 Language Modeling Language Modelling
— Unverified 0Perception-Aware Attack: Creating Adversarial Music via Reverse-Engineering Human Perception Jul 26, 2022 Adversarial Attack Speaker Recognition
— Unverified 0