Learning a Dual-Mode Speech Recognition Model via Self-Pruning Jul 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Unsupervised data selection for Speech Recognition with contrastive loss ratios Jul 25, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Deep Dive into Deep Cluster Jul 24, 2022 Clustering speech-recognition
— Unverified 0Improving Mandarin Speech Recogntion with Block-augmented Transformer Jul 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For Gesture And Speech Recognition Jul 23, 2022 Gesture Recognition Hand Gesture Recognition
— Unverified 0ASR Error Detection via Audio-Transcript entailment Jul 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Toward Fairness in Speech Recognition: Discovery and mitigation of performance disparities Jul 22, 2022 Fairness speech-recognition
— Unverified 0Bayesian Recurrent Units and the Forward-Backward Algorithm Jul 21, 2022 Speech Recognition
Code Code Available 0AutoDiCE: Fully Automated Distributed CNN Inference at the Edge Jul 20, 2022 Code Generation image-classification
Code Code Available 1When Is TTS Augmentation Through a Pivot Language Useful? Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improving Data Driven Inverse Text Normalization using Data Augmentation Jul 20, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding Jul 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale Jul 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0WideResNet with Joint Representation Learning and Data Augmentation for Cover Song Identification Jul 18, 2022 Cover song identification Data Augmentation
— Unverified 0End-to-End Spoken Language Understanding: Performance analyses of a voice command task in a low resource setting Jul 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation Jul 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees Jul 16, 2022 Federated Learning speech-recognition
— Unverified 0MAC-DO: An Efficient Output-Stationary GEMM Accelerator for CNNs Using DRAM Technology Jul 16, 2022 speech-recognition Speech Recognition
— Unverified 0Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments Jul 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Position Prediction as an Effective Pretraining Strategy Jul 15, 2022 Position Prediction
Code Code Available 0Two-Pass Low Latency End-to-End Spoken Language Understanding Jul 14, 2022 speech-recognition Speech Recognition
— Unverified 0Deep versus Wide: An Analysis of Student Architectures for Task-Agnostic Knowledge Distillation of Self-Supervised Speech Models Jul 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Data Augmentation for Low-Resource Quechua ASR Improvement Jul 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Open Terminology Management and Sharing Toolkit for Federation of Terminology Databases Jul 14, 2022 Machine Translation Management
— Unverified 0u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled Modality Jul 14, 2022 Speaker Verification speech-recognition
Code Code Available 2Efficient spike encoding algorithms for neuromorphic speech recognition Jul 14, 2022 speech-recognition Speech Recognition
— Unverified 0MM-ALT: A Multimodal Automatic Lyric Transcription System Jul 13, 2022 Action Detection Activity Detection
Code Code Available 1Visual Context-driven Audio Feature Enhancement for Robust End-to-End Audio-Visual Speech Recognition Jul 13, 2022 Audio-Visual Speech Recognition Decoder
Code Code Available 1Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition Jul 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0End-to-end speech recognition modeling from de-identified data Jul 12, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Online Continual Learning of End-to-End Speech Recognition Models Jul 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0pMCT: Patched Multi-Condition Training for Robust Speech Recognition Jul 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speaker Anonymization with Phonetic Intermediate Representations Jul 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder Jul 9, 2022 Decoder speech-recognition
— Unverified 0Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition Jul 9, 2022 Language Modeling Language Modelling
— Unverified 0Tandem Multitask Training of Speaker Diarisation and Speech Recognition for Meeting Transcription Jul 8, 2022 Action Detection Activity Detection
— Unverified 0Investigating the Impact of Cross-lingual Acoustic-Phonetic Similarities on Multilingual Speech Recognition Jul 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Non-Linear Pairwise Language Mappings for Low-Resource Multilingual Acoustic Model Fusion Jul 7, 2022 speech-recognition Speech Recognition
— Unverified 0End-to-end Speech-to-Punctuated-Text Recognition Jul 7, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Low-resource Low-footprint Wake-word Detection using Knowledge Distillation Jul 6, 2022 Knowledge Distillation speech-recognition
— Unverified 0Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands Jul 6, 2022 Audio-Visual Speech Recognition speech-recognition
— Unverified 0Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies Jul 6, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Branchformer: Parallel MLP-Attention Architectures to Capture Local and Global Context for Speech Recognition and Understanding Jul 6, 2022 speech-recognition Speech Recognition
— Unverified 0Compute Cost Amortized Transformer for Streaming ASR Jul 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0DEFORMER: Coupling Deformed Localized Patterns with Global Context for Robust End-to-end Speech Recognition Jul 4, 2022 speech-recognition Speech Recognition
— Unverified 0CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution Jul 4, 2022 Compiler Optimization image-classification
Code Code Available 1Vietnamese Capitalization and Punctuation Recovery Models Jul 4, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Generating gender-ambiguous voices for privacy-preserving speech recognition Jul 3, 2022 Attribute Generative Adversarial Network
Code Code Available 0Leveraging Acoustic Contextual Representation by Audio-textual Cross-modal Learning for Conversational ASR Jul 3, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0