Blank-regularized CTC for Frame Skipping in Neural Transducer; May 19, 2023; Automatic Speech Recognition, speech-recognition; Unverified; 0
A New Benchmark of Aphasia Speech Recognition and Detection Based on E-Branchformer and Multi-task Learning; May 19, 2023; Multi-Task Learning, speech-recognition; Unverified; 0
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark; May 18, 2023; Automatic Speech Recognition, Language Identification; Unverified; 0
A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks; May 18, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
A Lexical-aware Non-autoregressive Transformer-based ASR Model; May 18, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System; May 18, 2023; speech-recognition, Speech Recognition; Unverified; 0
Use of Speech Impairment Severity for Dysarthric Speech Recognition; May 18, 2023; Diversity, severity prediction; Unverified; 0
FunASR: A Fundamental End-to-End Speech Recognition Toolkit; May 18, 2023; Action Detection, Activity Detection; Unverified; 0
DQ-Whisper: Joint Distillation and Quantization for Efficient Multilingual Speech Recognition; May 18, 2023; Knowledge Distillation, Quantization; Unverified; 0
Boosting Local Spectro-Temporal Features for Speech Analysis; May 17, 2023; Classification, object-detection; Unverified; 0
Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion; May 16, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Application-Agnostic Language Modeling for On-Device ASR; May 16, 2023; Automatic Speech Recognition, Language Modeling; Unverified; 0
Critical Appraisal of Artificial Intelligence-Mediated Communication; May 15, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking; May 15, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations; May 14, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes; May 12, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Continual Learning for End-to-End ASR by Averaging Domain Experts; May 12, 2023; Automatic Speech Recognition, Continual Learning; Unverified; 0
Accelerator-Aware Training for Transducer-Based Speech Recognition; May 12, 2023; CPU, Quantization; Unverified; 0
Masked Audio Text Encoders are Effective Multi-Modal Rescorers; May 11, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Quran Recitation Recognition using End-to-End Deep Learning; May 10, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models; May 9, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition; May 9, 2023; Automatic Speech Recognition, Language Modelling; Unverified; 0
Who Needs Decoders? Efficient Estimation of Sequence-level Attributes; May 9, 2023; Attribute, Automatic Speech Recognition; Unverified; 0
Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions; May 8, 2023; Novel View Synthesis, speech-recognition; Unverified; 0
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition; May 8, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition; May 8, 2023; Automatic Speech Recognition, Decoder; Unverified; 0
Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers; May 7, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks; May 5, 2023; Automatic Speech Recognition, Cultural Vocal Bursts Intensity Prediction; Code Available; 0
Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks; May 4, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders; May 4, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Employing Hybrid Deep Neural Networks on Dari Speech; May 4, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Considerations for Ethical Speech Recognition Datasets; May 3, 2023; Automatic Speech Recognition, Diversity; Unverified; 0
A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge; May 2, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding; May 2, 2023; Automatic Speech Recognition, Language Identification; Unverified; 0
Deep Learning-based Spatio Temporal Facial Feature Visual Speech Recognition; Apr 30, 2023; Deep Learning, Face Recognition; Unverified; 0
A Review of Deep Learning Techniques for Speech Processing; Apr 30, 2023; Automatic Speech Recognition, Deep Learning; Unverified; 0
Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale; Apr 30, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR; Apr 28, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Code Available; 0
Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization; Apr 27, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Understanding Shared Speech-Text Representations; Apr 27, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Optimizing Deep Learning Models For Raspberry Pi; Apr 25, 2023; CPU, Deep Learning; Code Available; 0
Modeling Spoken Information Queries for Virtual Assistants: Open Problems, Challenges and Opportunities; Apr 25, 2023; domain classification, Information Retrieval; Unverified; 0
Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition; Apr 24, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Recurrent Neural Networks and Long Short-Term Memory Networks: Tutorial and Survey; Apr 22, 2023; Language Modeling, Language Modelling; Unverified; 0
Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding; Apr 21, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Towards the Universal Defense for Query-Based Audio Adversarial Attacks; Apr 20, 2023; Audio Fingerprint, Automatic Speech Recognition; Unverified; 0
OLISIA: a Cascade System for Spoken Dialogue State Tracking; Apr 20, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Code Available; 0
Security and Privacy Problems in Voice Assistant Applications: A Survey; Apr 19, 2023; Automatic Speech Recognition, Automatic Speech Recognition (ASR); Unverified; 0
Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR; Apr 18, 2023; speech-recognition, Speech Recognition; Unverified; 0
Towards the Transferable Audio Adversarial Attack via Ensemble Methods; Apr 18, 2023; Adversarial Attack, Autonomous Driving; Unverified; 0