| SE-Bridge: Speech Enhancement with Consistent Brownian Bridge | May 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Evaluating OpenAI's Whisper ASR for Punctuation Prediction and Topic Modeling of life histories of the Museum of the Person | May 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Personalized Predictive ASR for Latency Reduction in Voice Assistants | May 23, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Text Generation with Speech Synthesis for ASR Data Augmentation | May 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Debiased Automatic Speech Recognition for Dysarthric Speech via Sample Reweighting with Sample Affinity Test | May 22, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| GNCformer Enhanced Self-attention for Automatic Speech Recognition | May 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| On the Efficacy and Noise-Robustness of Jointly Learned Speech Emotion and Automatic Speech Recognition | May 21, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Semantic VAD: Low-Latency Voice Activity Detection for Speech Interaction | May 21, 2023 | Action Detection, Activity Detection | Unverified | 0 |
| Hystoc: Obtaining word confidences for fusion of end-to-end ASR systems | May 21, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| CASA-ASR: Context-Aware Speaker-Attributed ASR | May 21, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| VAKTA-SETU: A Speech-to-Speech Machine Translation Service in Select Indic Languages | May 21, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-supervised representations in speech-based depression detection | May 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Blank-regularized CTC for Frame Skipping in Neural Transducer | May 19, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Unsupervised ASR via Cross-Lingual Pseudo-Labeling | May 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| BAT: Boundary aware transducer for memory-efficient and low-latency ASR | May 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks | May 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Lexical-aware Non-autoregressive Transformer-based ASR Model | May 18, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ML-SUPERB: Multilingual Speech Universal PERformance Benchmark | May 18, 2023 | Automatic Speech Recognition, Language Identification | Unverified | 0 |
| Adversarial Speaker Disentanglement Using Unannotated External Data for Self-supervised Representation Based Voice Conversion | May 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Application-Agnostic Language Modeling for On-Device ASR | May 16, 2023 | Automatic Speech Recognition, Language Modeling | Unverified | 0 |
| Critical Appraisal of Artificial Intelligence-Mediated Communication | May 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| OOD-Speech: A Large Bengali Speech Recognition Dataset for Out-of-Distribution Benchmarking | May 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations | May 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Continual Learning for End-to-End ASR by Averaging Domain Experts | May 12, 2023 | Automatic Speech Recognition, Continual Learning | Unverified | 0 |
| Investigating the Sensitivity of Automatic Speech Recognition Systems to Phonetic Variation in L2 Englishes | May 12, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Masked Audio Text Encoders are Effective Multi-Modal Rescorers | May 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Quran Recitation Recognition using End-to-End Deep Learning | May 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Who Needs Decoders? Efficient Estimation of Sequence-level Attributes | May 9, 2023 | Attribute, Automatic Speech Recognition | Unverified | 0 |
| Robust Acoustic and Semantic Contextual Biasing in Neural Transducers for Speech Recognition | May 9, 2023 | Automatic Speech Recognition, Language Modelling | Unverified | 0 |
| Exploration of Language Dependency for Japanese Self-Supervised Speech Representation Models | May 9, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-Temporal Lip-Audio Memory for Visual Speech Recognition | May 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Fast Conformer with Linearly Scalable Attention for Efficient Speech Recognition | May 8, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Lookahead When It Matters: Adaptive Non-causal Transformers for Streaming Neural Transducers | May 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Evaluating Variants of wav2vec 2.0 on Affective Vocal Burst Tasks | May 5, 2023 | Automatic Speech Recognition, Cultural Vocal Bursts Intensity Prediction | Code Available | 0 |
| Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks | May 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-end spoken language understanding using joint CTC loss and self-supervised, pretrained acoustic encoders | May 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Employing Hybrid Deep Neural Networks on Dari Speech | May 4, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Considerations for Ethical Speech Recognition Datasets | May 3, 2023 | Automatic Speech Recognition, Diversity | Unverified | 0 |
| A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge | May 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Lessons Learned in ATCO2: 5000 hours of Air Traffic Control Communications for Robust Automatic Speech Recognition and Understanding | May 2, 2023 | Automatic Speech Recognition, Language Identification | Unverified | 0 |
| A Review of Deep Learning Techniques for Speech Processing | Apr 30, 2023 | Automatic Speech Recognition, Deep Learning | Unverified | 0 |
| Building a Non-native Speech Corpus Featuring Chinese-English Bilingual Children: Compilation and Rationale | Apr 30, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Towards Better Domain Adaptation for Self-supervised Models: A Case Study of Child ASR | Apr 28, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Understanding Shared Speech-Text Representations | Apr 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Deep Transfer Learning for Automatic Speech Recognition: Towards Better Generalization | Apr 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-regularised Minimum Latency Training for Streaming Transformer-based Speech Recognition | Apr 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Non-autoregressive End-to-end Approaches for Joint Automatic Speech Recognition and Spoken Language Understanding | Apr 21, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| OLISIA: a Cascade System for Spoken Dialogue State Tracking | Apr 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Towards the Universal Defense for Query-Based Audio Adversarial Attacks | Apr 20, 2023 | Audio Fingerprint, Automatic Speech Recognition | Unverified | 0 |
| Security and Privacy Problems in Voice Assistant Applications: A Survey | Apr 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |