| Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models | Jan 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition | Dec 27, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Towards Probing Contact Center Large Language Models | Dec 26, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge | Dec 26, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| Exploring data augmentation in bias mitigation against non-native-accented speech | Dec 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| BLSTM-Based Confidence Estimation for End-to-End Speech Recognition | Dec 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models | Dec 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition | Dec 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition? | Dec 19, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| SpokesBiz -- an Open Corpus of Conversational Polish | Dec 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech Recognition, Representation Learning | Unverified | 0 |
| Seq2seq for Automatic Paraphasia Detection in Aphasic Speech | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Conformer-Based Speech Recognition On Extreme Edge-Computing Devices | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| OAVA: the open audio-visual archives aggregator | Dec 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Generative Context-aware Fine-tuning of Self-supervised Speech Models | Dec 15, 2023 | Automatic Speech Recognition, named-entity-recognition | Unverified | 0 |
| LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | Dec 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition | Dec 15, 2023 | Automatic Speech Recognition, Language Identification | Unverified | 0 |
| Audio-visual fine-tuning of audio-only ASR models | Dec 14, 2023 | Automatic Speech Recognition, Self-Supervised Learning | Unverified | 0 |
| FastInject: Injecting Unpaired Text Data into CTC-based ASR training | Dec 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition | Dec 13, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models | Dec 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification | Dec 12, 2023 | Automatic Speech Recognition, Dialect Identification | Unverified | 0 |
| Creating Spoken Dialog Systems in Ultra-Low Resourced Settings | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning | Dec 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models | Dec 6, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training | Dec 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| End-to-End Speech-to-Text Translation: A Survey | Dec 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data | Nov 29, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR | Nov 24, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Soft Random Sampling: A Theoretical and Empirical Analysis | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| How does end-to-end speech recognition training impact speech enhancement artifacts? | Nov 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review | Nov 20, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Nov 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer | Nov 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Retrieve and Copy: Scaling ASR Personalization to Large Catalogs | Nov 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition | Nov 13, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| 1SPU: 1-step Speech Processing Unit | Nov 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition | Nov 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Fine-tuning convergence model in Bengali speech recognition | Nov 7, 2023 | Automatic Speech Recognition, model | Unverified | 0 |
| Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition | Nov 6, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants | Nov 2, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios | Oct 31, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Combining Language Models For Specialized Domains: A Colorful Approach | Oct 30, 2023 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |