| High-precision Voice Search Query Correction via Retrievable Speech-text Embedings | Jan 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploratory Evaluation of Speech Content Masking | Jan 8, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 |
| MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition | Jan 7, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge | Jan 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DiarizationLM: Speaker Diarization Post-Processing with Large Language Models | Jan 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 |
| TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR | Jan 6, 2024 | Active LearningAutomatic Speech Recognition | CodeCode Available | 0 |
| Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Jan 4, 2024 | AttributeAutomatic Speech Recognition | CodeCode Available | 0 |
| Hallucinations in Neural Automatic Speech Recognition: Identifying Errors and Hallucinatory Models | Jan 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Stateful Conformer with Cache-based Inference for Streaming Automatic Speech Recognition | Dec 27, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Towards Probing Contact Center Large Language Models | Dec 26, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The NUS-HLT System for ICASSP2024 ICMC-ASR Grand Challenge | Dec 26, 2023 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Exploring data augmentation in bias mitigation against non-native-accented speech | Dec 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multimodal Attention Merging for Improved Speech Recognition and Audio Event Classification | Dec 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BLSTM-Based Confidence Estimation for End-to-End Speech Recognition | Dec 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Lattice Rescoring Based on Large Ensemble of Complementary Neural Language Models | Dec 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Stable Distillation: Regularizing Continued Pre-training for Low-Resource Automatic Speech Recognition | Dec 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition? | Dec 19, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| SpokesBiz -- an Open Corpus of Conversational Polish | Dec 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficiency-oriented approaches for self-supervised speech representation learning | Dec 18, 2023 | Automatic Speech RecognitionRepresentation Learning | —Unverified | 0 |
| OAVA: the open audio-visual archives aggregator | Dec 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Conformer-Based Speech Recognition On Extreme Edge-Computing Devices | Dec 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Seq2seq for Automatic Paraphasia Detection in Aphasic Speech | Dec 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Generative Context-aware Fine-tuning of Self-supervised Speech Models | Dec 15, 2023 | Automatic Speech Recognitionnamed-entity-recognition | —Unverified | 0 |
| Leveraging Language ID to Calculate Intermediate CTC Loss for Enhanced Code-Switching Speech Recognition | Dec 15, 2023 | Automatic Speech RecognitionLanguage Identification | —Unverified | 0 |
| LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | Dec 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| FastInject: Injecting Unpaired Text Data into CTC-based ASR training | Dec 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Audio-visual fine-tuning of audio-only ASR models | Dec 14, 2023 | Automatic Speech RecognitionSelf-Supervised Learning | —Unverified | 0 |
| USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models | Dec 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PhasePerturbation: Speech Data Augmentation via Phase Perturbation for Automatic Speech Recognition | Dec 13, 2023 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Extending Whisper with prompt tuning to target-speaker ASR | Dec 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification | Dec 12, 2023 | Automatic Speech RecognitionDialect Identification | —Unverified | 0 |
| Creating Spoken Dialog Systems in Ultra-Low Resourced Settings | Dec 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning | Dec 11, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models | Dec 6, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training | Dec 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| End-to-End Speech-to-Text Translation: A Survey | Dec 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data | Nov 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| D4AM: A General Denoising Framework for Downstream Acoustic Models | Nov 28, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR | Nov 24, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Soft Random Sampling: A Theoretical and Empirical Analysis | Nov 21, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review | Nov 20, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| How does end-to-end speech recognition training impact speech enhancement artifacts? | Nov 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Nov 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition | Nov 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer | Nov 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |