| Communication-Efficient Personalized Federated Learning for Speech-to-Text Tasks | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition | Jan 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement | Jan 17, 2024 | Automatic Speech Recognition, Speech Enhancement | Unverified | 0 |
| Improving ASR Contextual Biasing with Guided Attention | Jan 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription | Jan 16, 2024 | Automatic Speech Recognition, Benchmarking | Unverified | 0 |
| Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization | Jan 16, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| SeMaScore : a new evaluation metric for automatic speech recognition tasks | Jan 15, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Cascaded Cross-Modal Transformer for Audio-Textual Classification | Jan 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Promptformer: Prompted Conformer Transducer for ASR | Jan 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization | Jan 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Transcending Controlled Environments: Assessing the Transferability of ASR-Robust NLU Models to Real-World Applications | Jan 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| XLS-R Deep Learning Model for Multilingual ASR on Low-Resource Languages: Indonesian, Javanese, and Sundanese | Jan 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| UCorrect: An Unsupervised Framework for Automatic Speech Recognition Error Correction | Jan 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| End to end Hindi to English speech conversion using Bark, mBART and a finetuned XLSR Wav2Vec2 | Jan 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Useful Blunders: Can Automated Speech Recognition Errors Improve Downstream Dementia Classification? | Jan 10, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Continuously Learning New Words in Automatic Speech Recognition | Jan 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Exploratory Evaluation of Speech Content Masking | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| High-precision Voice Search Query Correction via Retrievable Speech-text Embedings | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LUPET: Incorporating Hierarchical Information Path into Multilingual ASR | Jan 8, 2024 | Acoustic Unit Discovery, Automatic Speech Recognition | Unverified | 0 |
| BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators | Jan 8, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge | Jan 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 0 |
| TeLeS: Temporal Lexeme Similarity Score to Estimate Confidence in End-to-End ASR | Jan 6, 2024 | Active Learning, Automatic Speech Recognition | Code Available | 0 |
| Task Oriented Dialogue as a Catalyst for Self-Supervised Automatic Speech Recognition | Jan 4, 2024 | Attribute, Automatic Speech Recognition | Code Available | 0 |