| Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models | Dec 6, 2023 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Integrating Pre-Trained Speech and Language Models for End-to-End Speech Recognition | Dec 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bigger is not Always Better: The Effect of Context Size on Speech Pre-Training | Dec 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| End-to-End Speech-to-Text Translation: A Survey | Dec 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-end Joint Punctuated and Normalized ASR with a Limited Amount of Punctuated Training Data | Nov 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Quantitative Approach to Understand Self-Supervised Models as Cross-lingual Feature Extractors | Nov 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR | Nov 24, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Soft Random Sampling: A Theoretical and Empirical Analysis | Nov 21, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| How does end-to-end speech recognition training impact speech enhancement artifacts? | Nov 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| App for Resume-Based Job Matching with Speech Interviews and Grammar Analysis: A Review | Nov 20, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Label-Synchronous Neural Transducer for Adaptable Online E2E Speech Recognition | Nov 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding | Nov 19, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Large-scale Deep Biasing with Phoneme Features and Text-only Data in Streaming Transducer | Nov 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Retrieve and Copy: Scaling ASR Personalization to Large Catalogs | Nov 14, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| On the Effectiveness of ASR Representations in Real-world Noisy Speech Emotion Recognition | Nov 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| 1SPU: 1-step Speech Processing Unit | Nov 8, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition | Nov 7, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Fine-tuning convergence model in Bengali speech recognition | Nov 7, 2023 | Automatic Speech Recognitionmodel | —Unverified | 0 |
| Pseudo-Labeling for Domain-Agnostic Bangla Automatic Speech Recognition | Nov 6, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning | Nov 3, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Server-side Rescoring of Spoken Entity-centric Knowledge Queries for Virtual Assistants | Nov 2, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios | Oct 31, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Combining Language Models For Specialized Domains: A Colorful Approach | Oct 30, 2023 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |