| Multimodal Transformer Models for Turn-taking Prediction: Effects on Conversational Dynamics of Human-Agent Interaction during Cooperative Gameplay | Feb 5, 2025 | Spoken Dialogue Systems | Unverified | 0 |
| An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Jan 28, 2025 | Language Modeling | Unverified | 0 |
| Real-Time Textless Dialogue Generation | Jan 8, 2025 | Dialogue Generation, Rhythm | Code Available | 0 |
| OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios | Jan 2, 2025 | Feature Selection, Spoken Dialogue Systems | Unverified | 0 |
| SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Dec 20, 2024 | Spoken Dialogue Systems | Code Available | 0 |
| WavChat: A Survey of Spoken Dialogue Models | Nov 15, 2024 | Speech Recognition | Code Available | 3 |
| OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation | Oct 23, 2024 | Large Language Model, Spoken Dialogue Systems | Code Available | 0 |
| Large Language Models Know What To Say But Not When To Speak | Oct 21, 2024 | Spoken Dialogue Systems | Unverified | 0 |
| Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups | Aug 20, 2024 | Data Augmentation, Language Modeling | Unverified | 0 |
| SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis | Aug 13, 2024 | Speech Synthesis, Spoken Dialogue Systems | Code Available | 0 |