| ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems | Mar 11, 2025 | Diversity, Spoken Dialogue Systems | Unverified | 0 |
| Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics | Mar 3, 2025 | Benchmarking, Spoken Dialogue Systems | Unverified | 0 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | Feb 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems | Feb 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| Multimodal Transformer Models for Turn-taking Prediction: Effects on Conversational Dynamics of Human-Agent Interaction during Cooperative Gameplay | Feb 5, 2025 | Spoken Dialogue Systems | Unverified | 0 |
| An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Jan 28, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Real-Time Textless Dialogue Generation | Jan 8, 2025 | Dialogue Generation, Rhythm | Code Available | 0 |
| OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios | Jan 2, 2025 | Feature Selection, Spoken Dialogue Systems | Unverified | 0 |
| SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Dec 20, 2024 | Spoken Dialogue Systems | Code Available | 0 |
| OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation | Oct 23, 2024 | Large Language Model, Spoken Dialogue Systems | Code Available | 0 |