| Prompt-Guided Turn-Taking Prediction | Jun 26, 2025 | Language Modeling | Unverified | 0 |
| Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model | Jun 4, 2025 | Language Modeling | Unverified | 0 |
| Towards a Japanese Full-duplex Spoken Dialogue System | Jun 3, 2025 | Spoken Dialogue Systems, text-to-speech | Unverified | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language Modeling | Unverified | 0 |
| WavReward: Spoken Dialogue Models With Generalist Reward Evaluators | May 14, 2025 | Spoken Dialogue Systems | Code Available | 2 |
| Speculative End-Turn Detector for Efficient Speech Chatbot Assistant | Mar 30, 2025 | Chatbot, Collaborative Inference | Unverified | 0 |
| ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems | Mar 11, 2025 | Diversity, Spoken Dialogue Systems | Unverified | 0 |
| Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics | Mar 3, 2025 | Benchmarking, Spoken Dialogue Systems | Unverified | 0 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | Feb 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |
| FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems | Feb 19, 2025 | Action Detection, Activity Detection | Unverified | 0 |