| Prompt-Guided Turn-Taking Prediction | Jun 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards a Japanese Full-duplex Spoken Dialogue System | Jun 3, 2025 | Spoken Dialogue Systemstext-to-speech | —Unverified | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WavReward: Spoken Dialogue Models With Generalist Reward Evaluators | May 14, 2025 | Spoken Dialogue Systems | CodeCode Available | 2 |
| Speculative End-Turn Detector for Efficient Speech Chatbot Assistant | Mar 30, 2025 | ChatbotCollaborative Inference | —Unverified | 0 |
| ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems | Mar 11, 2025 | DiversitySpoken Dialogue Systems | —Unverified | 0 |
| Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics | Mar 3, 2025 | BenchmarkingSpoken Dialogue Systems | —Unverified | 0 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | Feb 19, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems | Feb 19, 2025 | Action DetectionActivity Detection | —Unverified | 0 |
| Multimodal Transformer Models for Turn-taking Prediction: Effects on Conversational Dynamics of Human-Agent Interaction during Cooperative Gameplay | Feb 5, 2025 | Spoken Dialogue Systems | —Unverified | 0 |
| An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Real-Time Textless Dialogue Generation | Jan 8, 2025 | Dialogue GenerationRhythm | CodeCode Available | 0 |
| OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios | Jan 2, 2025 | feature selectionSpoken Dialogue Systems | —Unverified | 0 |
| SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Dec 20, 2024 | Spoken Dialogue Systems | CodeCode Available | 0 |
| WavChat: A Survey of Spoken Dialogue Models | Nov 15, 2024 | speech-recognitionSpeech Recognition | CodeCode Available | 3 |
| OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation | Oct 23, 2024 | Large Language ModelSpoken Dialogue Systems | CodeCode Available | 0 |
| Large Language Models Know What To Say But Not When To Speak | Oct 21, 2024 | Spoken Dialogue Systems | —Unverified | 0 |
| Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups | Aug 20, 2024 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis | Aug 13, 2024 | Speech SynthesisSpoken Dialogue Systems | CodeCode Available | 0 |
| PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluation of a semi-autonomous attentive listening system with takeover prompting | Feb 21, 2024 | Spoken Dialogue Systems | —Unverified | 0 |
| An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems | Jan 10, 2024 | Spoken Dialogue Systems | —Unverified | 0 |
| An Analysis of Dialogue Repair in Voice Assistants | Nov 7, 2023 | Spoken Dialogue Systems | —Unverified | 0 |
| Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model | Sep 20, 2023 | ChatbotLanguage Modeling | CodeCode Available | 1 |