| WavChat: A Survey of Spoken Dialogue Models | Nov 15, 2024 | speech-recognitionSpeech Recognition | CodeCode Available | 3 | 5 |
| WavReward: Spoken Dialogue Models With Generalist Reward Evaluators | May 14, 2025 | Spoken Dialogue Systems | CodeCode Available | 2 | 5 |
| Plato Dialogue System: A Flexible Conversational AI Research Platform | Jan 17, 2020 | Spoken Dialogue Systems | CodeCode Available | 1 | 5 |
| Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model | Sep 20, 2023 | ChatbotLanguage Modeling | CodeCode Available | 1 | 5 |
| "How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations | Sep 28, 2021 | BenchmarkingDialogue State Tracking | CodeCode Available | 1 | 5 |
| Hierarchical Multi-Task Natural Language Understanding for Cross-domain Conversational AI: HERMIT NLU | Oct 2, 2019 | Natural Language UnderstandingSentence | CodeCode Available | 0 | 5 |
| PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Natural Language Generation by Hierarchical Decoding with Linguistic Patterns | Aug 8, 2018 | DecoderSentence | CodeCode Available | 0 | 5 |
| A dataset for resolving referring expressions in spoken dialogue via contextual query rewrites (CQR) | Mar 28, 2019 | Spoken Dialogue SystemsSpoken Language Understanding | CodeCode Available | 0 | 5 |
| Slot-Gated Modeling for Joint Slot Filling and Intent Prediction | Jun 1, 2018 | global-optimizationIntent Detection | CodeCode Available | 0 | 5 |
| SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training | Dec 20, 2024 | Spoken Dialogue Systems | CodeCode Available | 0 | 5 |
| OLISIA: a Cascade System for Spoken Dialogue State Tracking | Apr 20, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification | Apr 28, 2022 | Speaker IdentificationSpeaker Verification | CodeCode Available | 0 | 5 |
| Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling | Mar 31, 2018 | Dialogue ManagementDomain Adaptation | CodeCode Available | 0 | 5 |
| OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation | Oct 23, 2024 | Large Language ModelSpoken Dialogue Systems | CodeCode Available | 0 | 5 |
| SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis | Aug 13, 2024 | Speech SynthesisSpoken Dialogue Systems | CodeCode Available | 0 | 5 |
| How Time Matters: Learning Time-Decay Attention for Contextual Spoken Language Understanding in Dialogues | Jun 1, 2018 | Dialogue State TrackingImage Captioning | CodeCode Available | 0 | 5 |
| Real-Time Textless Dialogue Generation | Jan 8, 2025 | Dialogue GenerationRhythm | CodeCode Available | 0 | 5 |
| Sequence-to-Sequence Generation for Spoken Dialogue via Deep Syntax Trees and Strings | Aug 1, 2016 | Spoken Dialogue SystemsText Generation | CodeCode Available | 0 | 5 |
| Variational Cross-domain Natural Language Generation for Spoken Dialogue Systems | Dec 20, 2018 | DiversitySentence | CodeCode Available | 0 | 5 |
| A Context-aware Natural Language Generator for Dialogue Systems | Sep 1, 2016 | Spoken Dialogue SystemsText Generation | CodeCode Available | 0 | 5 |
| When can I Speak? Predicting initiation points for spoken dialogue agents | Aug 7, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Findings of the E2E NLG Challenge | Oct 2, 2018 | Data-to-Text GenerationSpoken Dialogue Systems | CodeCode Available | 0 | 5 |
| Modeling ASR Ambiguity for Dialogue State Tracking Using Word Confusion Networks | Feb 3, 2020 | Dialogue State TrackingSpoken Dialogue Systems | CodeCode Available | 0 | 5 |
| A Context-aware Natural Language Generator for Dialogue Systems | Aug 25, 2016 | Spoken Dialogue SystemsText Generation | CodeCode Available | 0 | 5 |
| Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems | Aug 7, 2015 | InformativenessSentence | CodeCode Available | 0 | 5 |
| An Empirical Study of Self-Disclosure in Spoken Dialogue Systems | Jul 1, 2018 | Spoken Dialogue Systems | —Unverified | 0 | 0 |
| Adversarial Training for Multi-task and Multi-lingual Joint Modeling of Utterance Intent Classification | Oct 1, 2018 | General Classificationintent-classification | —Unverified | 0 | 0 |
| An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems | Jan 10, 2024 | Spoken Dialogue Systems | —Unverified | 0 | 0 |
| An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue | Jan 28, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| An Analysis of the Effect of Emotional Speech Synthesis on Non-Task-Oriented Dialogue System | Jul 1, 2018 | Dialogue ManagementEmotional Speech Synthesis | —Unverified | 0 | 0 |
| Adversarial Domain Adaptation for Variational Neural Language Generation in Dialogue Systems | Aug 8, 2018 | DecoderDomain Adaptation | —Unverified | 0 | 0 |
| Acquisition and Assessment of Semantic Content for the Generation of Elaborateness and Indirectness in Spoken Dialogue Systems | Nov 1, 2017 | Cultural Vocal Bursts Intensity PredictionSpoken Dialogue Systems | —Unverified | 0 | 0 |
| An Analysis of Older Users' Interactions with Spoken Dialogue Systems | May 1, 2014 | Dialogue ManagementSpoken Dialogue Systems | —Unverified | 0 | 0 |
| An Analysis of Dialogue Repair in Voice Assistants | Nov 7, 2023 | Spoken Dialogue Systems | —Unverified | 0 | 0 |
| Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue | Dec 7, 2022 | Spoken Dialogue Systemstext-to-speech | —Unverified | 0 | 0 |
| Chat Detection in an Intelligent Assistant: Combining Task-oriented and Non-task-oriented Spoken Dialogue Systems | May 2, 2017 | Spoken Dialogue Systems | —Unverified | 0 | 0 |
| A Multithreaded Conversational Interface for Pedestrian Navigation and Question Answering | Aug 1, 2013 | Question AnsweringSpoken Dialogue Systems | —Unverified | 0 | 0 |
| Addressing Objects and Their Relations: The Conversational Entity Dialogue Model | Jan 5, 2019 | Spoken Dialogue Systems | —Unverified | 0 | 0 |
| Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History | Jun 16, 2022 | Self-Supervised LearningSentence | —Unverified | 0 | 0 |
| A Multimodal Corpus of Rapid Dialogue Games | May 1, 2014 | Dialogue ManagementManagement | —Unverified | 0 | 0 |
| Book Review: Interactive Multi-Modal Question-Answering by Antal van den Bosch and Gosse Bouma | Jan 1, 2012 | Dialogue ManagementQuestion Answering | —Unverified | 0 | 0 |
| Bielefeld SC: Orthonormal Topic Modelling for Grammar Induction | Aug 1, 2014 | Information RetrievalSemantic Textual Similarity | —Unverified | 0 | 0 |
| A Model of Zero-Shot Learning of Spoken Language Understanding | Sep 1, 2015 | One-Shot LearningSpoken Dialogue Systems | —Unverified | 0 | 0 |
| HUMBO: Bridging Response Generation and Facial Expression Synthesis | May 24, 2019 | Dialogue Generationmultimodal interaction | —Unverified | 0 | 0 |
| Chain-of-Thought Training for Open E2E Spoken Dialogue Systems | May 31, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Adaptive Generation in Dialogue Systems Using Dynamic User Modeling | Dec 1, 2014 | Dialogue ManagementSpoken Dialogue Systems | —Unverified | 0 | 0 |
| Code-switched inspired losses for spoken dialog representations | Nov 1, 2021 | RetrievalSpoken Dialogue Systems | —Unverified | 0 | 0 |
| Combining Incremental Language Generation and Incremental Speech Synthesis for Adaptive Information Presentation | Jul 1, 2012 | Speech SynthesisSpoken Dialogue Systems | —Unverified | 0 | 0 |
| Automation and Optimisation of Humor Trait Generation in a Vocal Dialogue System | Nov 1, 2018 | Dialogue ManagementSpoken Dialogue Systems | —Unverified | 0 | 0 |