Spoken Dialogue Systems

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 254 papers

Title	Date	Tasks	Status	Hype
WavChat: A Survey of Spoken Dialogue Models	Nov 15, 2024	speech-recognitionSpeech Recognition	CodeCode Available	3
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators	May 14, 2025	Spoken Dialogue Systems	CodeCode Available	2
Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language Model	Sep 20, 2023	ChatbotLanguage Modeling	CodeCode Available	1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations	Sep 28, 2021	BenchmarkingDialogue State Tracking	CodeCode Available	1
Plato Dialogue System: A Flexible Conversational AI Research Platform	Jan 17, 2020	Spoken Dialogue Systems	CodeCode Available	1
Prompt-Guided Turn-Taking Prediction	Jun 26, 2025	Language ModelingLanguage Modelling	—Unverified	0
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model	Jun 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
Towards a Japanese Full-duplex Spoken Dialogue System	Jun 3, 2025	Spoken Dialogue Systemstext-to-speech	—Unverified	0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems	May 31, 2025	Language ModelingLanguage Modelling	—Unverified	0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant	Mar 30, 2025	ChatbotCollaborative Inference	—Unverified	0
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems	Mar 11, 2025	DiversitySpoken Dialogue Systems	—Unverified	0
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics	Mar 3, 2025	BenchmarkingSpoken Dialogue Systems	—Unverified	0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems	Feb 19, 2025	Action DetectionActivity Detection	—Unverified	0
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems	Feb 19, 2025	Action DetectionActivity Detection	—Unverified	0
Multimodal Transformer Models for Turn-taking Prediction: Effects on Conversational Dynamics of Human-Agent Interaction during Cooperative Gameplay	Feb 5, 2025	Spoken Dialogue Systems	—Unverified	0
An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue	Jan 28, 2025	Language ModelingLanguage Modelling	—Unverified	0
Real-Time Textless Dialogue Generation	Jan 8, 2025	Dialogue GenerationRhythm	CodeCode Available	0
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios	Jan 2, 2025	feature selectionSpoken Dialogue Systems	—Unverified	0
SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training	Dec 20, 2024	Spoken Dialogue Systems	—Unverified	0
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation	Oct 23, 2024	Large Language ModelSpoken Dialogue Systems	—Unverified	0
Large Language Models Know What To Say But Not When To Speak	Oct 21, 2024	Spoken Dialogue Systems	—Unverified	0
Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups	Aug 20, 2024	Data AugmentationLanguage Modeling	—Unverified	0
SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech Synthesis	Aug 13, 2024	Speech SynthesisSpoken Dialogue Systems	CodeCode Available	0
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems	Jun 18, 2024	Language ModelingLanguage Modelling	—Unverified	0
Evaluation of a semi-autonomous attentive listening system with takeover prompting	Feb 21, 2024	Spoken Dialogue Systems	—Unverified	0
An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems	Jan 10, 2024	Spoken Dialogue Systems	—Unverified	0
An Analysis of Dialogue Repair in Voice Assistants	Nov 7, 2023	Spoken Dialogue Systems	—Unverified	0
Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks? -- Investigation of the Effects of Question Marks on Dialogue Systems	Aug 7, 2023	Sentencespeech-recognition	—Unverified	0
Unified Conversational Models with System-Initiated Transitions between Chit-Chat and Task-Oriented Dialogues	Jul 4, 2023	SentenceSpoken Dialogue Systems	—Unverified	0
OLISIA: a Cascade System for Spoken Dialogue State Tracking	Apr 20, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0
What Types of Questions Require Conversation to Answer? A Case Study of AskReddit Questions	Mar 30, 2023	Spoken Dialogue Systems	—Unverified	0
Transformers in Speech Processing: A Survey	Mar 21, 2023	Automatic Speech RecognitionSpeech Enhancement	—Unverified	0
Analysis and Utilization of Entrainment on Acoustic and Emotion Features in User-agent Dialogue	Dec 7, 2022	Spoken Dialogue Systemstext-to-speech	—Unverified	0
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS	Dec 5, 2022	Spoken Dialogue Systems	—Unverified	0
Interactivism in Spoken Dialogue Systems	Sep 27, 2022	Spoken Dialogue Systems	—Unverified	0
Simultaneous Job Interview System Using Multiple Semi-autonomous Agents	Sep 1, 2022	Dialogue UnderstandingKeyword Extraction	—Unverified	0
Using Transition Duration to Improve Turn-taking in Conversational Agents	Sep 1, 2022	Spoken Dialogue Systems	—Unverified	0
Symbol and Communicative Grounding through Object Permanence with a Mobile Robot	Sep 1, 2022	ObjectSpoken Dialogue Systems	—Unverified	0
When can I Speak? Predicting initiation points for spoken dialogue agents	Aug 7, 2022	Language ModelingLanguage Modelling	CodeCode Available	0
Acoustic Modeling for End-to-End Empathetic Dialogue Speech Synthesis Using Linguistic and Prosodic Contexts of Dialogue History	Jun 16, 2022	Self-Supervised LearningSentence	—Unverified	0
Towards Speech-only Opinion-level Sentiment Analysis	Jun 1, 2022	Sentiment AnalysisSpeaker Verification	—Unverified	0
Understanding How People Rate Their Conversations	Jun 1, 2022	Spoken Dialogue Systems	—Unverified	0
Duplex Conversation: Towards Human-like Interaction in Spoken Dialogue Systems	May 30, 2022	Data AugmentationSpoken Dialogue Systems	—Unverified	0
NLU for Game-based Learning in Real: Initial Evaluations	May 27, 2022	Intent RecognitionMath	—Unverified	0
Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System	May 9, 2022	Data AugmentationIntent Recognition	—Unverified	0
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Apr 28, 2022	Speaker IdentificationSpeaker Verification	CodeCode Available	0
Gated Multimodal Fusion with Contrastive Learning for Turn-taking Prediction in Human-robot Dialogue	Apr 18, 2022	Contrastive LearningData Augmentation	—Unverified	0
Dialogue Strategy Adaptation to New Action Sets Using Multi-dimensional Modelling	Apr 14, 2022	Dialogue ManagementManagement	—Unverified	0
A Cross-Domain Approach for Continuous Impression Recognition from Dyadic Audio-Visual-Physio Signals	Mar 25, 2022	Knowledge DistillationSpoken Dialogue Systems	—Unverified	0
EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification	Jan 16, 2022	Spoken Dialogue Systems	—Unverified	0

Show:10 25 50

← PrevPage 1 of 6Next →

No leaderboard results yet.