SOTAVerified

Spoken Dialogue Systems

Papers

Showing 125 of 254 papers

TitleStatusHype
WavChat: A Survey of Spoken Dialogue ModelsCode3
WavReward: Spoken Dialogue Models With Generalist Reward EvaluatorsCode2
Towards Joint Modeling of Dialogue Response and Speech Synthesis based on Large Language ModelCode1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken ConversationsCode1
Plato Dialogue System: A Flexible Conversational AI Research PlatformCode1
Prompt-Guided Turn-Taking Prediction0
Towards Efficient Speech-Text Jointly Decoding within One Speech Language Model0
Towards a Japanese Full-duplex Spoken Dialogue System0
Chain-of-Thought Training for Open E2E Spoken Dialogue Systems0
Speculative End-Turn Detector for Efficient Speech Chatbot Assistant0
ESPnet-SDS: Unified Toolkit and Demo for Spoken Dialogue Systems0
Talking Turns: Benchmarking Audio Foundation Models on Turn-Taking Dynamics0
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems0
Multimodal Transformer Models for Turn-taking Prediction: Effects on Conversational Dynamics of Human-Agent Interaction during Cooperative Gameplay0
An LLM Benchmark for Addressee Recognition in Multi-modal Multi-party Dialogue0
Real-Time Textless Dialogue GenerationCode0
OmniChat: Enhancing Spoken Dialogue Systems with Scalable Synthetic Data for Diverse Scenarios0
SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage TrainingCode0
OmniFlatten: An End-to-end GPT Model for Seamless Voice ConversationCode0
Large Language Models Know What To Say But Not When To Speak0
Data Augmentation Integrating Dialogue Flow and Style to Adapt Spoken Dialogue Systems to Low-Resource User Groups0
SaSLaW: Dialogue Speech Corpus with Audio-visual Egocentric Information Toward Environment-adaptive Dialogue Speech SynthesisCode0
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue SystemsCode0
Evaluation of a semi-autonomous attentive listening system with takeover prompting0
Show:102550
← PrevPage 1 of 11Next →

No leaderboard results yet.