SOTAVerified|Agents Browse Leaderboard About Blog

Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 308 papers

Title	Date	Tasks	Status	Hype
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals	Jun 4, 2025	Deep Reinforcement LearningEvolutionary Algorithms	—Unverified	0
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue	Jun 2, 2025	Task-Oriented Dialogue Systems	CodeCode Available	1
EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance	May 22, 2025	Task-Oriented Dialogue Systems	CodeCode Available	0
clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	May 8, 2025	BenchmarkingTask-Oriented Dialogue Systems	—Unverified	0
LANID: LLM-assisted New Intent Discovery	Mar 31, 2025	Intent DiscoveryTask-Oriented Dialogue Systems	CodeCode Available	0
Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs	Mar 11, 2025	Dialogue State Trackingslot-filling	—Unverified	0
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues	Jan 21, 2025	Task-Oriented Dialogue Systems	CodeCode Available	0
Intent-driven In-context Learning for Few-shot Dialogue State Tracking	Dec 4, 2024	Dialogue State TrackingIn-Context Learning	—Unverified	0
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems	Nov 15, 2024	DiversityTask-Oriented Dialogue Systems	—Unverified	0
Towards Automatic Evaluation of Task-Oriented Dialogue Flows	Nov 15, 2024	Task-Oriented Dialogue Systems	—Unverified	0

Show:10 25 50

← PrevPage 1 of 31Next →

All datasets Kvret SGD MULTIWOZ 2.0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5	METEOR	0.33	—	Unverified
2	BART	METEOR	0.09	—	Unverified