SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Task-Oriented Dialogue Systems
Task-Oriented Dialogue Systems
Achieving a pre-defined task through a dialog.
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 1–10 of 308 papers
Title
Date
Tasks
Status
Hype
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals
Jun 4, 2025
Deep Reinforcement Learning
Evolutionary Algorithms
—
Unverified
0
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue
Jun 2, 2025
Task-Oriented Dialogue Systems
Code
Code Available
1
EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance
May 22, 2025
Task-Oriented Dialogue Systems
Code
Code Available
0
clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations
May 8, 2025
Benchmarking
Task-Oriented Dialogue Systems
—
Unverified
0
LANID: LLM-assisted New Intent Discovery
Mar 31, 2025
Intent Discovery
Task-Oriented Dialogue Systems
Code
Code Available
0
Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs
Mar 11, 2025
Dialogue State Tracking
slot-filling
—
Unverified
0
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues
Jan 21, 2025
Task-Oriented Dialogue Systems
Code
Code Available
0
Intent-driven In-context Learning for Few-shot Dialogue State Tracking
Dec 4, 2024
Dialogue State Tracking
In-Context Learning
—
Unverified
0
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems
Nov 15, 2024
Diversity
Task-Oriented Dialogue Systems
—
Unverified
0
Towards Automatic Evaluation of Task-Oriented Dialogue Flows
Nov 15, 2024
Task-Oriented Dialogue Systems
—
Unverified
0
Show:
10
25
50
← Prev
Page 1 of 31
Next →
All datasets
Kvret
SGD
MULTIWOZ 2.0
Benchmark Results
▼
SGD
2 submissions
↑ higher is better
#
Model
Metric
Claimed
Verified
Status
1
T5
METEOR
0.33
—
Unverified
2
BART
METEOR
0.09
—
Unverified