SOTAVerified

Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Showing 3140 of 308 papers

TitleStatusHype
Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMsCode0
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue SystemsCode0
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems0
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition0
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue DatasetCode1
RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education0
Can Similarity-Based Domain-Ordering Reduce Catastrophic Forgetting for Intent Recognition?0
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems0
Task-Oriented Dialogue with In-Context LearningCode1
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German VarietiesCode0
Show:102550
← PrevPage 4 of 31Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5-3b(UnifiedSKG)Entity F170.07Unverified
2COMETEntity F163.6Unverified
3DF-NetEntity F162.7Unverified
4DF-NetEntity F162.5Unverified
5GLMPEntity F159.97Unverified
6TTOSEntity F155.38Unverified
7KB-retrieverEntity F153.7Unverified
8DSREntity F151.9Unverified
9KV Retrieval NetEntity F148Unverified
10THPNEntity F137.8Unverified
#ModelMetricClaimedVerifiedStatus
1T5METEOR0.33Unverified
2BARTMETEOR0.09Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-420.17Unverified