SOTAVerified

Task-Oriented Dialogue Systems

Task-oriented dialogue systems aim to accomplish a predefined task through dialogue.

Papers

Showing 51-60 of 308 papers

| Title | Status | Hype |
|---|---|---|
| MultiWOZ 2.4: A Multi-Domain Task-Oriented Dialogue Dataset with Essential Annotation Corrections to Improve State Tracking Evaluation | Code | 1 |
| ConvLab-2: An Open-Source Toolkit for Building, Evaluating, and Diagnosing Dialogue Systems | Code | 1 |
| Simulating User Satisfaction for the Evaluation of Task-oriented Dialogue Systems | Code | 1 |
| Crossing the Conversational Chasm: A Primer on Natural Language Processing for Multilingual Task-Oriented Dialogue Systems | | 0 |
| A Graph-to-Sequence Model for Joint Intent Detection and Slot Filling in Task-Oriented Dialogue Systems | | 0 |
| An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals | | 0 |
| An Efficient Approach to Encoding Context for Spoken Language Understanding | | 0 |
| Combining Task and Dialogue Streams in Unsupervised Dialogue Act Models | | 0 |
| clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | | 0 |
| Counterfactual Matters: Intrinsic Probing For Dialogue State Tracking | | 0 |

Benchmark Results

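| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | T5-3b(UnifiedSKG) | Entity F1 | 70.07 | | Unverified |
| 2 | COMET | Entity F1 | 63.6 | | Unverified |
| 3 | DF-Net | Entity F1 | 62.7 | | Unverified |
| 4 | DF-Net | Entity F1 | 62.5 | | Unverified |
| 5 | GLMP | Entity F1 | 59.97 | | Unverified |
| 6 | TTOS | Entity F1 | 55.38 | | Unverified |
| 7 | KB-retriever | Entity F1 | 53.7 | | Unverified |
| 8 | DSR | Entity F1 | 51.9 | | Unverified |
| 9 | KV Retrieval Net | Entity F1 | 48 | | Unverified |
| 10 | THPN | Entity F1 | 37.8 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | T5 | METEOR | 0.33 | | Unverified |
| 2 | BART | METEOR | 0.09 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BART (TextBox 2.0) | BLEU-4 | 20.17 | | Unverified |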