SOTAVerified

Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Showing 1–25 of 308 papers

Title | Status | Hype
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset | Code | 4
TextBox 2.0: A Text Generation Library with Pre-trained Language Models | Code | 3
UnifiedSKG: Unifying and Multi-Tasking Structured Knowledge Grounding with Text-to-Text Language Models | Code | 2
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue | Code | 1
Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models | Code | 1
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset | Code | 1
Task-Oriented Dialogue with In-Context Learning | Code | 1
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models | Code | 1
Towards LLM-driven Dialogue State Tracking | Code | 1
InstructTODS: Large Language Models for End-to-End Task-Oriented Dialogue Systems | Code | 1
In-Context Learning User Simulators for Task-Oriented Dialog Systems | Code | 1
Schema-Guided User Satisfaction Modeling for Task-Oriented Dialogues | Code | 1
Pre-training Intent-Aware Encoders for Zero- and Few-Shot Intent Classification | Code | 1
A Hybrid Architecture for Out of Domain Intent Detection and Intent Discovery | Code | 1
Injecting Domain Knowledge in Language Models for Task-Oriented Dialogue Systems | Code | 1
Efficient Task-Oriented Dialogue Systems with Response Selection as an Auxiliary Task | Code | 1
A Multi-Task BERT Model for Schema-Guided Dialogue State Tracking | Code | 1
Learning Dialogue Representations from Consecutive Utterances | Code | 1
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning | Code | 1
Converse: A Tree-Based Modular Task-Oriented Dialogue System | Code | 1
An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue Generation | Code | 1
Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues | Code | 1
GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems | Code | 1
"How Robust r u?": Evaluating Task-Oriented Dialogue Systems on Spoken Conversations | Code | 1
Shades of BLEU, Flavours of Success: The Case of MultiWOZ | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | T5-3b (UnifiedSKG) | Entity F1 | 70.07 | | Unverified
2 | COMET | Entity F1 | 63.6 | | Unverified
3 | DF-Net | Entity F1 | 62.7 | | Unverified
4 | DF-Net | Entity F1 | 62.5 | | Unverified
5 | GLMP | Entity F1 | 59.97 | | Unverified
6 | TTOS | Entity F1 | 55.38 | | Unverified
7 | KB-retriever | Entity F1 | 53.7 | | Unverified
8 | DSR | Entity F1 | 51.9 | | Unverified
9 | KV Retrieval Net | Entity F1 | 48 | | Unverified
10 | THPN | Entity F1 | 37.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T5 | METEOR | 0.33 | | Unverified
2 | BART | METEOR | 0.09 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BART (TextBox 2.0) | BLEU-4 | 20.17 | | Unverified