SOTAVerified

Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Showing 76100 of 308 papers

TitleStatusHype
Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic SegmentationCode0
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation0
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System0
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented DialoguesCode0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts0
Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy ChannelCode0
Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMsCode0
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue SystemsCode0
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition0
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems0
RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education0
Can Similarity-Based Domain-Ordering Reduce Catastrophic Forgetting for Intent Recognition?0
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems0
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German VarietiesCode0
Evaluating Task-oriented Dialogue Systems: A Systematic Review of Measures, Constructs and their Operationalisations0
Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State TrackingCode0
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding0
LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systemsCode0
Step by Step to Fairness: Attributing Societal Bias in Task-oriented Dialogue Systems0
Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking0
IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue SystemsCode0
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery0
Turn-Level Active Learning for Dialogue State TrackingCode0
Dual-Feedback Knowledge Retrieval for Task-Oriented Dialogue Systems0
Retrieval-Augmented Neural Response Generation Using Logical Reasoning and Relevance Scoring0
Show:102550
← PrevPage 4 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5-3b(UnifiedSKG)Entity F170.07Unverified
2COMETEntity F163.6Unverified
3DF-NetEntity F162.7Unverified
4DF-NetEntity F162.5Unverified
5GLMPEntity F159.97Unverified
6TTOSEntity F155.38Unverified
7KB-retrieverEntity F153.7Unverified
8DSREntity F151.9Unverified
9KV Retrieval NetEntity F148Unverified
10THPNEntity F137.8Unverified
#ModelMetricClaimedVerifiedStatus
1T5METEOR0.33Unverified
2BARTMETEOR0.09Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-420.17Unverified