SOTAVerified

Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Showing 150 of 308 papers

TitleStatusHype
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals0
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented DialogueCode1
EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic MaintenanceCode0
clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations0
LANID: LLM-assisted New Intent DiscoveryCode0
Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs0
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented DialoguesCode0
Intent-driven In-context Learning for Few-shot Dialogue State Tracking0
Towards Automatic Evaluation of Task-Oriented Dialogue Flows0
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems0
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents0
DARD: A Multi-Agent Approach for Task-Oriented Dialog Systems0
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents0
Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent DiscoveryCode0
MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations0
Intent Detection in the Age of LLMs0
Diversity-grounded Channel Prototypical Learning for Out-of-Distribution Intent Detection0
Confidence Estimation for LLM-Based Dialogue State TrackingCode0
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking0
Infusing Emotions into Task-oriented Dialogue Systems: Understanding, Management, and Generation0
Unsupervised Extraction of Dialogue Policies from Conversations0
Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests0
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning0
Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic SegmentationCode0
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation0
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented DialoguesCode0
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts0
Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy ChannelCode0
Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language ModelsCode1
Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMsCode0
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue SystemsCode0
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition0
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems0
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue DatasetCode1
RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education0
Can Similarity-Based Domain-Ordering Reduce Catastrophic Forgetting for Intent Recognition?0
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems0
Task-Oriented Dialogue with In-Context LearningCode1
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German VarietiesCode0
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language ModelsCode1
Evaluating Task-oriented Dialogue Systems: A Systematic Review of Measures, Constructs and their Operationalisations0
Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State TrackingCode0
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding0
LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systemsCode0
Step by Step to Fairness: Attributing Societal Bias in Task-oriented Dialogue Systems0
Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking0
IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue SystemsCode0
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery0
Towards LLM-driven Dialogue State TrackingCode1
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1T5-3b(UnifiedSKG)Entity F170.07Unverified
2COMETEntity F163.6Unverified
3DF-NetEntity F162.7Unverified
4DF-NetEntity F162.5Unverified
5GLMPEntity F159.97Unverified
6TTOSEntity F155.38Unverified
7KB-retrieverEntity F153.7Unverified
8DSREntity F151.9Unverified
9KV Retrieval NetEntity F148Unverified
10THPNEntity F137.8Unverified
#ModelMetricClaimedVerifiedStatus
1T5METEOR0.33Unverified
2BARTMETEOR0.09Unverified
#ModelMetricClaimedVerifiedStatus
1BART (TextBox 2.0)BLEU-420.17Unverified