Task-Oriented Dialogue Systems

Achieving a pre-defined task through a dialog.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 308 papers

Title	Date	Tasks	Status	Hype
An Efficient Task-Oriented Dialogue Policy: Evolutionary Reinforcement Learning Injected by Elite Individuals	Jun 4, 2025	Deep Reinforcement LearningEvolutionary Algorithms	—Unverified	0
WHEN TO ACT, WHEN TO WAIT: Modeling Structural Trajectories for Intent Triggerability in Task-Oriented Dialogue	Jun 2, 2025	Task-Oriented Dialogue Systems	CodeCode Available	1
EnSToM: Enhancing Dialogue Systems with Entropy-Scaled Steering Vectors for Topic Maintenance	May 22, 2025	Task-Oriented Dialogue Systems	CodeCode Available	0
clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	May 8, 2025	BenchmarkingTask-Oriented Dialogue Systems	—Unverified	0
LANID: LLM-assisted New Intent Discovery	Mar 31, 2025	Intent DiscoveryTask-Oriented Dialogue Systems	CodeCode Available	0
Interpretable and Robust Dialogue State Tracking via Natural Language Summarization with LLMs	Mar 11, 2025	Dialogue State Trackingslot-filling	—Unverified	0
Leveraging Graph Structures and Large Language Models for End-to-End Synthetic Task-Oriented Dialogues	Jan 21, 2025	Task-Oriented Dialogue Systems	CodeCode Available	0
Intent-driven In-context Learning for Few-shot Dialogue State Tracking	Dec 4, 2024	Dialogue State TrackingIn-Context Learning	—Unverified	0
Towards Automatic Evaluation of Task-Oriented Dialogue Flows	Nov 15, 2024	Task-Oriented Dialogue Systems	—Unverified	0
Large Language Models as User-Agents for Evaluating Task-Oriented-Dialogue Systems	Nov 15, 2024	DiversityTask-Oriented Dialogue Systems	—Unverified	0
ReSpAct: Harmonizing Reasoning, Speaking, and Acting Towards Building Large Language Model-Based Conversational AI Agents	Nov 1, 2024	Decision MakingLanguage Modeling	—Unverified	0
DARD: A Multi-Agent Approach for Task-Oriented Dialog Systems	Nov 1, 2024	Task-Oriented Dialogue Systems	—Unverified	0
A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents	Oct 29, 2024	Intent DetectionTask-Oriented Dialogue Systems	—Unverified	0
Pseudo-Label Enhanced Prototypical Contrastive Learning for Uniformed Intent Discovery	Oct 26, 2024	ClusteringContrastive Learning	CodeCode Available	0
MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations	Oct 18, 2024	Natural Language UnderstandingTask-Oriented Dialogue Systems	—Unverified	0
Intent Detection in the Age of LLMs	Oct 2, 2024	Data AugmentationIn-Context Learning	—Unverified	0
Diversity-grounded Channel Prototypical Learning for Out-of-Distribution Intent Detection	Sep 17, 2024	Diversityintent-classification	—Unverified	0
Confidence Estimation for LLM-Based Dialogue State Tracking	Sep 15, 2024	Dialogue State TrackingHallucination	CodeCode Available	0
Keyword-Aware ASR Error Augmentation for Robust Dialogue State Tracking	Sep 10, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
Infusing Emotions into Task-oriented Dialogue Systems: Understanding, Management, and Generation	Aug 5, 2024	ManagementTask-Oriented Dialogue Systems	—Unverified	0
Unsupervised Extraction of Dialogue Policies from Conversations	Jun 21, 2024	Task-Oriented Dialogue Systems	—Unverified	0
Making Task-Oriented Dialogue Datasets More Natural by Synthetically Generating Indirect User Requests	Jun 12, 2024	Dialogue State TrackingNatural Language Understanding	—Unverified	0
Towards Spoken Language Understanding via Multi-level Multi-grained Contrastive Learning	May 31, 2024	Contrastive LearningIntent Detection	—Unverified	0
Unsupervised Mutual Learning of Dialogue Discourse Parsing and Topic Segmentation	May 30, 2024	Discourse ParsingGraph Neural Network	CodeCode Available	0
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation	May 17, 2024	Dialogue State TrackingTask-Oriented Dialogue Systems	—Unverified	0
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues	May 16, 2024	DiversityResponse Generation	CodeCode Available	0
Overcoming Catastrophic Forgetting by Exemplar Selection in Task-oriented Dialogue System	May 16, 2024	Continual LearningTask-Oriented Dialogue Systems	—Unverified	0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts	May 16, 2024	Dialogue State TrackingMixture-of-Experts	—Unverified	0
Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel	Apr 23, 2024	Task-Oriented Dialogue Systems	CodeCode Available	0
Simulating Task-Oriented Dialogues with State Transition Graphs and Large Language Models	Apr 23, 2024	Conversational Question AnsweringDialogue State Tracking	CodeCode Available	1
Rethinking the Evaluation of Dialogue Systems: Effects of User Feedback on Crowdworkers and LLMs	Apr 19, 2024	Task-Oriented Dialogue Systems	CodeCode Available	0
Context Does Matter: Implications for Crowdsourced Evaluation Labels in Task-Oriented Dialogue Systems	Apr 15, 2024	Task-Oriented Dialogue Systems	CodeCode Available	0
Conformal Intent Classification and Clarification for Fast and Accurate Intent Recognition	Mar 27, 2024	intent-classificationIntent Classification	—Unverified	0
CAUSE: Counterfactual Assessment of User Satisfaction Estimation in Task-Oriented Dialogue Systems	Mar 27, 2024	counterfactualData Augmentation	—Unverified	0
JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset	Mar 26, 2024	Dialogue State TrackingLanguage Modeling	CodeCode Available	1
RECIPE4U: Student-ChatGPT Interaction Dataset in EFL Writing Education	Mar 13, 2024	Intent DetectionTask-Oriented Dialogue Systems	—Unverified	0
Can Similarity-Based Domain-Ordering Reduce Catastrophic Forgetting for Intent Recognition?	Feb 21, 2024	Continual LearningIntent Recognition	—Unverified	0
Reliable LLM-based User Simulator for Task-Oriented Dialogue Systems	Feb 20, 2024	Data AugmentationTask-Oriented Dialogue Systems	—Unverified	0
Task-Oriented Dialogue with In-Context Learning	Feb 19, 2024	In-Context LearningNavigate	CodeCode Available	1
Exploring the Robustness of Task-oriented Dialogue Systems for Colloquial German Varieties	Feb 3, 2024	Intent Recognitionslot-filling	CodeCode Available	0
DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models	Jan 4, 2024	In-Context LearningTask-Oriented Dialogue Systems	CodeCode Available	1
Evaluating Task-oriented Dialogue Systems: A Systematic Review of Measures, Constructs and their Operationalisations	Dec 21, 2023	Task-Oriented Dialogue Systems	—Unverified	0
Exploring the Viability of Synthetic Audio Data for Audio-Based Dialogue State Tracking	Dec 4, 2023	Dialogue State TrackingTask-Oriented Dialogue Systems	CodeCode Available	0
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding	Nov 19, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified	0
LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems	Nov 15, 2023	Task-Oriented Dialogue Systems	CodeCode Available	0
Step by Step to Fairness: Attributing Societal Bias in Task-oriented Dialogue Systems	Nov 11, 2023	AttributeFairness	—Unverified	0
Schema Graph-Guided Prompt for Multi-Domain Dialogue State Tracking	Nov 10, 2023	Dialogue State TrackingGraph Neural Network	—Unverified	0
IndoToD: A Multi-Domain Indonesian Benchmark For End-to-End Task-Oriented Dialogue Systems	Nov 2, 2023	Task-Oriented Dialogue SystemsTransfer Learning	CodeCode Available	0
IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery	Oct 25, 2023	Contrastive Learningintent-classification	—Unverified	0
Towards LLM-driven Dialogue State Tracking	Oct 23, 2023	Dialogue State TrackingTask-Oriented Dialogue Systems	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 7Next →

All datasets Kvret SGD MULTIWOZ 2.0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	T5-3b(UnifiedSKG)	Entity F1	70.07	—	Unverified
2	COMET	Entity F1	63.6	—	Unverified
3	DF-Net	Entity F1	62.7	—	Unverified
4	DF-Net	Entity F1	62.5	—	Unverified
5	GLMP	Entity F1	59.97	—	Unverified
6	TTOS	Entity F1	55.38	—	Unverified
7	KB-retriever	Entity F1	53.7	—	Unverified
8	DSR	Entity F1	51.9	—	Unverified
9	KV Retrieval Net	Entity F1	48	—	Unverified
10	THPN	Entity F1	37.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	T5	METEOR	0.33	—	Unverified
2	BART	METEOR	0.09	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BART (TextBox 2.0)	BLEU-4	20.17	—	Unverified