SOTAVerified

Dialogue Generation

Dialogue generation is the task, within natural language processing, of "understanding" natural language inputs in order to produce output. The systems are usually intended for conversing with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) may address this.
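To make the weak-correlation point concrete, here is a minimal Python sketch of two word-overlap scores: a BLEU-1-style clipped unigram precision with brevity penalty, and a unigram F1. The function names `bleu1` and `unigram_f1` and the whitespace tokenisation are assumptions made for illustration; this is not the scoring code behind any leaderboard on this page, and production evaluations normally rely on an established library.

```python
import math
from collections import Counter


def bleu1(reference: str, hypothesis: str) -> float:
    """BLEU-1-style score: clipped unigram precision times a brevity penalty.

    Illustrative only: single reference, whitespace tokens, no smoothing.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not hyp:
        return 0.0
    ref_counts = Counter(ref)
    hyp_counts = Counter(hyp)
    # A hypothesis token is credited at most as often as it occurs in the reference.
    overlap = sum(min(n, ref_counts[tok]) for tok, n in hyp_counts.items())
    precision = overlap / len(hyp)
    # The brevity penalty lowers the score of hypotheses shorter than the reference.
    bp = 1.0 if len(hyp) >= len(ref) else math.exp(1 - len(ref) / len(hyp))
    return bp * precision


def unigram_f1(reference: str, hypothesis: str) -> float:
    """Harmonic mean of unigram precision and recall against one reference."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((Counter(ref) & Counter(hyp)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(hyp)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    reference = "i am fine thanks"
    # An exact copy of the reference gets the maximum score.
    print(bleu1(reference, "i am fine thanks"))
    # A reasonable paraphrase shares no tokens with the reference and scores zero.
    print(bleu1(reference, "doing well thank you"))
    print(unigram_f1(reference, "doing well thank you"))
```

The second reply is an acceptable answer to a human reader, yet both overlap scores are zero because it shares no words with the single reference. This is the kind of mismatch that motivates reference-free metrics such as USR and MaUde.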

Papers

Showing 251-275 of 606 papers

| Title | Status | Hype |
|---|---|---|
| CTRLStruct: Dialogue Structure Learning for Open-Domain Response Generation | Code | 0 |
| Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback | Code | 0 |
| Hierarchical Text Generation using an Outline | Code | 0 |
| A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation | Code | 0 |
| Knowledge-Grounded Dialogue Generation with Term-level De-noising | Code | 0 |
| Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering | Code | 0 |
| Cross Copy Network for Dialogue Generation | Code | 0 |
| Improving Medical Dialogue Generation with Abstract Meaning Representations | Code | 0 |
| HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation | Code | 0 |
| Improving Context Modelling in Multimodal Dialogue Generation | Code | 0 |
| An Auto-Encoder Matching Model for Learning Utterance-Level Semantic Dependency in Dialogue Generation | Code | 0 |
| Improving Conditional Sequence Generative Adversarial Networks by Stepwise Evaluation | Code | 0 |
| KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation | Code | 0 |
| Long Time No See! Open-Domain Conversation with Long-Term Persona Memory | Code | 0 |
| PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language Models | Code | 0 |
| CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | | 0 |
| Goal-Embedded Dual Hierarchical Model for Task-Oriented Dialogue Generation | | 0 |
| Counterfactual Off-Policy Training for Neural Dialogue Generation | | 0 |
| Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue | | 0 |
| GFDC: Graph Function Dependence for Logically Consistent Dialogue Response Beyond Persona Data | | 0 |
| Generating Relevant and Coherent Dialogue Responses using Self-separated Conditional Variational AutoEncoders | | 0 |
| Generating Personalized Dialogue via Multi-Task Meta-Learning | | 0 |
| Counterfactual Off-Policy Training for Neural Response Generation | | 0 |
| Attribute Controlled Dialogue Prompting | | 0 |
| Generating Emotionally Aligned Responses in Dialogues using Affect Control Theory | | 0 |

Benchmark Results

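| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LMEDR | Avg F1 | 21.99 | | Unverified |
| 2 | P^2 Bot | Avg F1 | 19.77 | | Unverified |
| 3 | TransferTransfo | Avg F1 | 19.09 | | Unverified |
| 4 | Seq2Seq + Attention | Avg F1 | 16.18 | | Unverified |
| 5 | Synthesizer (R+V) | BLEU-1 | 14.7 | | Unverified |
| 6 | KV Profile Memory | Avg F1 | 11.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Classification-based model | Slot Accuracy | 0.97 | | Unverified |
| 2 | Two-in-one model | Slot Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EVA | mauve | 0.97 | | Unverified |
| 2 | Per-BOB | mauve | 0.95 | | Unverified |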

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | mm | 1 in 10 R@2 | 5 | | Unverified |
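
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ∞-former (Sticky memories) | F1 | 9.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SpaceFusion | interest (human) | 2.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 4.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 34.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 11.43 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 95.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 3.72 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 29.01 | | Unverified |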