SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 201225 of 606 papers

TitleStatusHype
Learning to Customize Model Structures for Few-shot Dialogue Generation TasksCode0
A Neural Topical Expansion Framework for Unstructured Persona-oriented Dialogue GenerationCode0
End-to-end Adversarial Learning for Generative Conversational AgentsCode0
BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response GenerationCode0
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational AutoencodersCode0
Dialogue Generation: From Imitation Learning to Inverse Reinforcement LearningCode0
Dialogue Benchmark Generation from Knowledge Graphs with Cost-Effective Retrieval-Augmented LLMsCode0
Measuring and Improving Semantic Diversity of Dialogue GenerationCode0
Multiple Generative Models Ensemble for Knowledge-Driven Proactive Human-Computer Dialogue AgentCode0
Adversarial Learning for Neural Dialogue GenerationCode0
Knowledge Diffusion for Neural Dialogue GenerationCode0
Knowledge-Grounded Dialogue Generation with Term-level De-noisingCode0
Bilateral Personalized Dialogue Generation with Contrastive LearningCode0
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent VariableCode0
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process FeedbackCode0
Better Conversations by Modeling, Filtering, and Optimizing for Coherence and DiversityCode0
DESED: Dialogue-based Explanation for Sentence-level Event DetectionCode0
Evaluating Dialogue Generation Systems via Response SelectionCode0
An End-to-End Model for Photo-Sharing Multi-modal Dialogue GenerationCode0
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue GenerationCode0
Event Transition Planning for Open-ended Text GenerationCode0
Explicit State Tracking with Semi-Supervision for Neural Dialogue GenerationCode0
An Empirical Study of Multitask Learning to Improve Open Domain Dialogue SystemsCode0
Exploiting Pairwise Mutual Information for Knowledge-Grounded DialogueCode0
Deep Reinforcement Learning for Dialogue GenerationCode0
Show:102550
← PrevPage 9 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified