SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 201225 of 606 papers

TitleStatusHype
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue GenerationCode1
Topic-switch adapted Japanese Dialogue System based on PLATO-20
CAB: Empathetic Dialogue Generation with Cognition, Affection and BehaviorCode0
Commonsense-Aware Prompting for Controllable Empathetic Dialogue Generation0
Response-act Guided Reinforced Dialogue Generation for Mental Health Counseling0
GE-Blender: Graph-Based Knowledge Enhancement for Blender0
Learning to Memorize Entailment and Discourse Relations for Persona-Consistent DialoguesCode1
Improving a sequence-to-sequence nlp model using a reinforcement learning policy algorithm0
Ontologically Faithful Generation of Non-Player Character Dialogues0
CausalDialogue: Modeling Utterance-level Causality in ConversationsCode0
SODA: Million-scale Dialogue Distillation with Social Commonsense ContextualizationCode2
SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic MistakesCode1
InferEM: Inferring the Speaker's Intention for Empathetic Dialogue Generation0
Towards Diverse, Relevant and Coherent Open-Domain Dialogue Generation via Hybrid Latent Variables0
Modeling Complex Dialogue Mappings via Sentence Semantic Segmentation Guided Conditional Variational Auto-Encoder0
CDialog: A Multi-turn Covid-19 Conversation Dataset for Entity-Aware Dialog GenerationCode0
Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with CharactersCode1
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded ConversationCode0
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation0
Terminology-aware Medical Dialogue GenerationCode1
Personalized Dialogue Generation with Persona-Adaptive AttentionCode1
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation0
There Is No Standard Answer: Knowledge-Grounded Dialogue Generation with Adversarial Activated Multi-Reference Learning0
Transformer-Based Conditioned Variational Autoencoder for Dialogue Generation0
Towards Efficient Dialogue Pre-training with Transferable and Interpretable Latent Structure0
Show:102550
← PrevPage 9 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified