SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 126150 of 606 papers

TitleStatusHype
Lift Yourself Up: Retrieval-augmented Text Generation with Self MemoryCode1
RefGPT: Dialogue Generation of GPT, by GPT, and for GPTCode1
Towards Controllable Biases in Language GenerationCode1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
DEMO: Reframing Dialogue Interaction with Fine-grained Element ModelingCode1
MA-RLHF: Reinforcement Learning from Human Feedback with Macro ActionsCode1
MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware Medical Dialogue GenerationCode1
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue GenerationCode1
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive DialogueCode1
Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge AwarenessCode1
Linguistically-Informed Specificity and Semantic Plausibility for Dialogue GenerationCode0
Learning to Customize Model Structures for Few-shot Dialogue Generation TasksCode0
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational AutoencodersCode0
Learning Retrieval Augmentation for Personalized Dialogue GenerationCode0
Long-term Control for Dialogue Generation: Methods and EvaluationCode0
Knowledge-Grounded Dialogue Generation with Term-level De-noisingCode0
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue GenerationCode0
Language Detoxification with Attribute-Discriminative Latent SpaceCode0
Knowledge Diffusion for Neural Dialogue GenerationCode0
Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication TrainingCode0
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process FeedbackCode0
Latent Variable Dialogue Models and their DiversityCode0
Long Time No See! Open-Domain Conversation with Long-Term Persona MemoryCode0
Improving Medical Dialogue Generation with Abstract Meaning RepresentationsCode0
Adaptive Parameterization for Neural Dialogue GenerationCode0
Show:102550
← PrevPage 6 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified