SOTAVerified

Dialogue Generation

Dialogue generation is the natural language processing task of "understanding" natural language inputs in order to produce output. Such systems are usually intended for conversing with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see related tasks such as Natural Language Understanding for others) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
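To make the weak-correlation point concrete, the following is a minimal sketch of two word-overlap scores: clipped unigram precision (the core of BLEU-1, here without the brevity penalty) and unigram F1. The function names and the whitespace tokenizer are assumptions made for illustration; they are not the scoring scripts used by any benchmark listed on this page.

```python
from collections import Counter


def tokenize(text: str) -> list:
    # Naive lowercase whitespace tokenization (an assumption; real
    # evaluations usually apply a benchmark-specific tokenizer).
    return text.lower().split()


def unigram_overlap(hypothesis: str, reference: str) -> int:
    # Clipped match count: a reference token can be matched at most
    # as many times as it occurs in the reference.
    hyp = Counter(tokenize(hypothesis))
    ref = Counter(tokenize(reference))
    return sum((hyp & ref).values())


def unigram_precision(hypothesis: str, reference: str) -> float:
    # Fraction of hypothesis tokens that also appear in the reference.
    n = len(tokenize(hypothesis))
    return unigram_overlap(hypothesis, reference) / n if n else 0.0


def unigram_f1(hypothesis: str, reference: str) -> float:
    # Harmonic mean of unigram precision and recall.
    overlap = unigram_overlap(hypothesis, reference)
    if overlap == 0:
        return 0.0
    precision = overlap / len(tokenize(hypothesis))
    recall = overlap / len(tokenize(reference))
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    reference = "i am doing well thanks"
    print(unigram_f1("i am doing well", reference))
    print(unigram_f1("pretty good how about you", reference))
```

In this sketch, a perfectly reasonable reply such as "pretty good how about you" shares no tokens with the reference and therefore scores zero, which is one way to see why reference-based overlap metrics can disagree with human judgement in open-ended dialogue.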

Papers

Showing 151–200 of 606 papers

| Title | Status | Hype |
| --- | --- | --- |
| ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation | Code | 0 |
| Does Collaborative Human-LM Dialogue Generation Help Information Extraction from Human Dialogues? | | 0 |
| DecompEval: Evaluating Generated Texts as Unsupervised Decomposed Question Answering | Code | 0 |
| Attribute Controlled Dialogue Prompting | | 0 |
| DialoGPS: Dialogue Path Sampling in Continuous Semantic Space for Data Augmentation in Multi-Turn Conversations | | 0 |
| Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation | Code | 1 |
| KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation | Code | 0 |
| MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation | | 0 |
| Seen to Unseen: Exploring Compositional Generalization of Multi-Attribute Controllable Dialogue Generation | Code | 0 |
| HAUSER: Towards Holistic and Automatic Evaluation of Simile Generation | Code | 0 |
| MidMed: Towards Mixed-Type Dialogues for Medical Consultation | Code | 0 |
| Diverse and Faithful Knowledge-Grounded Dialogue Generation via Sequential Posterior Inference | Code | 1 |
| VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions | Code | 1 |
| Knowledge Graph-Augmented Language Models for Knowledge-Grounded Dialogue Generation | | 0 |
| Contextual Knowledge Learning For Dialogue Generation | | 0 |
| Medical Dialogue Generation via Dual Flow Modeling | Code | 1 |
| GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking | | 0 |
| Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense Knowledge | Code | 1 |
| RefGPT: Dialogue Generation of GPT, by GPT, and for GPT | Code | 1 |
| Dolphin: A Challenging and Diverse Benchmark for Arabic NLG | | 0 |
| Evaluate What You Can't Evaluate: Unassessable Quality for Generated Response | | 0 |
| Cross-lingual Data Augmentation for Document-grounded Dialog Systems in Low Resource Languages | | 0 |
| Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization | Code | 1 |
| Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense Persona | Code | 1 |
| PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning | | 0 |
| DiffuSIA: A Spiral Interaction Architecture for Encoder-Decoder Text Diffusion | | 0 |
| SimOAP: Improve Coherence and Consistency in Persona-based Dialogue Generation via Over-sampling and Post-evaluation | Code | 0 |
| IMAD: IMage-Augmented multi-modal Dialogue | Code | 0 |
| NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist | Code | 3 |
| Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling | Code | 1 |
| Prompt Learning to Mitigate Catastrophic Forgetting in Cross-lingual Transfer for Open-domain Dialogue Generation | Code | 0 |
| CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge Aggregation | Code | 0 |
| Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue | Code | 1 |
| Controllable Mixed-Initiative Dialogue Generation through Prompting | Code | 1 |
| U-NEED: A Fine-grained Dataset for User Needs-Centric E-commerce Conversational Recommendation | | 0 |
| White-Box Multi-Objective Adversarial Attack on Dialogue Generation | Code | 1 |
| Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation | Code | 0 |
| Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory | Code | 1 |
| An Empirical Study of Multitask Learning to Improve Open Domain Dialogue Systems | Code | 0 |
| The StatCan Dialogue Dataset: Retrieving Data Tables through Conversations with Genuine Intents | Code | 0 |
| When Crowd Meets Persona: Creating a Large-Scale Open-Domain Persona Dialogue Corpus | | 0 |
| Elastic Weight Removal for Faithful and Abstractive Dialogue Generation | Code | 1 |
| How do decoding algorithms distribute information in dialogue responses? | | 0 |
| G-Eval: NLG Evaluation using GPT-4 with Better Human Alignment | Code | 1 |
| Deep RL with Hierarchical Action Exploration for Dialogue Generation | | 0 |
| Heterogeneous-Branch Collaborative Learning for Dialogue Generation | | 0 |
| Learning towards Selective Data Augmentation for Dialogue Generation | | 0 |
| X-ReCoSa: Multi-Scale Context Aggregation For Multi-Turn Dialogue Generation | | 0 |
| CTRLStruct: Dialogue Structure Learning for Open-Domain Response Generation | Code | 0 |
| Almanac: Retrieval-Augmented Language Models for Clinical Medicine | | 0 |

Benchmark Results

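| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LMEDR | Avg F1 | 21.99 | | Unverified |
| 2 | P^2 Bot | Avg F1 | 19.77 | | Unverified |
| 3 | TransferTransfo | Avg F1 | 19.09 | | Unverified |
| 4 | Seq2Seq + Attention | Avg F1 | 16.18 | | Unverified |
| 5 | Synthesizer (R+V) | BLEU-1 | 14.7 | | Unverified |
| 6 | KV Profile Memory | Avg F1 | 11.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Classification-based model | Slot Accuracy | 0.97 | | Unverified |
| 2 | Two-in-one model | Slot Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | EVA | mauve | 0.97 | | Unverified |
| 2 | Per-BOB | mauve | 0.95 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | mm | 1 in 10 R@2 | 5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ∞-former (Sticky memories) | F1 | 9.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SpaceFusion | interest (human) | 2.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | F1 | 4.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | Accuracy | 34.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | F1 | 11.43 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | Accuracy | 95.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | F1 | 3.72 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | MrRNN Act.-Ent. | Accuracy | 29.01 | | Unverified |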