SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 126150 of 606 papers

TitleStatusHype
Improvement of a dedicated model for open domain persona-aware dialogue generationCode1
Memory-Based Model Editing at ScaleCode1
PRODIGy: a PROfile-based DIalogue Generation datasetCode1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
Elastic Weight Removal for Faithful and Abstractive Dialogue GenerationCode1
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible Knowledge SelectionCode1
EmpDG: Multi-resolution Interactive Empathetic Dialogue GenerationCode1
Enhancing Dialogue Generation via Dynamic Graph Knowledge AggregationCode1
Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense PersonaCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching0
Counterfactual Off-Policy Training for Neural Dialogue Generation0
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue0
Counterfactual Off-Policy Training for Neural Response Generation0
Attribute Controlled Dialogue Prompting0
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations0
Conversation AI Dialog for Medicare powered by Finetuning and Retrieval Augmented Generation0
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks0
An Adversarial Approach to High-Quality, Sentiment-Controlled Neural Dialogue Generation0
Dynamic Knowledge Graph-based Dialogue Generation with Improved Adversarial Meta-Learning0
Controllable Meaning Representation to Text Generation: Linearization and Data Augmentation Strategies0
Controllable Dialogue Generation with Disentangled Multi-grained Style Specification and Attribute Consistency Reward0
A Survey of Text Watermarking in the Era of Large Language Models0
A Model-agnostic Data Manipulation Method for Persona-based Dialogue Generation0
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation0
Show:102550
← PrevPage 6 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified