SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 301325 of 606 papers

TitleStatusHype
Emotional Support with LLM-based Empathetic Dialogue Generation0
Empathetic Dialogue Generation with Pre-trained RoBERTa-GPT2 and External Knowledge0
Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies0
Enhancing Dialogue Generation via Multi-Level Contrastive Learning0
Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals0
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning0
Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement0
Entropy-Enhanced Multimodal Attention Model for Scene-Aware Dialogue Generation0
Evaluate What You Can't Evaluate: Unassessable Quality for Generated Response0
Evaluating Gender Bias Transfer from Film Data0
Exploring Effective Information Utilization in Multi-Turn Topic-Driven Conversations0
Exploring Persona Sentiment Sensitivity in Personalized Dialogue Generation0
Extended Named Entity Recognition API and Its Applications in Language Education0
Fact-based Dialogue Generation with Convergent and Divergent Decoding0
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods0
Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach0
Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning0
RecInDial: A Unified Framework for Conversational Recommendation with Pretrained Language Models0
Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation0
Modeling Compositionality with Dependency Graph for Dialogue Generation0
Modeling Topical Relevance for Multi-Turn Dialogue Generation0
More but Correct: Generating Diversified and Entity-revised Medical Response0
More Diverse Dialogue Datasets via Diversity-Informed Data Collection0
More Informative Dialogue Generation via Multiple Knowledge Selection0
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space0
Show:102550
← PrevPage 13 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified