SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 226250 of 606 papers

TitleStatusHype
Exploring Persona Sentiment Sensitivity in Personalized Dialogue Generation0
Extended Named Entity Recognition API and Its Applications in Language Education0
Fact-based Dialogue Generation with Convergent and Divergent Decoding0
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation0
Dolphin: A Challenging and Diverse Benchmark for Arabic NLG0
Faithfulness in Natural Language Generation: A Systematic Survey of Analysis, Evaluation and Optimization Methods0
Few-Shot Dialogue Generation Without Annotated Data: A Transfer Learning Approach0
Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning0
Controllable Meaning Representation to Text Generation: Linearization and Data Augmentation Strategies0
Active Defense Against Social Engineering: The Case for Human Language Technology0
DLVGen: A Dual Latent Variable Approach to Personalized Dialogue Generation0
DLGNet-Task: An End-to-end Neural Network Framework for Modeling Multi-turn Multi-domain Task-Oriented Dialogue0
Adaptive Bridge between Training and Inference for Dialogue0
Improving Matching Models with Hierarchical Contextualized Representations for Multi-turn Response Selection0
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information0
GE-Blender: Graph-Based Knowledge Enhancement for Blender0
Generalizable and Explainable Dialogue Generation via Explicit Action Learning0
Diversifying Neural Dialogue Generation via Negative Distillation0
Improving Contextual Coherence in Variational Personalized and Empathetic Dialogue Agents0
Diversifying Neural Dialogue Generation via Negative Distillation0
Generating Emotionally Aligned Responses in Dialogues using Affect Control Theory0
Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation0
Generating Personalized Dialogue via Multi-Task Meta-Learning0
Diverse dialogue generation with context dependent dynamic loss function0
HUMBO: Bridging Response Generation and Facial Expression Synthesis0
Show:102550
← PrevPage 10 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified