SOTAVerified

Dialogue Generation

Dialogue generation is the natural language processing task of "understanding" natural language inputs in order to produce an output response. Such systems are usually intended to converse with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
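
As a rough illustration of why word-overlap metrics can disagree with human judgement in dialogue, here is a minimal sketch of sentence-level BLEU-1 (clipped unigram precision times a brevity penalty). It assumes a single reference, lowercased whitespace tokenization, and no smoothing. The function name `bleu1` and the example sentences are hypothetical and are not taken from any benchmark's official scorer.

```python
import math
from collections import Counter


def bleu1(reference: str, hypothesis: str) -> float:
    """Sentence-level BLEU-1 against a single reference (illustrative only)."""
    ref_tokens = reference.lower().split()
    hyp_tokens = hypothesis.lower().split()
    if not ref_tokens or not hyp_tokens:
        return 0.0

    ref_counts = Counter(ref_tokens)
    hyp_counts = Counter(hyp_tokens)
    # Clip each hypothesis token count by its count in the reference.
    overlap = sum(min(count, ref_counts[tok]) for tok, count in hyp_counts.items())
    precision = overlap / len(hyp_tokens)

    # Brevity penalty: penalize hypotheses shorter than the reference.
    if len(hyp_tokens) >= len(ref_tokens):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1.0 - len(ref_tokens) / len(hyp_tokens))
    return brevity_penalty * precision


reference = "i am doing well thanks"
print(bleu1(reference, "i am doing well thanks"))  # exact copy of the reference
print(bleu1(reference, "i am fine thank you"))     # sensible reply, little overlap
print(bleu1(reference, "pretty good and you"))     # sensible reply, no overlap
```

In this toy setup, a perfectly acceptable reply that shares no words with the reference scores zero. That is the kind of mismatch that reference-free metrics such as USR and MaUde are meant to reduce.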

Papers

Showing 351-375 of 606 papers

Title | Status | Hype
Open-domain Dialogue Generation: What We Can Do, Cannot Do, And Should Do Next | | 0
Open Domain Dialogue Generation with Latent Images | | 0
ORD: Object Relationship Discovery for Visual Dialogue Generation | | 0
Pan More Gold from the Sand: Refining Open-domain Dialogue Training with Noisy Self-Retrieval Generation | | 0
Partner Personas Generation for Diverse Dialogue Generation | | 0
Persona-Knowledge Dialogue Multi-Context Retrieval and Enhanced Decoding Methods | | 0
PLATO-K: Internal and External Knowledge Enhanced Dialogue Generation | | 0
PlugMed: Improving Specificity in Patient-Centered Medical Dialogue Generation using In-Context Learning | | 0
Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue | | 0
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue | | 0
Prediction, Selection, and Generation: Exploration of Knowledge-Driven Conversation System | | 0
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos | | 0
Profanity-Avoiding Training Framework for Seq2seq Models with Certified Robustness | | 0
ProphetChat: Enhancing Dialogue Generation with Simulation of Future Conversation | | 0
Prototype-to-Style: Dialogue Generation with Style-Aware Editing on Retrieval Memory | | 0
PsyPlay: Personality-Infused Role-Playing Conversational Agents | | 0
Queens are Powerful too: Mitigating Gender Bias in Dialogue Generation | | 0
Ranking Enhanced Dialogue Generation | | 0
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | | 0
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations | | 0
Reference-Aware Language Models | | 0
Refine and Imitate: Reducing Repetition and Inconsistency in Dialogue Generation via Reinforcement Learning and Human Demonstration | | 0
ReflectDiffu: Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | | 0
Regularizing Dialogue Generation by Imitating Implicit Scenarios | | 0
RepSum: Unsupervised Dialogue Summarization based on Replacement Strategy | | 0
Page 15 of 25

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | LMEDR | Avg F1 | 21.99 | | Unverified
2 | P^2 Bot | Avg F1 | 19.77 | | Unverified
3 | TransferTransfo | Avg F1 | 19.09 | | Unverified
4 | Seq2Seq + Attention | Avg F1 | 16.18 | | Unverified
5 | Synthesizer (R+V) | BLEU-1 | 14.7 | | Unverified
6 | KV Profile Memory | Avg F1 | 11.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Classification-based model | Slot Accuracy | 0.97 | | Unverified
2 | Two-in-one model | Slot Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | EVA | mauve | 0.97 | | Unverified
2 | Per-BOB | mauve | 0.95 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mm | 1 in 10 R@2 | 5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories) | F1 | 9.01 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SpaceFusion | interest (human) | 2.53 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 4.63 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 34.48 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 11.43 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 95.04 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 3.72 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 29.01 | | Unverified