SOTAVerified

Dialogue Generation

Dialogue generation is the natural language processing task of "understanding" natural language inputs in order to produce output. These systems are usually intended to converse with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
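To make the weak-correlation point concrete, here is a minimal sketch of a word-overlap score: a token-level F1 between a generated response and a single reference. The function name `unigram_f1`, the whitespace tokenization, and the example sentences are illustrative assumptions. This is not the scoring code used by any benchmark listed on this page.

```python
from collections import Counter


def unigram_f1(hypothesis: str, reference: str) -> float:
    """Token-level F1 between a generated response and one reference.

    Assumption: lowercased whitespace tokenization; real evaluations
    typically use more careful normalization.
    """
    hyp = hypothesis.lower().split()
    ref = reference.lower().split()
    if not hyp or not ref:
        return 0.0
    # Multiset intersection counts each shared token at most as often
    # as it appears in both strings.
    overlap = sum((Counter(hyp) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(hyp)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


reference = "i am doing well thanks"
exact = unigram_f1("i am doing well thanks", reference)
partial = unigram_f1("i am well", reference)
paraphrase = unigram_f1("pretty good thank you", reference)
print(exact, partial, paraphrase)
```

The paraphrase is a perfectly acceptable reply but shares no tokens with the reference, so it scores zero. Reference-free metrics such as USR and MaUde are intended to avoid this kind of mismatch.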

Papers

Showing 251–275 of 606 papers

| Title | Status | Hype |
|---|---|---|
| RT-KGD: Relation Transition Aware Knowledge-Grounded Dialogue Generation | Code | 0 |
| Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation | | 0 |
| Evaluating Gender Bias Transfer from Film Data | | 0 |
| A Shoulder to Cry on: Towards A Motivational Virtual Assistant for Assuaging Mental Agony | | 0 |
| Modeling Compositionality with Dependency Graph for Dialogue Generation | | 0 |
| Towards an open-domain chatbot for language practice | Code | 0 |
| Medical Dialogue Response Generation with Pivotal Information Recalling | | 0 |
| Memory-Based Model Editing at Scale | Code | 1 |
| Grounding in social media: An approach to building a chit-chat dialogue model | | 0 |
| Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning | | 0 |
| A Unifying View On Task-oriented Dialogue Annotation | Code | 0 |
| CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | Code | 2 |
| Commonsense and Named Entity Aware Knowledge Grounded Dialogue Generation | Code | 0 |
| Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text Generation | | 0 |
| ProsocialDialog: A Prosocial Backbone for Conversational Agents | Code | 1 |
| InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning | Code | 1 |
| DFM: Dialogue Foundation Model for Universal Large-Scale Dialogue-Oriented Task Learning | | 0 |
| BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla | Code | 1 |
| Robust Task-Oriented Dialogue Generation with Contrastive Pre-training and Adversarial Filtering | | 0 |
| Self-training with Two-phase Self-augmentation for Few-shot Dialogue Generation | Code | 0 |
| Long-term Control for Dialogue Generation: Methods and Evaluation | Code | 0 |
| A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration | Code | 1 |
| Diversifying Neural Dialogue Generation via Negative Distillation | | 0 |
| Semantic Diversity in Dialogue with Natural Language Inference | | 0 |
| COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas | Code | 1 |

Benchmark Results

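| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LMEDR | Avg F1 | 21.99 | | Unverified |
| 2 | P^2 Bot | Avg F1 | 19.77 | | Unverified |
| 3 | TransferTransfo | Avg F1 | 19.09 | | Unverified |
| 4 | Seq2Seq + Attention | Avg F1 | 16.18 | | Unverified |
| 5 | Synthesizer (R+V) | BLEU-1 | 14.7 | | Unverified |
| 6 | KV Profile Memory | Avg F1 | 11.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Classification-based model | Slot Accuracy | 0.97 | | Unverified |
| 2 | Two-in-one model | Slot Accuracy | 0.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | EVA | mauve | 0.97 | | Unverified |
| 2 | Per-BOB | mauve | 0.95 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | mm | 1 in 10 R@2 | 5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ∞-former (Sticky memories) | F1 | 9.01 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SpaceFusion | interest (human) | 2.53 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 4.63 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 34.48 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 11.43 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 95.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | F1 | 3.72 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | MrRNN Act.-Ent. | Accuracy | 29.01 | | Unverified |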