SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 151200 of 606 papers

TitleStatusHype
More is Better: Enhancing Open-Domain Dialogue Generation via Multi-Source Heterogeneous KnowledgeCode0
Multiple Generative Models Ensemble for Knowledge-Driven Proactive Human-Computer Dialogue AgentCode0
Commonsense and Named Entity Aware Knowledge Grounded Dialogue GenerationCode0
Multiresolution Recurrent Neural Networks: An Application to Dialogue Response GenerationCode0
Mind the Gap Between Conversations for Improved Long-Term Dialogue GenerationCode0
Meta-Context Transformers for Domain-Specific Response GenerationCode0
Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation MetricCode0
MidMed: Towards Mixed-Type Dialogues for Medical ConsultationCode0
MedDialog: Large-scale Medical Dialogue DatasetsCode0
Eliciting Knowledge from Large Pre-Trained Models for Unsupervised Knowledge-Grounded ConversationCode0
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue GenerationCode0
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue GenerationCode0
MDIA: A Benchmark for Multilingual Dialogue Generation in 46 LanguagesCode0
Measuring and Improving Semantic Diversity of Dialogue GenerationCode0
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in ConversationsCode0
CDialog: A Multi-turn Covid-19 Conversation Dataset for Entity-Aware Dialog GenerationCode0
CausalDialogue: Modeling Utterance-level Causality in ConversationsCode0
Long-term Control for Dialogue Generation: Methods and EvaluationCode0
Approximation of Response Knowledge Retrieval in Knowledge-grounded Dialogue GenerationCode0
Domain Agnostic Real-Valued Specificity PredictionCode0
Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent BiasesCode0
Long Time No See! Open-Domain Conversation with Long-Term Persona MemoryCode0
CADGE: Context-Aware Dialogue Generation Enhanced with Graph-Structured Knowledge AggregationCode0
CAB: Empathetic Dialogue Generation with Cognition, Affection and BehaviorCode0
Learning Retrieval Augmentation for Personalized Dialogue GenerationCode0
Diversity-Promoting GAN: A Cross-Entropy Based Generative Adversarial Network for Diversified Text GenerationCode0
AirDialogue: An Environment for Goal-Oriented Dialogue ResearchCode0
Latent Variable Dialogue Models and their DiversityCode0
Diversifying Dialogue Generation with Non-Conversational TextCode0
Another Diversity-Promoting Objective Function for Neural Dialogue GenerationCode0
Language Detoxification with Attribute-Discriminative Latent SpaceCode0
Learning Discourse-level Diversity for Neural Dialog Models using Conditional Variational AutoencodersCode0
Learning to Customize Model Structures for Few-shot Dialogue Generation TasksCode0
DP-GAN: Diversity-Promoting Generative Adversarial Network for Generating Informative and Diversified TextCode0
Knowledge Diffusion for Neural Dialogue GenerationCode0
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge GraphsCode0
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue GenerationCode0
Bridging Cultural Nuances in Dialogue Agents through Cultural Value SurveysCode0
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process FeedbackCode0
Linguistically-Informed Specificity and Semantic Plausibility for Dialogue GenerationCode0
Knowledge-Grounded Dialogue Generation with Term-level De-noisingCode0
A Neural Topical Expansion Framework for Unstructured Persona-oriented Dialogue GenerationCode0
Improving Medical Dialogue Generation with Abstract Meaning RepresentationsCode0
BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response GenerationCode0
Dialogue Generation: From Imitation Learning to Inverse Reinforcement LearningCode0
Dialogue Benchmark Generation from Knowledge Graphs with Cost-Effective Retrieval-Augmented LLMsCode0
Improving Knowledge-aware Dialogue Generation via Knowledge Base Question AnsweringCode0
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue GenerationCode0
Adversarial Learning for Neural Dialogue GenerationCode0
IMAD: IMage-Augmented multi-modal DialogueCode0
Show:102550
← PrevPage 4 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified