SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 501525 of 606 papers

TitleStatusHype
Bridging Cultural Nuances in Dialogue Agents through Cultural Value SurveysCode0
Approximation of Response Knowledge Retrieval in Knowledge-grounded Dialogue GenerationCode0
Long-term Control for Dialogue Generation: Methods and EvaluationCode0
Long Time No See! Open-Domain Conversation with Long-Term Persona MemoryCode0
Improving Knowledge-aware Dialogue Generation via Knowledge Base Question AnsweringCode0
ReCoSa: Detecting the Relevant Contexts with Self-Attention for Multi-turn Dialogue GenerationCode0
DialoGen: Generalized Long-Range Context Representation for Dialogue SystemsCode0
Improving Context Modelling in Multimodal Dialogue GenerationCode0
DESED: Dialogue-based Explanation for Sentence-level Event DetectionCode0
Another Diversity-Promoting Objective Function for Neural Dialogue GenerationCode0
Improving Conditional Sequence Generative Adversarial Networks by Stepwise EvaluationCode0
MDIA: A Benchmark for Multilingual Dialogue Generation in 46 LanguagesCode0
Adaptive Parameterization for Neural Dialogue GenerationCode0
Measuring and Improving Semantic Diversity of Dialogue GenerationCode0
Importance of Search and Evaluation Strategies in Neural Dialogue ModelingCode0
MedDialog: Large-scale Medical Dialogue DatasetsCode0
IMAD: IMage-Augmented multi-modal DialogueCode0
Concept Matching with Agent for Out-of-Distribution DetectionCode0
BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response GenerationCode0
Relevance of Unsupervised Metrics in Task-Oriented Dialogue for Evaluating Natural Language GenerationCode0
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in ConversationsCode0
Unsupervised Knowledge Selection for Dialogue GenerationCode0
Hierarchical Text Generation using an OutlineCode0
Meta-Context Transformers for Domain-Specific Response GenerationCode0
Towards Less Generic Responses in Neural Conversation Models: A Statistical Re-weighting MethodCode0
Show:102550
← PrevPage 21 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified