SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 5175 of 606 papers

TitleStatusHype
Personalized Dialogue Generation with Persona-Adaptive AttentionCode1
Empathetic Dialogue Generation via Sensitive Emotion Recognition and Sensible Knowledge SelectionCode1
Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy PlanningCode1
Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue GenerationCode1
An Equal-Size Hard EM Algorithm for Diverse Dialogue GenerationCode1
CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response GenerationCode1
Follow Me: Conversation Planning for Target-driven Recommendation Dialogue SystemsCode1
Memory-Based Model Editing at ScaleCode1
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction TuningCode1
ProsocialDialog: A Prosocial Backbone for Conversational AgentsCode1
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in BanglaCode1
A Simple Contrastive Learning Objective for Alleviating Neural Text DegenerationCode1
COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party PersonasCode1
-former: Infinite Memory TransformerCode1
Anno-MI: A Dataset of Expert-Annotated Counselling DialoguesCode1
Emotion-Aware Transformer Encoder for Empathetic Dialogue GenerationCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
SalesBot: Transitioning from Chit-Chat to Task-Oriented DialoguesCode1
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue GenerationCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
LingYi: Medical Conversational Question Answering System based on Multi-modal Knowledge GraphsCode1
User-Centric Conversational Recommendation with Multi-Aspect User ModelingCode1
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue GenerationCode1
An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue GenerationCode1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
Show:102550
← PrevPage 3 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified