SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 51100 of 606 papers

TitleStatusHype
Enhancing Personalized Dialogue Generation with Contrastive Latent Variables: Combining Sparse and Dense PersonaCode1
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text GenerationCode1
A Simple and Efficient Multi-Task Learning Approach for Conditioned Dialogue GenerationCode1
Neural Machine Translation by Jointly Learning to Align and TranslateCode1
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual ContextsCode1
PanGu-α: Large-scale Autoregressive Pretrained Chinese Language Models with Auto-parallel ComputationCode1
Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence ModelingCode1
A Survey on Non-Autoregressive Generation for Neural Machine Translation and BeyondCode1
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue GenerationCode1
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue GenerationCode1
Adding Chit-Chat to Enhance Task-Oriented DialoguesCode1
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion CausesCode1
Emotion-Aware Transformer Encoder for Empathetic Dialogue GenerationCode1
Diverse and Informative Dialogue Generation with Context-Specific Commonsense Knowledge AwarenessCode1
CoMix: A Comprehensive Benchmark for Multi-Task Comic UnderstandingCode1
Diversifying Dialog Generation via Adaptive Label SmoothingCode1
Knowledge Bridging for Empathetic Dialogue GenerationCode1
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive DialogueCode1
Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue GenerationCode1
Elastic Weight Removal for Faithful and Abstractive Dialogue GenerationCode1
An Empirical Investigation of Pre-Trained Transformer Language Models for Open-Domain Dialogue GenerationCode1
BANG: Bridging Autoregressive and Non-autoregressive Generation with Large Scale PretrainingCode1
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in BanglaCode1
EmpDG: Multiresolution Interactive Empathetic Dialogue GenerationCode1
ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language GenerationCode1
ESCoT: Towards Interpretable Emotional Support Dialogue SystemsCode1
An Equal-Size Hard EM Algorithm for Diverse Dialogue GenerationCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized DataCode1
A Batch Normalized Inference Network Keeps the KL Vanishing AwayCode1
G-Eval: NLG Evaluation using GPT-4 with Better Human AlignmentCode1
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue GenerationCode1
An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue GenerationCode1
Towards Empathetic Open-domain Conversation Models: a New Benchmark and DatasetCode1
Improving Empathetic Dialogue Generation by Dynamically Infusing Commonsense KnowledgeCode1
Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware DialogCode1
Anno-MI: A Dataset of Expert-Annotated Counselling DialoguesCode1
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue UtterancesCode1
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for DialogueCode1
RetGen: A Joint framework for Retrieval and Grounded Text Generation ModelingCode1
Controllable Mixed-Initiative Dialogue Generation through PromptingCode1
CASE: Aligning Coarse-to-Fine Cognition and Affection for Empathetic Response GenerationCode1
A Pre-training Based Personalized Dialogue Generation Model with Persona-sparse DataCode1
Controlling Dialogue Generation with Semantic ExemplarsCode1
DEMO: Reframing Dialogue Interaction with Fine-grained Element ModelingCode1
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue GenerationCode1
ReMeDi: Resources for Multi-domain, Multi-service, Medical DialoguesCode1
ChiMed-GPT: A Chinese Medical Large Language Model with Full Training Regime and Better Alignment to Human PreferencesCode1
Medical Dialogue Generation via Dual Flow ModelingCode1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support ConversationCode1
Show:102550
← PrevPage 2 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified