SOTAVerified

Dialogue Generation

Dialogue generation is the natural language processing task of "understanding" natural language inputs in order to produce output. The systems are usually intended for conversing with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
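To make the weakness of word-overlap metrics concrete, here is a minimal sketch of sentence-level BLEU-1 (clipped unigram precision with a brevity penalty). The function name `bleu1`, the whitespace tokenization, and the example sentences are assumptions made for illustration; published results typically rely on an established toolkit rather than a hand-rolled function like this one.

```python
import math
from collections import Counter


def bleu1(reference: str, hypothesis: str) -> float:
    """Sentence-level BLEU-1: clipped unigram precision times a brevity penalty.

    Illustrative only: lowercases and splits on whitespace (an assumption),
    uses a single reference, and applies no smoothing.
    """
    ref_tokens = reference.lower().split()
    hyp_tokens = hypothesis.lower().split()
    if not hyp_tokens:
        return 0.0

    ref_counts = Counter(ref_tokens)
    hyp_counts = Counter(hyp_tokens)

    # Clip each hypothesis word count by how often the word occurs in the reference.
    overlap = sum(min(count, ref_counts[word]) for word, count in hyp_counts.items())
    precision = overlap / len(hyp_tokens)

    # The brevity penalty lowers the score of hypotheses shorter than the reference.
    if len(hyp_tokens) >= len(ref_tokens):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref_tokens) / len(hyp_tokens))

    return brevity_penalty * precision


# Hypothetical example: one reference reply and two candidate replies.
reference = "i am doing well thanks for asking"
close_copy = "i am doing well thanks"      # near copy of the reference
valid_reply = "pretty good how about you"  # sensible reply with no shared words

print(bleu1(reference, close_copy))
print(bleu1(reference, valid_reply))
```

The second candidate is a perfectly reasonable answer to "how are you?", yet it shares no words with the reference and so scores zero. This is the kind of mismatch with human judgement that reference-free metrics such as USR and MaUde are meant to reduce.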

Papers

Showing 101-125 of 606 papers

Title | Status | Hype
Diversifying Dialog Generation via Adaptive Label Smoothing | Code | 1
NoteChat: A Dataset of Synthetic Doctor-Patient Conversations Conditioned on Clinical Notes | Code | 1
AugESC: Dialogue Augmentation with Large Language Models for Emotional Support Conversation | Code | 1
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding | Code | 1
Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling | Code | 1
Personalized Dialogue Generation with Diversified Traits | Code | 1
Dialogue Planning via Brownian Bridge Stochastic Process for Goal-directed Proactive Dialogue | Code | 1
Perspective-taking and Pragmatics for Generating Empathetic Responses Focused on Emotion Causes | Code | 1
Polite Dialogue Generation Without Parallel Data | Code | 1
Pretrained Language Models for Dialogue Generation with Multiple Input Sources | Code | 1
DialoKG: Knowledge-Structure Aware Task-Oriented Dialogue Generation | Code | 1
SafeDialBench: A Fine-Grained Safety Benchmark for Large Language Models in Multi-Turn Dialogues with Diverse Jailbreak Attacks | Code | 1
SalesBot: Transitioning from Chit-Chat to Task-Oriented Dialogues | Code | 1
Does GPT-3 Generate Empathetic Dialogues? A Novel In-Context Example Selection Method and Automatic Evaluation Metric for Empathetic Dialogue Generation | Code | 1
EmpDG: Multiresolution Interactive Empathetic Dialogue Generation | Code | 1
SideControl: Controlled Open-domain Dialogue Generation via Additive Side Networks | Code | 1
Controllable Mixed-Initiative Dialogue Generation through Prompting | Code | 1
Controlling Dialogue Generation with Semantic Exemplars | Code | 1
A Model-Agnostic Data Manipulation Method for Persona-based Dialogue Generation | Code | 1
Stylized Dialogue Response Generation Using Stylized Unpaired Texts | Code | 1
A Three-Stage Learning Framework for Low-Resource Knowledge-Grounded Dialogue Generation | Code | 1
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances | Code | 1
COSPLAY: Concept Set Guided Personalized Dialogue Generation Across Both Party Personas | Code | 1
Adding Chit-Chat to Enhance Task-Oriented Dialogues | Code | 1
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation | Code | 1
Page 5 of 25

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | LMEDR | Avg F1 | 21.99 |  | Unverified
2 | P^2 Bot | Avg F1 | 19.77 |  | Unverified
3 | TransferTransfo | Avg F1 | 19.09 |  | Unverified
4 | Seq2Seq + Attention | Avg F1 | 16.18 |  | Unverified
5 | Synthesizer (R+V) | BLEU-1 | 14.7 |  | Unverified
6 | KV Profile Memory | Avg F1 | 11.9 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Classification-based model | Slot Accuracy | 0.97 |  | Unverified
2 | Two-in-one model | Slot Accuracy | 0.97 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | EVA | mauve | 0.97 |  | Unverified
2 | Per-BOB | mauve | 0.95 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mm | 1 in 10 R@2 | 5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories) | F1 | 9.01 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SpaceFusion | interest (human) | 2.53 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 4.63 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 34.48 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 11.43 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 95.04 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 3.72 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 29.01 |  | Unverified