SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 51100 of 606 papers

TitleStatusHype
MADial-Bench: Towards Real-world Evaluation of Memory-Augmented Dialogue Generation0
LLMR: Knowledge Distillation with a Large Language Model-Induced Reward0
ReflectDiffu:Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework0
UPCS: Unbiased Persona Construction for Dialogue Generation0
User-Specific Dialogue Generation with User Profile-Aware Pre-Training Model and Parameter-Efficient Fine-Tuning0
Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies0
Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree SearchCode2
ChatZero:Zero-shot Cross-Lingual Dialogue Generation via Pseudo-Target Language0
An End-to-End Model for Photo-Sharing Multi-modal Dialogue GenerationCode0
Style-Talker: Finetuning Audio Language Model and Style-Based Text-to-Speech Model for Fast Spoken Dialogue Generation0
Rethinking the Alignment of Psychotherapy Dialogue Generation with Motivational Interviewing Strategies0
Synthetic Patient-Physician Dialogue Generation from Clinical Notes Using LLM0
Self-Emotion Blended Dialogue Generation in Social Simulation Agents0
What if Red Can Talk? Dynamic Dialogue Generation Using Large Language Models0
LoRA-Pro: Are Low-Rank Adapters Properly Optimized?Code2
J-CHAT: Japanese Large-scale Spoken Dialogue Corpus for Spoken Dialogue Language Modeling0
A Factuality and Diversity Reconciled Decoding Method for Knowledge-Grounded Dialogue Generation0
CoMix: A Comprehensive Benchmark for Multi-Task Comic UnderstandingCode1
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space0
Investigating the Effects of Large-Scale Pseudo-Stereo Data and Different Speech Foundation Model on Dialogue Generative Spoken Language Model0
Learning Retrieval Augmentation for Personalized Dialogue GenerationCode0
Selective Prompting Tuning for Personalized Conversations with LLMsCode1
Data Augmentation of Multi-turn Psychological Dialogue via Knowledge-driven Progressive Thought Prompting0
ESCoT: Towards Interpretable Emotional Support Dialogue SystemsCode1
Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First MeetingCode0
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue GenerationCode0
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation0
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation0
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue0
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations0
Concept Matching with Agent for Out-of-Distribution DetectionCode0
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions0
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks0
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue GenerationCode0
Research on emotionally intelligent dialogue generation based on automatic dialogue system0
Modeling Low-Resource Health Coaching Dialogues via Neuro-Symbolic Goal Summarization and Text-Units-Text GenerationCode1
Impact of Preference Noise on the Alignment Performance of Generative Language Models0
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker ConversationsCode2
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space0
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation0
PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language ModelsCode0
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation0
BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation0
Empowering Segmentation Ability to Multi-modal Large Language ModelsCode0
StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation0
MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding0
Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive DialogueCode1
MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs0
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue GenerationCode0
"In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning0
Show:102550
← PrevPage 2 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified