SOTAVerified

Dialogue Generation

Dialogue generation is the natural language processing task of "understanding" natural language inputs in order to produce output. Systems are usually intended for conversing with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
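To make the word-overlap metrics concrete, here is a minimal Python sketch of two scores that appear in the benchmark tables on this page: unigram F1 and BLEU-1. It assumes lowercase whitespace tokenization and a single reference per response; the function names `tokenize`, `unigram_f1`, and `bleu1` are illustrative and are not taken from any benchmark's official evaluation script, so published numbers may be computed differently.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization.
    # Official evaluation scripts usually apply their own tokenizer.
    return text.lower().split()


def unigram_f1(hypothesis, reference):
    """Harmonic mean of unigram precision and recall against one reference."""
    hyp, ref = tokenize(hypothesis), tokenize(reference)
    # Counter intersection keeps the minimum count of each shared token.
    overlap = sum((Counter(hyp) & Counter(ref)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(hyp)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)


def bleu1(hypothesis, reference):
    """Clipped unigram precision multiplied by the BLEU brevity penalty."""
    hyp, ref = tokenize(hypothesis), tokenize(reference)
    if not hyp:
        return 0.0
    clipped = sum((Counter(hyp) & Counter(ref)).values())
    precision = clipped / len(hyp)
    # Brevity penalty: 1 if the hypothesis is longer than the reference,
    # otherwise exp(1 - r / c).
    if len(hyp) > len(ref):
        brevity_penalty = 1.0
    else:
        brevity_penalty = math.exp(1 - len(ref) / len(hyp))
    return brevity_penalty * precision


reference = "i am doing well thanks"
close_reply = "i am fine thanks"
valid_but_different = "pretty good, how about you?"

print(unigram_f1(close_reply, reference))
print(bleu1(close_reply, reference))
print(unigram_f1(valid_but_different, reference))
```

The last call scores a perfectly reasonable reply at zero because it shares no tokens with the reference, which is one way to see why overlap metrics can disagree with human judgement in open-ended dialogue.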

Papers

Showing 201-250 of 606 papers

Title | Status | Hype
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue | | 0
Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations | | 0
Concept Matching with Agent for Out-of-Distribution Detection | Code | 0
M-RAG: Reinforcing Large Language Model Performance through Retrieval-Augmented Generation with Multiple Partitions | | 0
A Systematic Evaluation of Large Language Models for Natural Language Generation Tasks | | 0
Distilling Implicit Multimodal Knowledge into Large Language Models for Zero-Resource Dialogue Generation | Code | 0
Research on emotionally intelligent dialogue generation based on automatic dialogue system | | 0
Impact of Preference Noise on the Alignment Performance of Generative Language Models | | 0
DiffusionDialog: A Diffusion Model for Diverse Dialog Generation with Latent Space | | 0
A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation | | 0
PSYDIAL: Personality-based Synthetic Dialogue Generation using Large Language Models | Code | 0
Controllable and Diverse Data Augmentation with Large Language Model for Low-Resource Open-Domain Dialogue Generation | | 0
BP4ER: Bootstrap Prompting for Explicit Reasoning in Medical Dialogue Generation | | 0
Empowering Segmentation Ability to Multi-modal Large Language Models | Code | 0
StyleChat: Learning Recitation-Augmented Memory in LLMs for Stylized Dialogue Generation | | 0
MedKP: Medical Dialogue with Knowledge Enhancement and Clinical Pathway Encoding | | 0
MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs | | 0
A Knowledge Plug-and-Play Test Bed for Open-domain Dialogue Generation | Code | 0
"In Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning | | 0
Exploiting Emotion-Semantic Correlations for Empathetic Response Generation | Code | 0
Reasoning in Conversation: Solving Subjective Tasks through Dialogue Simulation for Large Language Models | | 0
M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation | | 0
Enhancing Role-playing Systems through Aggressive Queries: Evaluation and Improvement | | 0
Crafting a Good Prompt or Providing Exemplary Dialogues? A Study of In-Context Learning for Persona-based Dialogue Generation | | 0
Investigating Content Planning for Navigating Trade-offs in Knowledge-Grounded Dialogue | | 0
Bridging Cultural Nuances in Dialogue Agents through Cultural Value Surveys | Code | 0
Medical Dialogue Generation via Intuitive-then-Analytical Differential Diagnosis | | 0
Integrating Physician Diagnostic Logic into Large Language Models: Preference Learning from Process Feedback | Code | 0
An EcoSage Assistant: Towards Building A Multimodal Plant Care Dialogue Assistant | Code | 0
OmniDialog: An Omnipotent Pre-training Model for Task-Oriented Dialogue System | | 0
A Survey of Text Watermarking in the Era of Large Language Models | | 0
Sibyl: Empowering Empathetic Dialogue Generation in Large Language Models via Sensible and Visionary Commonsense Inference | Code | 0
E-CORE: Emotion Correlation Enhanced Empathetic Dialogue Generation | | 0
CMed-GPT: Prompt Tuning for Entity-Aware Chinese Medical Dialogue Generation | | 0
Attribution and Alignment: Effects of Local Context Repetition on Utterance Production and Comprehension in Dialogue | | 0
An Empirical Bayes Framework for Open-Domain Dialogue Generation | | 0
X-Eval: Generalizable Multi-aspect Text Evaluation via Augmented Instruction Tuning with Auxiliary Evaluation Aspects | | 0
Context-dependent Instruction Tuning for Dialogue Response Generation | | 0
Think Before You Speak: Cultivating Communication Skills of Large Language Models via Inner Monologue | Code | 0
Blending Reward Functions via Few Expert Demonstrations for Faithful and Accurate Knowledge-Grounded Dialogue Generation | | 0
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation | | 0
FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text Generation | Code | 0
Mind the Gap Between Conversations for Improved Long-Term Dialogue Generation | Code | 0
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Code | 0
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations | | 0
Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation | | 0
Hexa: Self-Improving for Knowledge-Grounded Dialogue System | | 0
We are what we repeatedly do: Inducing and deploying habitual schemas in persona-based responses | Code | 0
Towards human-like spoken dialogue generation between AI agents from written dialogue | | 0
MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation | | 0
Page 5 of 13

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | LMEDR | Avg F1 | 21.99 | | Unverified
2 | P^2 Bot | Avg F1 | 19.77 | | Unverified
3 | TransferTransfo | Avg F1 | 19.09 | | Unverified
4 | Seq2Seq + Attention | Avg F1 | 16.18 | | Unverified
5 | Synthesizer (R+V) | BLEU-1 | 14.7 | | Unverified
6 | KV Profile Memory | Avg F1 | 11.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Classification-based model | Slot Accuracy | 0.97 | | Unverified
2 | Two-in-one model | Slot Accuracy | 0.97 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | EVA | mauve | 0.97 | | Unverified
2 | Per-BOB | mauve | 0.95 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | mm | 1 in 10 R@2 | 5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories) | F1 | 9.01 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SpaceFusion | interest (human) | 2.53 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 4.63 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 34.48 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 11.43 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 95.04 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 3.72 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 29.01 | | Unverified