SOTAVerified

Dialogue Generation

Dialogue generation is a natural language processing task that involves "understanding" natural language inputs in order to produce output. These systems are usually intended for conversing with humans, for instance in back-and-forth dialogue with a conversational agent such as a chatbot. Example benchmarks for this task (see also related tasks such as Natural Language Understanding) include FusedChat and the Ubuntu Dialogue Corpus (UDC). Models can be evaluated with metrics such as BLEU, ROUGE, and METEOR, although these correlate only weakly with human judgement; newer metrics such as UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde) aim to address this.
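To make the word-overlap style of evaluation concrete, the following is a minimal sketch of two unigram-overlap scores between a generated response and a single reference. It is an illustration under stated assumptions, not the scoring code behind any benchmark listed here: the function names are hypothetical, tokenization is naive whitespace splitting, and the precision shown is only the clipped unigram precision that BLEU-1 builds on (it omits BLEU's brevity penalty and corpus-level aggregation).

```python
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; real evaluations
    # typically use a more careful tokenizer.
    return text.lower().split()


def unigram_overlap(hypothesis, reference):
    """Return (clipped matches, hypothesis length, reference length)."""
    hyp = Counter(tokenize(hypothesis))
    ref = Counter(tokenize(reference))
    # Counter intersection keeps the minimum count per token, so a word
    # repeated in the hypothesis is credited at most as often as it
    # appears in the reference.
    matches = sum((hyp & ref).values())
    return matches, sum(hyp.values()), sum(ref.values())


def clipped_unigram_precision(hypothesis, reference):
    # The precision component of BLEU-1, without the brevity penalty.
    matches, hyp_len, _ = unigram_overlap(hypothesis, reference)
    return matches / hyp_len if hyp_len else 0.0


def unigram_f1(hypothesis, reference):
    # Harmonic mean of unigram precision and recall.
    matches, hyp_len, ref_len = unigram_overlap(hypothesis, reference)
    if matches == 0:
        return 0.0
    precision = matches / hyp_len
    recall = matches / ref_len
    return 2 * precision * recall / (precision + recall)


# Hypothetical response/reference pair for illustration.
response = "the cat sat"
reference = "the cat sat on the mat"
print(clipped_unigram_precision(response, reference))
print(unigram_f1(response, reference))
```

Scores of this kind reward surface overlap only, which is one reason they can disagree with human judgement: a fluent, appropriate reply that shares few words with the reference scores poorly.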

Papers

Showing 1-25 of 606 papers

Title | Status | Hype
Emotional Support with LLM-based Empathetic Dialogue Generation | - | 0
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching | Code | 4
Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment | Code | 0
SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis | Code | 2
Proactive Assistant Dialogue Generation from Streaming Egocentric Videos | - | 0
ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch | - | 0
CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching | - | 0
Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication Training | Code | 0
When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation | - | 0
Benchmarking Expressive Japanese Character Text-to-Speech with VITS and Style-BERT-VITS2 | - | 0
Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts | - | 0
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition | - | 0
Streamlining Biomedical Research with Specialized LLMs | - | 0
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation | Code | 0
A Unified Virtual Mixture-of-Experts Framework: Enhanced Inference and Hallucination Mitigation in Single-Model System | - | 0
Building Resource-Constrained Language Agents: A Korean Case Study on Chemical Toxicity Information | - | 0
Enhancing Persona Consistency for LLMs' Role-Playing using Persona-Aware Contrastive Learning | - | 0
VisTai: Benchmarking Vision-Language Models for Traditional Chinese in Taiwan | Code | 1
Contrastive Speaker-Aware Learning for Multi-party Dialogue Generation with LLMs | - | 0
SAGE: Steering and Refining Dialog Generation with State-Action Augmentation | Code | 1
ToolDial: Multi-turn Dialogue Generation Method for Tool-Augmented Language Models | Code | 1
Single- vs. Dual-Prompt Dialogue Generation with LLMs for Job Interviews in Human Resources | - | 0
Fine-Tuning Qwen 2.5 3B for Realistic Movie Dialogue Generation | - | 0
Can LLMs Simulate L2-English Dialogue? An Information-Theoretic Analysis of L1-Dependent Biases | Code | 0
Exploring Persona Sentiment Sensitivity in Personalized Dialogue Generation | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | LMEDR | Avg F1 | 21.99 | - | Unverified
2 | P^2 Bot | Avg F1 | 19.77 | - | Unverified
3 | TransferTransfo | Avg F1 | 19.09 | - | Unverified
4 | Seq2Seq + Attention | Avg F1 | 16.18 | - | Unverified
5 | Synthesizer (R+V) | BLEU-1 | 14.7 | - | Unverified
6 | KV Profile Memory | Avg F1 | 11.9 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | Classification-based model | Slot Accuracy | 0.97 | - | Unverified
2 | Two-in-one model | Slot Accuracy | 0.97 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | EVA | mauve | 0.97 | - | Unverified
2 | Per-BOB | mauve | 0.95 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | mm | 1 in 10 R@2 | 5 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories) | F1 | 9.01 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | ∞-former (Sticky memories + initialized GPT-2 Small) | Perplexity | 32.48 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SpaceFusion | interest (human) | 2.53 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 4.63 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 34.48 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 11.43 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 95.04 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | F1 | 3.72 | - | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | MrRNN Act.-Ent. | Accuracy | 29.01 | - | Unverified