SOTAVerified

Dialogue Generation

Dialogue generation is the task of "understanding" natural language inputs - within natural language processing in order to produce output. The systems are usually intended for conversing with humans, for instance back and forth dialogue with a conversation agent like a chatbot. Some example benchmarks for this task (see others such as Natural Language Understanding) include FusedChat and Ubuntu DIalogue Corpus (UDC). Models can be evaluated via metrics such as BLEU, ROUGE, and METEOR albeit with challenges in terms of weak correlation with human judgement, that may be addressed by new ones like UnSupervised and Reference-free (USR) and Metric for automatic Unreferenced dialog evaluation (MaUde).

Papers

Showing 126150 of 606 papers

TitleStatusHype
PRODIGy: a PROfile-based DIalogue Generation datasetCode1
Blending Reward Functions via Few Expert Demonstrations for Faithful and Accurate Knowledge-Grounded Dialogue Generation0
Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation0
FactSpotter: Evaluating the Factual Faithfulness of Graph-to-Text GenerationCode0
NoteChat: A Dataset of Synthetic Doctor-Patient Conversations Conditioned on Clinical NotesCode1
Mind the Gap Between Conversations for Improved Long-Term Dialogue GenerationCode0
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text GenerationCode0
MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute ControlCode1
PromptCBLUE: A Chinese Prompt Tuning Benchmark for the Medical DomainCode2
Conversation Chronicles: Towards Diverse Temporal and Relational Dynamics in Multi-Session Conversations0
Multi-level Adaptive Contrastive Learning for Knowledge Internalization in Dialogue Generation0
Hexa: Self-Improving for Knowledge-Grounded Dialogue System0
We are what we repeatedly do: Inducing and deploying habitual schemas in persona-based responsesCode0
Towards human-like spoken dialogue generation between AI agents from written dialogue0
MSG-BART: Multi-granularity Scene Graph-Enhanced Encoder-Decoder Language Model for Video-grounded Dialogue Generation0
Learning to Diversify Neural Text Generation via Degenerative Model0
Improving Medical Dialogue Generation with Abstract Meaning RepresentationsCode0
Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals0
Unleashing Potential of Evidence in Knowledge-Intensive Dialogue Generation0
Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation MetricCode0
Promoting Open-domain Dialogue Generation through Learning Pattern Information between Contexts and ResponsesCode0
Bilevel Scheduled Sampling for Dialogue Generation0
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback0
TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties0
Dataflow Dialogue Generation0
Show:102550
← PrevPage 6 of 25Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LMEDRAvg F121.99Unverified
2P^2 BotAvg F119.77Unverified
3TransferTransfoAvg F119.09Unverified
4Seq2Seq + AttentionAvg F116.18Unverified
5Synthesizer (R+V)BLEU-114.7Unverified
6KV Profile MemoryAvg F111.9Unverified
#ModelMetricClaimedVerifiedStatus
1Classification-based modelSlot Accuracy0.97Unverified
2Two-in-one modelSlot Accuracy0.97Unverified
#ModelMetricClaimedVerifiedStatus
1EVAmauve0.97Unverified
2Per-BOBmauve0.95Unverified
#ModelMetricClaimedVerifiedStatus
1mm1 in 10 R@25Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories)F19.01Unverified
#ModelMetricClaimedVerifiedStatus
1∞-former (Sticky memories + initialized GPT-2 Small)Perplexity32.48Unverified
#ModelMetricClaimedVerifiedStatus
1SpaceFusioninterest (human)2.53Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F14.63Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy34.48Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F111.43Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy95.04Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.F13.72Unverified
#ModelMetricClaimedVerifiedStatus
1MrRNN Act.-Ent.Accuracy29.01Unverified