SOTAVerified

Response Generation

A task in which an agent plays the $DE$ role and generates text in response to a $P$ message.

Papers

Showing 426–450 of 914 papers

Title | Status | Hype
Evaluating Large Language Models for Document-grounded Response Generation in Information-Seeking Dialogues | - | 0
PICK: Polished & Informed Candidate Scoring for Knowledge-Grounded Dialogue Systems | Code | 0
SYNDICOM: Improving Conversational Commonsense with Error-Injection and Natural Language Feedback | - | 0
RADE: Reference-Assisted Dialogue Evaluation for Open-Domain Dialogue | - | 0
Tree of Uncertain Thoughts Reasoning for Large Language Models | - | 0
Hi Model, generating 'nice' instead of 'good' is not as bad as generating 'rice'! Towards Context and Semantic Infused Dialogue Generation Loss Function and Evaluation Metric | Code | 0
Towards Reliable and Fluent Large Language Models: Incorporating Feedback Learning Loops in QA Systems | - | 0
Promoting Open-domain Dialogue Generation through Learning Pattern Information between Contexts and Responses | Code | 0
Towards Filling the Gap in Conversational Search: From Passage Retrieval to Conversational Response Generation | Code | 0
NewsDialogues: Towards Proactive News Grounded Conversation | Code | 0
A Large Language Model Enhanced Conversational Recommender System | - | 0
FLIRT: Feedback Loop In-context Red Teaming | - | 0
System-Initiated Transitions from Chit-Chat to Task-Oriented Dialogues with Transition Info Extractor and Transition Sentence Generator | - | 0
Leveraging Few-Shot Data Augmentation and Waterfall Prompting for Response Generation | - | 0
DiactTOD: Learning Generalizable Latent Dialogue Acts for Controllable Task-Oriented Dialogue Systems | - | 0
ZRIGF: An Innovative Multimodal Framework for Zero-Resource Image-Grounded Dialogue Generation | Code | 0
Roll Up Your Sleeves: Working with a Collaborative and Engaging Task-Oriented Dialogue System | Code | 0
Reasoning before Responding: Integrating Commonsense-based Causality Explanation for Empathetic Response Generation | - | 0
Controllable Generation of Dialogue Acts for Dialogue Systems via Few-Shot Response Generation and Ranking | Code | 0
On the Effectiveness of Offline RL for Dialogue Response Generation | Code | 0
LLM Comparative Assessment: Zero-shot NLG Evaluation through Pairwise Comparisons using Large Language Models | Code | 0
SimpleMTOD: A Simple Language Model for Multimodal Task-Oriented Dialogue with Symbolic Scene Representation | - | 0
KnowPrefix-Tuning: A Two-Stage Prefix-Tuning Framework for Knowledge-Grounded Dialogue Generation | Code | 0
System-Level Natural Language Feedback | Code | 0
ChatGPT for Suicide Risk Assessment on Social Media: Quantitative Evaluation of Model Performance, Potentials and Limitations | Code | 0
Page 18 of 37

Benchmark Results

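# | Model | Metric | Claimed | Verified | Status
1 | PaCE | BLEU | 34.1 | - | Unverified
2 | BART-large | BLEU | 33.1 | - | Unverified
3 | BART-base | BLEU | 29.4 | - | Unverified
4 | MTN | BLEU | 21.7 | - | Unverified
5 | GPT-2 | BLEU | 19.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LED(Q,F) | Message-F1 | 19.54 | - | Unverified
2 | LED(Q,P,H) | Message-F1 | 16.14 | - | Unverified
3 | LED(Q,P) | Message-F1 | 14.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PaCE | BLEU | 22 | - | Unverified
2 | SimpleTOD | BLEU | 20.3 | - | Unverified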