SOTAVerified

Response Generation

A task in which an agent plays the $DE$ role and generates text in response to a $P$ message.

Papers

Showing 26–50 of 914 papers

| Title | Status | Hype |
|---|---|---|
| On Evaluating Adversarial Robustness of Large Vision-Language Models | Code | 2 |
| Towards a Unified Multi-Dimensional Evaluator for Text Generation | Code | 2 |
| DIALOGPT: Large-Scale Generative Pre-training for Conversational Response Generation | Code | 2 |
| MASS: Masked Sequence to Sequence Pre-training for Language Generation | Code | 2 |
| Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning | Code | 1 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | Code | 1 |
| Neuro-Symbolic Query Compiler | Code | 1 |
| Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception | Code | 1 |
| MSCRS: Multi-modal Semantic Graph Prompt Learning Framework for Conversational Recommender Systems | Code | 1 |
| LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation Conversation | Code | 1 |
| Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models | Code | 1 |
| Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval | Code | 1 |
| Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report | Code | 1 |
| Towards Empathetic Conversational Recommender Systems | Code | 1 |
| BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation | Code | 1 |
| Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation | Code | 1 |
| Towards Aligning Language Models with Textual Feedback | Code | 1 |
| MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Code | 1 |
| ESCoT: Towards Interpretable Emotional Support Dialogue Systems | Code | 1 |
| Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | Code | 1 |
| The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | Code | 1 |
| Doing Personal LAPS: LLM-Augmented Dialogue Construction for Personalized Multi-Session Conversational Search | Code | 1 |
| ERAGent: Enhancing Retrieval-Augmented Language Models with Improved Accuracy, Efficiency, and Personalization | Code | 1 |
| Efficient Data Generation for Source-grounded Information-seeking Dialogs: A Use Case for Meeting Transcripts | Code | 1 |
| Aligning LLM Agents by Learning Latent Preference from User Edits | Code | 1 |

Benchmark Results

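| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 34.1 | | Unverified |
| 2 | BART-large | BLEU | 33.1 | | Unverified |
| 3 | BART-base | BLEU | 29.4 | | Unverified |
| 4 | MTN | BLEU | 21.7 | | Unverified |
| 5 | GPT-2 | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LED(Q,F) | Message-F1 | 19.54 | | Unverified |
| 2 | LED(Q,P,H) | Message-F1 | 16.14 | | Unverified |
| 3 | LED(Q,P) | Message-F1 | 14.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 22 | | Unverified |
| 2 | SimpleTOD | BLEU | 20.3 | | Unverified |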