SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 125 of 914 papers

TitleStatusHype
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria RerankingCode13
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language ModelsCode11
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionCode4
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in ChineseCode4
Self-Refine: Iterative Refinement with Self-FeedbackCode3
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference ChecklistCode3
From Matching to Generation: A Survey on Generative Information RetrievalCode3
Tool Learning with Large Language Models: A SurveyCode3
On Evaluating Adversarial Robustness of Large Vision-Language ModelsCode2
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
MASS: Masked Sequence to Sequence Pre-training for Language GenerationCode2
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge GraphsCode2
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPOCode2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function OptimizationCode2
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response GenerationCode2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation GenerationCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular FusionCode2
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education SystemsCode2
Hello Again! LLM-powered Personalized Agent for Long-term DialogueCode2
Compressing Context to Enhance Inference Efficiency of Large Language ModelsCode2
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent ControlCode2
Large Language Models as Zero-shot Dialogue State Tracker through Function CallingCode2
Show:102550
← PrevPage 1 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified