Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 914 papers

Title	Date	Tasks	Status	Hype
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky	Jul 4, 2025	Response Generation	—Unverified	0
Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems	Jun 28, 2025	RAGResponse Generation	—Unverified	0
SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification	Jun 20, 2025	Mixture-of-ExpertsResponse Generation	—Unverified	0
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents	Jun 17, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation	Jun 14, 2025	Response Generation	—Unverified	0
CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training	Jun 12, 2025	RAGResponse Generation	CodeCode Available	0
AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders	May 30, 2025	Response Generation	—Unverified	0
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions	May 27, 2025	Audio-Visual SynchronizationConversational Response Generation	—Unverified	0
Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning	May 24, 2025	Multiple-choicePrompt Engineering	—Unverified	0
Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps	May 23, 2025	Language ModelingLanguage Modelling	—Unverified	0
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization	May 22, 2025	Response Generation	—Unverified	0
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning	May 22, 2025	FormQuestion Answering	CodeCode Available	1
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	May 21, 2025	Document SummarizationHallucination	—Unverified	0
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs	May 21, 2025	Knowledge DistillationKnowledge Graphs	CodeCode Available	1
DecIF: Improving Instruction-Following through Meta-Decomposition	May 20, 2025	Instruction FollowingResponse Generation	—Unverified	0
Void in Language Models	May 20, 2025	MMLUResponse Generation	CodeCode Available	0
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges	May 19, 2025	Response Generation	—Unverified	0
ProDS: Preference-oriented Data Selection for Instruction Tuning	May 19, 2025	Response Generation	—Unverified	0
Multi-Armed Bandits Meet Large Language Models	May 19, 2025	Decision MakingMulti-Armed Bandits	—Unverified	0
Neuro-Symbolic Query Compiler	May 17, 2025	RAGResponse Generation	CodeCode Available	1
GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs	May 15, 2025	RAGResponse Generation	—Unverified	0
Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph	May 15, 2025	Knowledge GraphsRAG	CodeCode Available	0
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs	May 15, 2025	BenchmarkingFairness	—Unverified	0
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents	May 2, 2025	Instruction FollowingResponse Generation	—Unverified	0
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception	Apr 29, 2025	counterfactualHallucination	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 37Next →

All datasets SIMMC2.0 ArgSciChat MMConv

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	34.1	—	Unverified
2	BART-large	BLEU	33.1	—	Unverified
3	BART-base	BLEU	29.4	—	Unverified
4	MTN	BLEU	21.7	—	Unverified
5	GPT-2	BLEU	19.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LED(Q,F)	Message-F1	19.54	—	Unverified
2	LED(Q,P,H)	Message-F1	16.14	—	Unverified
3	LED(Q,P)	Message-F1	14.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	22	—	Unverified
2	SimpleTOD	BLEU	20.3	—	Unverified