Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 914 papers

Title	Date	Tasks	Status	Hype
Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky	Jul 4, 2025	Response Generation	—Unverified	0
Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems	Jun 28, 2025	RAGResponse Generation	—Unverified	0
SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification	Jun 20, 2025	Mixture-of-ExpertsResponse Generation	—Unverified	0
From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents	Jun 17, 2025	Language ModelingLanguage Modelling	CodeCode Available	0
Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation	Jun 14, 2025	Response Generation	—Unverified	0
CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training	Jun 12, 2025	RAGResponse Generation	CodeCode Available	0
AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders	May 30, 2025	Response Generation	—Unverified	0
OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions	May 27, 2025	Audio-Visual SynchronizationConversational Response Generation	—Unverified	0
Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning	May 24, 2025	Multiple-choicePrompt Engineering	—Unverified	0
Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps	May 23, 2025	Language ModelingLanguage Modelling	—Unverified	0
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning	May 22, 2025	FormQuestion Answering	CodeCode Available	1
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization	May 22, 2025	Response Generation	—Unverified	0
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	May 21, 2025	Document SummarizationHallucination	—Unverified	0
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs	May 21, 2025	Knowledge DistillationKnowledge Graphs	CodeCode Available	1
DecIF: Improving Instruction-Following through Meta-Decomposition	May 20, 2025	Instruction FollowingResponse Generation	—Unverified	0
Void in Language Models	May 20, 2025	MMLUResponse Generation	CodeCode Available	0
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges	May 19, 2025	Response Generation	—Unverified	0
ProDS: Preference-oriented Data Selection for Instruction Tuning	May 19, 2025	Response Generation	—Unverified	0
Multi-Armed Bandits Meet Large Language Models	May 19, 2025	Decision MakingMulti-Armed Bandits	—Unverified	0
Neuro-Symbolic Query Compiler	May 17, 2025	RAGResponse Generation	CodeCode Available	1
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs	May 15, 2025	BenchmarkingFairness	—Unverified	0
GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs	May 15, 2025	RAGResponse Generation	—Unverified	0
Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph	May 15, 2025	Knowledge GraphsRAG	CodeCode Available	0
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents	May 2, 2025	Instruction FollowingResponse Generation	—Unverified	0
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception	Apr 29, 2025	counterfactualHallucination	CodeCode Available	1
Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements	Apr 29, 2025	Deep LearningResponse Generation	—Unverified	0
PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight	Apr 26, 2025	Mixture-of-ExpertsPICO	—Unverified	0
Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Apr 25, 2025	Natural Language UnderstandingResponse Generation	CodeCode Available	0
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation	Apr 24, 2025	Conversational Recommendationcounterfactual	—Unverified	0
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval	Apr 19, 2025	Information RetrievalQuestion Answering	—Unverified	0
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild	Apr 17, 2025	Decision MakingInformation Retrieval	—Unverified	0
MSCRS: Multi-modal Semantic Graph Prompt Learning Framework for Conversational Recommender Systems	Apr 15, 2025	Prompt LearningRecommendation Systems	CodeCode Available	1
The Quantum LLM: Modeling Semantic Spaces with Quantum Principles	Apr 13, 2025	Response Generationvalid	—Unverified	0
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness	Apr 8, 2025	ChatbotExtractive Summarization	CodeCode Available	0
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model	Apr 7, 2025	Image Captioningimage-classification	—Unverified	0
AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence	Apr 6, 2025	MemorizationResponse Generation	CodeCode Available	0
Hawkeye:Efficient Reasoning with Model Collaboration	Apr 1, 2025	Mathmodel	—Unverified	0
Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation	Mar 31, 2025	Knowledge GraphsQuestion Answering	—Unverified	0
When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing	Mar 30, 2025	EthicsResponse Generation	—Unverified	0
Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions	Mar 28, 2025	Response Generation	—Unverified	0
Clean & Clear: Feasibility of Safe LLM Clinical Guidance	Mar 26, 2025	ChatbotDiagnostic	—Unverified	0
DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care	Mar 26, 2025	Knowledge GraphsResponse Generation	—Unverified	0
CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions	Mar 25, 2025	Response Generationtext similarity	CodeCode Available	0
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization	Mar 23, 2025	Reinforcement Learning (RL)Response Generation	—Unverified	0
GINGER: Grounded Information Nugget-Based Generation of Responses	Mar 23, 2025	RAGResponse Generation	CodeCode Available	0
Conversational User-AI Intervention: A Study on Prompt Rewriting for Improved LLM Response Generation	Mar 21, 2025	ChatbotResponse Generation	—Unverified	0
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking	Mar 14, 2025	AllLarge Language Model	CodeCode Available	14
FG-RAG: Enhancing Query-Focused Summarization with Context-Aware Fine-Grained Graph RAG	Mar 13, 2025	DiversityQuery-focused Summarization	CodeCode Available	0
Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models	Mar 8, 2025	Response Generation	—Unverified	0
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Mar 5, 2025	HallucinationInstruction Following	CodeCode Available	11

Show:10 25 50

← PrevPage 1 of 19Next →

All datasets SIMMC2.0 ArgSciChat MMConv

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	34.1	—	Unverified
2	BART-large	BLEU	33.1	—	Unverified
3	BART-base	BLEU	29.4	—	Unverified
4	MTN	BLEU	21.7	—	Unverified
5	GPT-2	BLEU	19.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LED(Q,F)	Message-F1	19.54	—	Unverified
2	LED(Q,P,H)	Message-F1	16.14	—	Unverified
3	LED(Q,P)	Message-F1	14.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	22	—	Unverified
2	SimpleTOD	BLEU	20.3	—	Unverified