Response Generation

A task in which an agent plays the $DE$ role and generates text in response to a $P$ message.

Papers

Showing 176–200 of 914 papers

Title	Date	Tasks	Status
Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning	May 24, 2025	Multiple-choice, Prompt Engineering	Unverified
Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps	May 23, 2025	Language Modeling, Language Modelling	Unverified
DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization	May 22, 2025	Response Generation	Unverified
Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization	May 21, 2025	Document Summarization, Hallucination	Unverified
Void in Language Models	May 20, 2025	MMLU, Response Generation	Code Available
DecIF: Improving Instruction-Following through Meta-Decomposition	May 20, 2025	Instruction Following, Response Generation	Unverified
Multi-Armed Bandits Meet Large Language Models	May 19, 2025	Decision Making, Multi-Armed Bandits	Unverified
Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges	May 19, 2025	Response Generation	Unverified
ProDS: Preference-oriented Data Selection for Instruction Tuning	May 19, 2025	Response Generation	Unverified
Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph	May 15, 2025	Knowledge Graphs, RAG	Code Available
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs	May 15, 2025	Benchmarking, Fairness	Unverified
GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs	May 15, 2025	RAG, Response Generation	Unverified
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents	May 2, 2025	Instruction Following, Response Generation	Unverified
Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements	Apr 29, 2025	Deep Learning, Response Generation	Unverified
PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight	Apr 26, 2025	Mixture-of-Experts, PICO	Unverified
Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Apr 25, 2025	Natural Language Understanding, Response Generation	Code Available
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation	Apr 24, 2025	Conversational Recommendation, counterfactual	Unverified
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval	Apr 19, 2025	Information Retrieval, Question Answering	Unverified
Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild	Apr 17, 2025	Decision Making, Information Retrieval	Unverified
The Quantum LLM: Modeling Semantic Spaces with Quantum Principles	Apr 13, 2025	Response Generation, valid	Unverified
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness	Apr 8, 2025	Chatbot, Extractive Summarization	Code Available
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model	Apr 7, 2025	Image Captioning, image-classification	Unverified
AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence	Apr 6, 2025	Memorization, Response Generation	Code Available
Hawkeye:Efficient Reasoning with Model Collaboration	Apr 1, 2025	Math, model	Unverified
Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation	Mar 31, 2025	Knowledge Graphs, Question Answering	Unverified

Datasets: SIMMC2.0, ArgSciChat, MMConv

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	34.1	—	Unverified
2	BART-large	BLEU	33.1	—	Unverified
3	BART-base	BLEU	29.4	—	Unverified
4	MTN	BLEU	21.7	—	Unverified
5	GPT-2	BLEU	19.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	LED(Q,F)	Message-F1	19.54	—	Unverified
2	LED(Q,P,H)	Message-F1	16.14	—	Unverified
3	LED(Q,P)	Message-F1	14.25	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PaCE	BLEU	22	—	Unverified
2	SimpleTOD	BLEU	20.3	—	Unverified