SOTAVerified

Response Generation

A task in which an agent plays the $DE$ role and generates text in response to a $P$ message.

Papers

Showing 201–225 of 914 papers

| Title | Status | Hype |
|---|---|---|
| When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing | | 0 |
| Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions | | 0 |
| DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care | | 0 |
| Clean & Clear: Feasibility of Safe LLM Clinical Guidance | | 0 |
| CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions | Code | 0 |
| Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization | | 0 |
| GINGER: Grounded Information Nugget-Based Generation of Responses | Code | 0 |
| Conversational User-AI Intervention: A Study on Prompt Rewriting for Improved LLM Response Generation | | 0 |
| FG-RAG: Enhancing Query-Focused Summarization with Context-Aware Fine-Grained Graph RAG | Code | 0 |
| Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models | | 0 |
| LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant | | 0 |
| LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation | | 0 |
| LLM-Safety Evaluations Lack Robustness | | 0 |
| The RAG Paradox: A Black-Box Attack Exploiting Unintentional Vulnerabilities in Retrieval-Augmented Generation Systems | | 0 |
| ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis | | 0 |
| SS-MPC: A Sequence-Structured Multi-Party Conversation System | | 0 |
| AAD-LLM: Neural Attention-Driven Auditory Scene Understanding | | 0 |
| Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance | | 0 |
| PSCon: Product Search Through Conversations | Code | 0 |
| LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems | | 0 |
| On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation | | 0 |
| Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation | | 0 |
| Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception | | 0 |
| DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services | | 0 |
| Efficient Response Generation Method Selection for Fine-Tuning Large Language Models | | 0 |

Benchmark Results

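| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 34.1 | | Unverified |
| 2 | BART-large | BLEU | 33.1 | | Unverified |
| 3 | BART-base | BLEU | 29.4 | | Unverified |
| 4 | MTN | BLEU | 21.7 | | Unverified |
| 5 | GPT-2 | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LED(Q,F) | Message-F1 | 19.54 | | Unverified |
| 2 | LED(Q,P,H) | Message-F1 | 16.14 | | Unverified |
| 3 | LED(Q,P) | Message-F1 | 14.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 22 | | Unverified |
| 2 | SimpleTOD | BLEU | 20.3 | | Unverified |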