SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 201250 of 914 papers

TitleStatusHype
When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing0
Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions0
DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care0
Clean & Clear: Feasibility of Safe LLM Clinical Guidance0
CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent InteractionsCode0
GINGER: Grounded Information Nugget-Based Generation of ResponsesCode0
Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization0
Conversational User-AI Intervention: A Study on Prompt Rewriting for Improved LLM Response Generation0
FG-RAG: Enhancing Query-Focused Summarization with Context-Aware Fine-Grained Graph RAGCode0
Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models0
LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant0
LLM-Safety Evaluations Lack Robustness0
LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation0
The RAG Paradox: A Black-Box Attack Exploiting Unintentional Vulnerabilities in Retrieval-Augmented Generation Systems0
ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis0
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding0
SS-MPC: A Sequence-Structured Multi-Party Conversation System0
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance0
PSCon: Product Search Through ConversationsCode0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems0
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation0
Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation0
DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services0
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception0
Efficient Response Generation Method Selection for Fine-Tuning Large Language Models0
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System0
DiMA: An LLM-Powered Ride-Hailing Assistant at DiDi0
Grammar Control in Dialogue Response Generation for Language Learning ChatbotsCode0
On Memory Construction and Retrieval for Personalized Conversational Agents0
MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers0
Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation0
CAMI: A Counselor Agent Supporting Motivational Interviewing through State Inference and Topic Exploration0
A Video-grounded Dialogue Dataset and Metric for Event-driven ActivitiesCode0
Open-Source Retrieval Augmented Generation Framework for Retrieving Accurate Medication Insights from Formularies for African Healthcare Workers0
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas0
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation0
CAPRAG: A Large Language Model Solution for Customer Service and Automatic Reporting using Vector and Graph Retrieval-Augmented Generation0
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation0
Reference-free Evaluation Metrics for Text Generation: A Survey0
Advancing Multi-Party Dialogue Framework with Speaker-ware Contrastive Learning0
Can MLLMs Generalize to Multi-Party dialog? Exploring Multilingual Response Generation in Complex Scenarios0
BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response GenerationCode0
Applying General Turn-taking Models to Conversational Human-Robot Interaction0
SUGAR: Leveraging Contextual Confidence for Smarter Retrieval0
RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance0
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation0
Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling0
LLM-driven Multimodal and Multi-Identity Listening Head Generation0
From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs0
Survey on Abstractive Text Summarization: Dataset, Models, and MetricsCode0
Show:102550
← PrevPage 5 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified