SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 51100 of 914 papers

TitleStatusHype
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language ModelsCode11
LINGOLY-TOO: Disentangling Memorisation from Reasoning with Linguistic Templatisation and Orthographic Obfuscation0
LLM-Safety Evaluations Lack Robustness0
LexRAG: Benchmarking Retrieval-Augmented Generation in Multi-Turn Legal Consultation ConversationCode1
ProAI: Proactive Multi-Agent Conversational AI with Structured Knowledge Base for Psychiatric Diagnosis0
The RAG Paradox: A Black-Box Attack Exploiting Unintentional Vulnerabilities in Retrieval-Augmented Generation Systems0
AAD-LLM: Neural Attention-Driven Auditory Scene Understanding0
SS-MPC: A Sequence-Structured Multi-Party Conversation System0
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems0
PSCon: Product Search Through ConversationsCode0
On-Device LLMs for Home Assistant: Dual Role in Intent Detection and Response Generation0
Can LLMs Simulate Social Media Engagement? A Study on Action-Guided Response Generation0
Efficient Response Generation Method Selection for Fine-Tuning Large Language Models0
DiSCo: Device-Server Collaborative LLM-Based Text Streaming Services0
Towards Fully Exploiting LLM Internal States to Enhance Knowledge Boundary Perception0
MuDoC: An Interactive Multimodal Document-grounded Conversational AI System0
DiMA: An LLM-Powered Ride-Hailing Assistant at DiDi0
Grammar Control in Dialogue Response Generation for Language Learning ChatbotsCode0
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language ModelsCode1
On Memory Construction and Retrieval for Personalized Conversational Agents0
MultiQ&A: An Analysis in Measuring Robustness via Automated Crowdsourcing of Question Perturbations and Answers0
Vision-Integrated LLMs for Autonomous Driving Assistance : Human Performance Comparison and Trust Evaluation0
CAMI: A Counselor Agent Supporting Motivational Interviewing through State Inference and Topic Exploration0
A Video-grounded Dialogue Dataset and Metric for Event-driven ActivitiesCode0
Open-Source Retrieval Augmented Generation Framework for Retrieving Accurate Medication Insights from Formularies for African Healthcare Workers0
OpenCharacter: Training Customizable Role-Playing LLMs with Large-Scale Synthetic Personas0
CAPRAG: A Large Language Model Solution for Customer Service and Automatic Reporting using Vector and Graph Retrieval-Augmented Generation0
RPO: Retrieval Preference Optimization for Robust Retrieval-Augmented Generation0
EchoLM: Accelerating LLM Serving with Real-time Knowledge Distillation0
Reference-free Evaluation Metrics for Text Generation: A Survey0
Can MLLMs Generalize to Multi-Party dialog? Exploring Multilingual Response Generation in Complex Scenarios0
Advancing Multi-Party Dialogue Framework with Speaker-ware Contrastive Learning0
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education SystemsCode2
BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response GenerationCode0
Applying General Turn-taking Models to Conversational Human-Robot Interaction0
VideoRAG: Retrieval-Augmented Generation over Video CorpusCode2
SUGAR: Leveraging Contextual Confidence for Smarter Retrieval0
RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance0
Reasoning-Enhanced Self-Training for Long-Form Personalized Text Generation0
Sentiment-guided Commonsense-aware Response Generation for Mental Health Counseling0
LLM-driven Multimodal and Multi-Identity Listening Head Generation0
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
From Hallucinations to Facts: Enhancing Language Models with Curated Knowledge Graphs0
Survey on Abstractive Text Summarization: Dataset, Models, and MetricsCode0
Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World EnvironmentsCode0
Personalized LLM for Generating Customized Responses to the Same Query from Different UsersCode0
Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm0
ALMA: Alignment with Minimal Annotation0
TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLMCode0
Show:102550
← PrevPage 2 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified