SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 251275 of 914 papers

TitleStatusHype
Unified Understanding of Environment, Task, and Human for Human-Robot Interaction in Real-World EnvironmentsCode0
Agentic AI-Driven Technical Troubleshooting for Enterprise Systems: A Novel Weighted Retrieval-Augmented Generation Paradigm0
Personalized LLM for Generating Customized Responses to the Same Query from Different UsersCode0
ALMA: Alignment with Minimal Annotation0
TOOL-ED: Enhancing Empathetic Response Generation with the Tool Calling Capability of LLMCode0
Strategic Prompting for Conversational Tasks: A Comparative Analysis of Large Language Models Across Diverse Conversational Tasks0
Socio-Emotional Response Generation: A Human Evaluation Protocol for LLM-Based Conversational Systems0
Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks0
IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query Generation for Conversational Search0
Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language ModelsCode0
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey0
HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals0
LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models0
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models0
Multi-aspect Depression Severity Assessment via Inductive Dialogue System0
Can Users Detect Biases or Factual Errors in Generated Responses in Conversational Information-Seeking?Code0
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation0
LLM-Aided Efficient Hardware Design Automation0
Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?0
Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue0
Information for Conversation Generation: Proposals Utilising Knowledge Graphs0
A Survey of Conversational Search0
ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope QuestionsCode0
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy0
On the Capacity of Citation Generation by Large Language Models0
Show:102550
← PrevPage 11 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified