SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 101125 of 914 papers

TitleStatusHype
Socio-Emotional Response Generation: A Human Evaluation Protocol for LLM-Based Conversational Systems0
Strategic Prompting for Conversational Tasks: A Comparative Analysis of Large Language Models Across Diverse Conversational Tasks0
Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks0
IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query Generation for Conversational Search0
Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language ModelsCode0
Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey0
HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals0
LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models0
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation GenerationCode2
Dynamic Strategy Planning for Efficient Question Answering with Large Language Models0
Emotional RAG: Enhancing Role-Playing Agents through Emotional RetrievalCode1
Multi-aspect Depression Severity Assessment via Inductive Dialogue System0
Can Users Detect Biases or Factual Errors in Generated Responses in Conversational Information-Seeking?Code0
A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation0
LLM-Aided Efficient Hardware Design Automation0
Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other?0
Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue0
A Survey of Conversational Search0
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
Information for Conversation Generation: Proposals Utilising Knowledge Graphs0
ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope QuestionsCode0
CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy0
Self-adaptive Multimodal Retrieval-Augmented GenerationCode0
On the Capacity of Citation Generation by Large Language Models0
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function OptimizationCode2
Show:102550
← PrevPage 5 of 37Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified