SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 101-150 of 914 papers

| Title | Status | Hype |
|---|---|---|
| Socio-Emotional Response Generation: A Human Evaluation Protocol for LLM-Based Conversational Systems | | 0 |
| Strategic Prompting for Conversational Tasks: A Comparative Analysis of Large Language Models Across Diverse Conversational Tasks | | 0 |
| Chain of Attack: On the Robustness of Vision-Language Models Against Transfer-Based Adversarial Attacks | | 0 |
| IRLab@iKAT24: Learned Sparse Retrieval with Multi-aspect LLM Query Generation for Conversational Search | | 0 |
| Safe + Safe = Unsafe? Exploring How Safe Images Can Be Exploited to Jailbreak Large Vision-Language Models | Code | 0 |
| Unstructured Text Enhanced Open-domain Dialogue System: A Systematic Survey | | 0 |
| HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals | | 0 |
| LLM-Ref: Enhancing Reference Handling in Technical Writing with Large Language Models | | 0 |
| Emotional RAG: Enhancing Role-Playing Agents through Emotional Retrieval | Code | 1 |
| Dynamic Strategy Planning for Efficient Question Answering with Large Language Models | | 0 |
| CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation | Code | 2 |
| Multi-aspect Depression Severity Assessment via Inductive Dialogue System | | 0 |
| Can Users Detect Biases or Factual Errors in Generated Responses in Conversational Information-Seeking? | Code | 0 |
| A Stack-Propagation Framework for Low-Resource Personalized Dialogue Generation | | 0 |
| LLM-Aided Efficient Hardware Design Automation | | 0 |
| Bridging Search and Recommendation in Generative Retrieval: Does One Task Help the Other? | | 0 |
| Policy-driven Knowledge Selection and Response Generation for Document-grounded Dialogue | | 0 |
| A Survey of Conversational Search | | 0 |
| Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience Report | Code | 1 |
| Information for Conversation Generation: Proposals Utilising Knowledge Graphs | | 0 |
| ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Code | 0 |
| CBT-Bench: Evaluating Large Language Models on Assisting Cognitive Behavior Therapy | | 0 |
| Self-adaptive Multimodal Retrieval-Augmented Generation | Code | 0 |
| On the Capacity of Citation Generation by Large Language Models | | 0 |
| Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization | Code | 2 |
| GIVE: Structured Reasoning with Knowledge Graph Inspired Veracity Extrapolation | | 0 |
| IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities | | 0 |
| Uncovering Factor Level Preferences to Improve Human-Model Alignment | | 0 |
| Grounding is All You Need? Dual Temporal Grounding for Video Dialog | | 0 |
| Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation | | 0 |
| Toxic Subword Pruning for Dialogue Response Generation on Large Language Models | | 0 |
| PersoBench: Benchmarking Personalized Response Generation in Large Language Models | Code | 0 |
| Emotion-Aware Embedding Fusion in LLMs (Flan-T5, LLAMA 2, DeepSeek-R1, and ChatGPT 4) for Intelligent Response Generation | | 0 |
| Blind Spatial Impulse Response Generation from Separate Room- and Scene-Specific Information | | 0 |
| Learning When to Retrieve, What to Rewrite, and How to Respond in Conversational QA | | 0 |
| Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption Utilization | Code | 0 |
| Recent Advancement of Emotion Cognition in Large Language Models | | 0 |
| VERA: Validation and Enhancement for Retrieval Augmented systems | | 0 |
| Enabling Real-Time Conversations with Minimal Training Costs | | 0 |
| Investigating Context-Faithfulness in Large Language Models: The Roles of Memory Strength and Evidence Style | | 0 |
| ReflectDiffu:Reflect between Emotion-intent Contagion and Mimicry for Empathetic Response Generation via a RL-Diffusion Framework | | 0 |
| Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education | Code | 0 |
| CogniDual Framework: Self-Training Large Language Models within a Dual-System Theoretical Framework for Improving Cognitive Tasks | | 0 |
| YA-TA: Towards Personalized Question-Answering Teaching Assistants using Instructor-Student Dual Retrieval-augmented Knowledge Fusion | | 0 |
| Towards Empathetic Conversational Recommender Systems | Code | 1 |
| MaFeRw: Query Rewriting with Multi-Aspect Feedbacks for Retrieval-Augmented Large Language Models | Code | 0 |
| Enhancing Dialogue Generation in Werewolf Game Through Situation Analysis and Persuasion Strategies | | 0 |
| Preference-Guided Reflective Sampling for Aligning Language Models | Code | 0 |
| Cause-Aware Empathetic Response Generation via Chain-of-Thought Fine-Tuning | | 0 |
| LBC: Language-Based-Classifier for Out-Of-Variable Generalization | Code | 0 |

Benchmark Results

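| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 34.1 | | Unverified |
| 2 | BART-large | BLEU | 33.1 | | Unverified |
| 3 | BART-base | BLEU | 29.4 | | Unverified |
| 4 | MTN | BLEU | 21.7 | | Unverified |
| 5 | GPT-2 | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LED(Q,F) | Message-F1 | 19.54 | | Unverified |
| 2 | LED(Q,P,H) | Message-F1 | 16.14 | | Unverified |
| 3 | LED(Q,P) | Message-F1 | 14.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PaCE | BLEU | 22 | | Unverified |
| 2 | SimpleTOD | BLEU | 20.3 | | Unverified |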