SOTAVerified

Response Generation

A task in which an agent plays the $DE$ role and generates text in response to a $P$ message.

Papers

Showing 151-200 of 914 papers

Title | Status | Hype
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation | Code | 0
BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation | Code | 1
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4
Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models | - | 0
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge Graphs | Code | 2
Empathy Level Alignment via Reinforcement Learning for Empathetic Response Generation | Code | 1
StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement | - | 0
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets | - | 0
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only | - | 0
From Feature Importance to Natural Language Explanations Using LLMs with RAG | Code | 0
Improving Retrieval Augmented Language Model with Self-Reasoning | - | 0
Towards Aligning Language Models with Textual Feedback | Code | 1
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation | Code | 0
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Code | 0
Attribute or Abstain: Large Language Models as Long Document Assistants | Code | 0
Efficient and Accurate Memorable Conversation Model using DPO based on sLLM | - | 0
VDMA: Video Question Answering with Dynamically Generated Multi-Agents | - | 0
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control | - | 0
LLM Internal States Reveal Hallucination Risk Faced With a Query | Code | 0
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Code | 1
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space | - | 0
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks | - | 0
EmPO: Emotion Grounding for Empathetic Response Generation through Preference Optimization | Code | 0
Grounded and Transparent Response Generation for Conversational Information-Seeking Systems | - | 0
Towards Comprehensive Preference Data Collection for Reward Modeling | - | 0
Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model | - | 0
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue | - | 0
On-Policy Fine-grained Knowledge Feedback for Hallucination Mitigation | Code | 0
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue Systems | Code | 0
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPO | Code | 2
Towards Lifelong Dialogue Agents via Timeline-based Memory Management | - | 0
ESCoT: Towards Interpretable Emotional Support Dialogue Systems | Code | 1
We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation | - | 0
Detecting Response Generation Not Requiring Factual Judgment | - | 0
Leveraging Explicit Reasoning for Inference Integration in Commonsense-Augmented Dialogue Models | Code | 0
Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for Dialogue | Code | 0
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue | Code | 2
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue | - | 0
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models | - | 0
SLM as Guardian: Pioneering AI Safety with Small Language Models | - | 0
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent Control | Code | 2
Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents | - | 0
Tool Learning with Large Language Models: A Survey | Code | 3
Unifying Demonstration Selection and Compression for In-Context Learning | - | 0
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs | Code | 1
Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top | - | 0
The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG) | Code | 1
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented Dialogues | Code | 0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts | - | 0

Benchmark Results

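# | Model | Metric | Claimed | Verified | Status
1 | PaCE | BLEU | 34.1 | - | Unverified
2 | BART-large | BLEU | 33.1 | - | Unverified
3 | BART-base | BLEU | 29.4 | - | Unverified
4 | MTN | BLEU | 21.7 | - | Unverified
5 | GPT-2 | BLEU | 19.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LED(Q,F) | Message-F1 | 19.54 | - | Unverified
2 | LED(Q,P,H) | Message-F1 | 16.14 | - | Unverified
3 | LED(Q,P) | Message-F1 | 14.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PaCE | BLEU | 22 | - | Unverified
2 | SimpleTOD | BLEU | 20.3 | - | Unverified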