SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 301350 of 914 papers

TitleStatusHype
The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset GenerationCode0
Decoding Biases: Automated Methods and LLM Judges for Gender Bias Detection in Language Models0
StyEmp: Stylizing Empathetic Response Generation via Multi-Grained Prefix Encoder and Personality Reinforcement0
Recent Advances in Multi-Choice Machine Reading Comprehension: A Survey on Methods and Datasets0
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only0
From Feature Importance to Natural Language Explanations Using LLMs with RAGCode0
Improving Retrieval Augmented Language Model with Self-Reasoning0
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response GenerationCode0
Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model TutorsCode0
Attribute or Abstain: Large Language Models as Long Document AssistantsCode0
Efficient and Accurate Memorable Conversation Model using DPO based on sLLM0
VDMA: Video Question Answering with Dynamically Generated Multi-Agents0
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control0
LLM Internal States Reveal Hallucination Risk Faced With a QueryCode0
MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space0
Grounded and Transparent Response Generation for Conversational Information-Seeking Systems0
EmPO: Emotion Grounding for Empathetic Response Generation through Preference OptimizationCode0
Granite-Function Calling Model: Introducing Function Calling Abilities via Multi-task Learning of Granular Tasks0
Towards Comprehensive Preference Data Collection for Reward Modeling0
Context-augmented Retrieval: A Novel Framework for Fast Information Retrieval based Response Generation using Large Language Model0
Rewarding What Matters: Step-by-Step Reinforcement Learning for Task-Oriented Dialogue0
On-Policy Fine-grained Knowledge Feedback for Hallucination MitigationCode0
PSLM: Parallel Generation of Text and Speech with LLMs for Low-Latency Spoken Dialogue SystemsCode0
Towards Lifelong Dialogue Agents via Timeline-based Memory Management0
We Care: Multimodal Depression Detection and Knowledge Infused Mental Health Therapeutic Response Generation0
Detecting Response Generation Not Requiring Factual Judgment0
Leveraging Explicit Reasoning for Inference Integration in Commonsense-Augmented Dialogue ModelsCode0
Should We Fine-Tune or RAG? Evaluating Different Techniques to Adapt LLMs for DialogueCode0
Position Debiasing Fine-Tuning for Causal Perception in Long-Term Dialogue0
CODE: Contrasting Self-generated Description to Combat Hallucination in Large Multi-modal Models0
SLM as Guardian: Pioneering AI Safety with Small Language Models0
Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents0
Unifying Demonstration Selection and Compression for In-Context Learning0
Leveraging Logical Rules in Knowledge Editing: A Cherry on the Top0
Many Hands Make Light Work: Task-Oriented Dialogue System with Module-Based Mixture-of-Experts0
DuetSim: Building User Simulator with Dual Large Language Models for Task-Oriented DialoguesCode0
Enhancing Knowledge Retrieval with Topic Modeling for Knowledge-Grounded DialogueCode0
Self-Improving Customer Review Response Generation Based on LLMs0
R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models0
SUKHSANDESH: An Avatar Therapeutic Question Answering Platform for Sexual Education in Rural India0
Creative Beam Search: LLM-as-a-Judge For Improving Response Generation0
Evaluating the Application of ChatGPT in Outpatient Triage Guidance: A Comparative Study0
Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering0
Personalized LLM Response Generation with Parameterized Memory InjectionCode0
Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM0
Dynamic Demonstration Retrieval and Cognitive Understanding for Emotional Support ConversationCode0
RQ-RAG: Learning to Refine Queries for Retrieval Augmented GenerationCode0
Real Acoustic Fields: An Audio-Visual Room Acoustics Dataset and Benchmark0
Mix-Initiative Response Generation with Dynamic Prefix Tuning0
CTSM: Combining Trait and State Emotions for Empathetic Response ModelCode0
Show:102550
← PrevPage 7 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified