SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 2130 of 914 papers

TitleStatusHype
GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs0
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs0
Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge GraphCode0
PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents0
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object PerceptionCode1
Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements0
PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight0
Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal AssistantCode0
Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation0
LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval0
Show:102550
← PrevPage 3 of 92Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified