SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 150 of 914 papers

TitleStatusHype
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria RerankingCode13
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language ModelsCode11
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in ChineseCode4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented GenerationCode4
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-ReflectionCode4
From Matching to Generation: A Survey on Generative Information RetrievalCode3
Self-Refine: Iterative Refinement with Self-FeedbackCode3
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference ChecklistCode3
Tool Learning with Large Language Models: A SurveyCode3
ISR-DPO: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective DPOCode2
On Evaluating Adversarial Robustness of Large Vision-Language ModelsCode2
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education SystemsCode2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function OptimizationCode2
MASS: Masked Sequence to Sequence Pre-training for Language GenerationCode2
CtrlA: Adaptive Retrieval-Augmented Generation via Inherent ControlCode2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation GenerationCode2
STICKERCONV: Generating Multimodal Empathetic Responses from ScratchCode2
VideoRAG: Retrieval-Augmented Generation over Video CorpusCode2
SQuARe: A Large-Scale Dataset of Sensitive Questions and Acceptable Responses Created Through Human-Machine CollaborationCode2
Towards a Unified Multi-Dimensional Evaluator for Text GenerationCode2
Fact Finder -- Enhancing Domain Expertise of Large Language Models by Incorporating Knowledge GraphsCode2
Large Language Models as Zero-shot Dialogue State Tracker through Function CallingCode2
Hello Again! LLM-powered Personalized Agent for Long-term DialogueCode2
Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
DIALOGPT : Large-Scale Generative Pre-training for Conversational Response GenerationCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
Compressing Context to Enhance Inference Efficiency of Large Language ModelsCode2
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular FusionCode2
Developing Retrieval Augmented Generation (RAG) based LLM Systems from PDFs: An Experience ReportCode1
BoB: BERT Over BERT for Training Persona-based Dialogue Models from Limited Personalized DataCode1
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document ContextualizationCode1
A Comprehensive Assessment of Dialog Evaluation MetricsCode1
CR-Walker: Tree-Structured Graph Reasoning and Dialog Acts for Conversational RecommendationCode1
DialogBERT: Discourse-Aware Response Generation via Learning to Recover and Rank UtterancesCode1
Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge AccessCode1
Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access Track in DSTC9Code1
Cue-word Driven Neural Response Generation with a Shrinking VocabularyCode1
Automating App Review Response GenerationCode1
BI-MDRG: Bridging Image History in Multimodal Dialogue Response GenerationCode1
Bringing the State-of-the-Art to Customers: A Neural Agent Assistant Framework for Customer Service SupportCode1
Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge GraphsCode1
DialoGPT: Large-Scale Generative Pre-training for Conversational Response GenerationCode1
A Model of Cross-Lingual Knowledge-Grounded Response Generation for Open-Domain Dialogue SystemsCode1
Contrast and Generation Make BART a Good Dialogue Emotion RecognizerCode1
Controlling Dialogue Generation with Semantic ExemplarsCode1
Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term ConversationsCode1
CONFLARE: CONFormal LArge language model REtrievalCode1
Affective Decoding for Empathetic Response GenerationCode1
Conversations with Search Engines: SERP-based Conversational Response GenerationCode1
Show:102550
← PrevPage 1 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PaCEBLEU34.1Unverified
2BART-largeBLEU33.1Unverified
3BART-baseBLEU29.4Unverified
4MTNBLEU21.7Unverified
5GPT-2BLEU19.2Unverified
#ModelMetricClaimedVerifiedStatus
1LED(Q,F)Message-F119.54Unverified
2LED(Q,P,H)Message-F116.14Unverified
3LED(Q,P)Message-F114.25Unverified
#ModelMetricClaimedVerifiedStatus
1PaCEBLEU22Unverified
2SimpleTODBLEU20.3Unverified