SOTAVerified

Response Generation

A task where an agent should play the $DE$ role and generate a text to respond to a $P$ message.

Papers

Showing 151–200 of 914 papers

| Title | Status | Hype |
| --- | --- | --- |
| Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access | Code | 1 |
| Situated and Interactive Multimodal Conversations | Code | 1 |
| Fluent Response Generation for Conversational Question Answering | Code | 1 |
| SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching | Code | 1 |
| A Simple Language Model for Task-Oriented Dialogue | Code | 1 |
| A Controllable Model of Grounded Response Generation | Code | 1 |
| Conversations with Search Engines: SERP-based Conversational Response Generation | Code | 1 |
| Multi-Domain Dialogue Acts and Response Co-Generation | Code | 1 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | Code | 1 |
| Variational Transformers for Diverse Response Generation | Code | 1 |
| Non-Autoregressive Dialog State Tracking | Code | 1 |
| Automating App Review Response Generation | Code | 1 |
| DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation | Code | 1 |
| Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset | Code | 1 |
| Language Models are Unsupervised Multitask Learners | Code | 1 |
| MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling | Code | 1 |
| Polite Dialogue Generation Without Parallel Data | Code | 1 |
| Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky | | 0 |
| Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems | | 0 |
| SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification | | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Code | 0 |
| Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation | | 0 |
| CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Code | 0 |
| AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders | | 0 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions | | 0 |
| Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning | | 0 |
| Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps | | 0 |
| DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization | | 0 |
| Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | | 0 |
| Void in Language Models | Code | 0 |
| DecIF: Improving Instruction-Following through Meta-Decomposition | | 0 |
| Multi-Armed Bandits Meet Large Language Models | | 0 |
| Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges | | 0 |
| ProDS: Preference-oriented Data Selection for Instruction Tuning | | 0 |
| Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph | Code | 0 |
| DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | | 0 |
| GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs | | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | | 0 |
| Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements | | 0 |
| PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight | | 0 |
| Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Code | 0 |
| Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation | | 0 |
| LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval | | 0 |
| Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | | 0 |
| The Quantum LLM: Modeling Semantic Spaces with Quantum Principles | | 0 |
| SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Code | 0 |
| RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | | 0 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Code | 0 |
| Hawkeye:Efficient Reasoning with Model Collaboration | | 0 |
| Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | | 0 |

Benchmark Results

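| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaCE | BLEU | 34.1 | | Unverified |
| 2 | BART-large | BLEU | 33.1 | | Unverified |
| 3 | BART-base | BLEU | 29.4 | | Unverified |
| 4 | MTN | BLEU | 21.7 | | Unverified |
| 5 | GPT-2 | BLEU | 19.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LED(Q,F) | Message-F1 | 19.54 | | Unverified |
| 2 | LED(Q,P,H) | Message-F1 | 16.14 | | Unverified |
| 3 | LED(Q,P) | Message-F1 | 14.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | PaCE | BLEU | 22 | | Unverified |
| 2 | SimpleTOD | BLEU | 20.3 | | Unverified |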