| Beyond Domain APIs: Task-oriented Conversational Modeling with Unstructured Knowledge Access | Jun 5, 2020 | Response Generation, Task-Oriented Dialogue Systems | Code Available | 1 |
| Situated and Interactive Multimodal Conversations | Jun 2, 2020 | Response Generation | Code Available | 1 |
| Fluent Response Generation for Conversational Question Answering | May 21, 2020 | Conversational Question Answering, Data Augmentation | Code Available | 1 |
| SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching | May 11, 2020 | End-To-End Dialogue Modelling, Few-Shot Learning | Code Available | 1 |
| A Simple Language Model for Task-Oriented Dialogue | May 2, 2020 | Dialogue State Tracking, End-To-End Dialogue Modelling | Code Available | 1 |
| A Controllable Model of Grounded Response Generation | May 1, 2020 | Informativeness, model | Code Available | 1 |
| Conversations with Search Engines: SERP-based Conversational Response Generation | Apr 29, 2020 | Conversational Response Generation, Conversational Search | Code Available | 1 |
| Multi-Domain Dialogue Acts and Response Co-Generation | Apr 26, 2020 | Response Generation, Task-Oriented Dialogue Systems | Code Available | 1 |
| PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation | Apr 14, 2020 | Abstractive Text Summarization, Conversational Response Generation | Code Available | 1 |
| Variational Transformers for Diverse Response Generation | Mar 28, 2020 | Decoder, Diversity | Code Available | 1 |
| Non-Autoregressive Dialog State Tracking | Feb 19, 2020 | dialog state tracking, Dialogue State Tracking | Code Available | 1 |
| Automating App Review Response Generation | Feb 10, 2020 | Response Generation | Code Available | 1 |
| DialoGPT: Large-Scale Generative Pre-training for Conversational Response Generation | Nov 1, 2019 | Conversational Response Generation, Response Generation | Code Available | 1 |
| Towards Scalable Multi-domain Conversational Agents: The Schema-Guided Dialogue Dataset | Sep 12, 2019 | 16k, Dialogue State Tracking | Code Available | 1 |
| Language Models are Unsupervised Multitask Learners | Feb 14, 2019 | Common Sense Reasoning, Coreference Resolution | Code Available | 1 |
| MultiWOZ -- A Large-Scale Multi-Domain Wizard-of-Oz Dataset for Task-Oriented Dialogue Modelling | Sep 29, 2018 | Response Generation | Code Available | 1 |
| Polite Dialogue Generation Without Parallel Data | May 8, 2018 | Decoder, Dialogue Generation | Code Available | 1 |
| Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky | Jul 4, 2025 | Response Generation | Unverified | 0 |
| Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems | Jun 28, 2025 | RAG, Response Generation | Unverified | 0 |
| SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification | Jun 20, 2025 | Mixture-of-Experts, Response Generation | Unverified | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation | Jun 14, 2025 | Response Generation | Unverified | 0 |
| CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Jun 12, 2025 | RAG, Response Generation | Code Available | 0 |
| AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders | May 30, 2025 | Response Generation | Unverified | 0 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions | May 27, 2025 | Audio-Visual Synchronization, Conversational Response Generation | Unverified | 0 |
| Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning | May 24, 2025 | Multiple-choice, Prompt Engineering | Unverified | 0 |
| Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps | May 23, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization | May 22, 2025 | Response Generation | Unverified | 0 |
| Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | May 21, 2025 | Document Summarization, Hallucination | Unverified | 0 |
| Void in Language Models | May 20, 2025 | MMLU, Response Generation | Code Available | 0 |
| DecIF: Improving Instruction-Following through Meta-Decomposition | May 20, 2025 | Instruction Following, Response Generation | Unverified | 0 |
| Multi-Armed Bandits Meet Large Language Models | May 19, 2025 | Decision Making, Multi-Armed Bandits | Unverified | 0 |
| Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges | May 19, 2025 | Response Generation | Unverified | 0 |
| ProDS: Preference-oriented Data Selection for Instruction Tuning | May 19, 2025 | Response Generation | Unverified | 0 |
| Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph | May 15, 2025 | Knowledge Graphs, RAG | Code Available | 0 |
| DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | May 15, 2025 | Benchmarking, Fairness | Unverified | 0 |
| GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs | May 15, 2025 | RAG, Response Generation | Unverified | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | May 2, 2025 | Instruction Following, Response Generation | Unverified | 0 |
| Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements | Apr 29, 2025 | Deep Learning, Response Generation | Unverified | 0 |
| PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight | Apr 26, 2025 | Mixture-of-Experts, PICO | Unverified | 0 |
| Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Apr 25, 2025 | Natural Language Understanding, Response Generation | Code Available | 0 |
| Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation | Apr 24, 2025 | Conversational Recommendation, counterfactual | Unverified | 0 |
| LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval | Apr 19, 2025 | Information Retrieval, Question Answering | Unverified | 0 |
| Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | Apr 17, 2025 | Decision Making, Information Retrieval | Unverified | 0 |
| The Quantum LLM: Modeling Semantic Spaces with Quantum Principles | Apr 13, 2025 | Response Generation, valid | Unverified | 0 |
| SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Apr 8, 2025 | Chatbot, Extractive Summarization | Code Available | 0 |
| RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Apr 7, 2025 | Image Captioning, image-classification | Unverified | 0 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Apr 6, 2025 | Memorization, Response Generation | Code Available | 0 |
| Hawkeye:Efficient Reasoning with Model Collaboration | Apr 1, 2025 | Math, model | Unverified | 0 |
| Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | Mar 31, 2025 | Knowledge Graphs, Question Answering | Unverified | 0 |