| Disambiguation-Centric Finetuning Makes Enterprise Tool-Calling LLMs More Realistic and Less Risky | Jul 4, 2025 | Response Generation | —Unverified | 0 |
| Knowledge Augmented Finetuning Matters in both RAG and Agent Based Dialog Systems | Jun 28, 2025 | RAGResponse Generation | —Unverified | 0 |
| SAFEx: Analyzing Vulnerabilities of MoE-Based LLMs via Stable Safety-critical Expert Identification | Jun 20, 2025 | Mixture-of-ExpertsResponse Generation | —Unverified | 0 |
| From What to Respond to When to Respond: Timely Response Generation for Open-domain Dialogue Agents | Jun 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation | Jun 14, 2025 | Response Generation | —Unverified | 0 |
| CIIR@LiveRAG 2025: Optimizing Multi-Agent Retrieval Augmented Generation through Self-Training | Jun 12, 2025 | RAGResponse Generation | CodeCode Available | 0 |
| AMIA: Automatic Masking and Joint Intention Analysis Makes LVLMs Robust Jailbreak Defenders | May 30, 2025 | Response Generation | —Unverified | 0 |
| OmniResponse: Online Multimodal Conversational Response Generation in Dyadic Interactions | May 27, 2025 | Audio-Visual SynchronizationConversational Response Generation | —Unverified | 0 |
| Enhancing LLMs' Reasoning-Intensive Multimedia Search Capabilities through Fine-Tuning and Reinforcement Learning | May 24, 2025 | Multiple-choicePrompt Engineering | —Unverified | 0 |
| Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps | May 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning | May 22, 2025 | FormQuestion Answering | CodeCode Available | 1 |
| DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization | May 22, 2025 | Response Generation | —Unverified | 0 |
| Hallucinate at the Last in Long Response Generation: A Case Study on Long Document Summarization | May 21, 2025 | Document SummarizationHallucination | —Unverified | 0 |
| Deliberation on Priors: Trustworthy Reasoning of Large Language Models on Knowledge Graphs | May 21, 2025 | Knowledge DistillationKnowledge Graphs | CodeCode Available | 1 |
| DecIF: Improving Instruction-Following through Meta-Decomposition | May 20, 2025 | Instruction FollowingResponse Generation | —Unverified | 0 |
| Void in Language Models | May 20, 2025 | MMLUResponse Generation | CodeCode Available | 0 |
| Rethinking Stateful Tool Use in Multi-Turn Dialogues: Benchmarks and Challenges | May 19, 2025 | Response Generation | —Unverified | 0 |
| ProDS: Preference-oriented Data Selection for Instruction Tuning | May 19, 2025 | Response Generation | —Unverified | 0 |
| Multi-Armed Bandits Meet Large Language Models | May 19, 2025 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Neuro-Symbolic Query Compiler | May 17, 2025 | RAGResponse Generation | CodeCode Available | 1 |
| DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | May 15, 2025 | BenchmarkingFairness | —Unverified | 0 |
| GE-Chat: A Graph Enhanced RAG Framework for Evidential Response Generation of LLMs | May 15, 2025 | RAGResponse Generation | —Unverified | 0 |
| Personalizing Large Language Models using Retrieval Augmented Generation and Knowledge Graph | May 15, 2025 | Knowledge GraphsRAG | CodeCode Available | 0 |
| PIPA: A Unified Evaluation Protocol for Diagnosing Interactive Planning Agents | May 2, 2025 | Instruction FollowingResponse Generation | —Unverified | 0 |
| Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object Perception | Apr 29, 2025 | counterfactualHallucination | CodeCode Available | 1 |
| Deep Learning Characterizes Depression and Suicidal Ideation from Eye Movements | Apr 29, 2025 | Deep LearningResponse Generation | —Unverified | 0 |
| PICO: Secure Transformers via Robust Prompt Isolation and Cybersecurity Oversight | Apr 26, 2025 | Mixture-of-ExpertsPICO | —Unverified | 0 |
| Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant | Apr 25, 2025 | Natural Language UnderstandingResponse Generation | CodeCode Available | 0 |
| Beyond Whole Dialogue Modeling: Contextual Disentanglement for Conversational Recommendation | Apr 24, 2025 | Conversational Recommendationcounterfactual | —Unverified | 0 |
| LegalRAG: A Hybrid RAG System for Multilingual Legal Information Retrieval | Apr 19, 2025 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Accommodate Knowledge Conflicts in Retrieval-augmented LLMs: Towards Reliable Response Generation in the Wild | Apr 17, 2025 | Decision MakingInformation Retrieval | —Unverified | 0 |
| MSCRS: Multi-modal Semantic Graph Prompt Learning Framework for Conversational Recommender Systems | Apr 15, 2025 | Prompt LearningRecommendation Systems | CodeCode Available | 1 |
| The Quantum LLM: Modeling Semantic Spaces with Quantum Principles | Apr 13, 2025 | Response Generationvalid | —Unverified | 0 |
| SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Apr 8, 2025 | ChatbotExtractive Summarization | CodeCode Available | 0 |
| RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model | Apr 7, 2025 | Image Captioningimage-classification | —Unverified | 0 |
| AGITB: A Signal-Level Benchmark for Evaluating Artificial General Intelligence | Apr 6, 2025 | MemorizationResponse Generation | CodeCode Available | 0 |
| Hawkeye:Efficient Reasoning with Model Collaboration | Apr 1, 2025 | Mathmodel | —Unverified | 0 |
| Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation | Mar 31, 2025 | Knowledge GraphsQuestion Answering | —Unverified | 0 |
| When LLM Therapists Become Salespeople: Evaluating Large Language Models for Ethical Motivational Interviewing | Mar 30, 2025 | EthicsResponse Generation | —Unverified | 0 |
| Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions | Mar 28, 2025 | Response Generation | —Unverified | 0 |
| Clean & Clear: Feasibility of Safe LLM Clinical Guidance | Mar 26, 2025 | ChatbotDiagnostic | —Unverified | 0 |
| DEMENTIA-PLAN: An Agent-Based Framework for Multi-Knowledge Graph Retrieval-Augmented Generation in Dementia Care | Mar 26, 2025 | Knowledge GraphsResponse Generation | —Unverified | 0 |
| CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions | Mar 25, 2025 | Response Generationtext similarity | CodeCode Available | 0 |
| Mitigating Reward Over-Optimization in RLHF via Behavior-Supported Regularization | Mar 23, 2025 | Reinforcement Learning (RL)Response Generation | —Unverified | 0 |
| GINGER: Grounded Information Nugget-Based Generation of Responses | Mar 23, 2025 | RAGResponse Generation | CodeCode Available | 0 |
| Conversational User-AI Intervention: A Study on Prompt Rewriting for Improved LLM Response Generation | Mar 21, 2025 | ChatbotResponse Generation | —Unverified | 0 |
| Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking | Mar 14, 2025 | AllLarge Language Model | CodeCode Available | 13 |
| FG-RAG: Enhancing Query-Focused Summarization with Context-Aware Fine-Grained Graph RAG | Mar 13, 2025 | DiversityQuery-focused Summarization | CodeCode Available | 0 |
| Intent-Aware Self-Correction for Mitigating Social Biases in Large Language Models | Mar 8, 2025 | Response Generation | —Unverified | 0 |
| Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models | Mar 5, 2025 | HallucinationInstruction Following | CodeCode Available | 11 |