| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Feb 8, 2025 | Conformal Prediction, Decision Making | Code Available | 0 |
| Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasks | Feb 7, 2025 | Abstractive Text Summarization, Explanation Generation | Code Available | 0 |
| ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework | Feb 7, 2025 | Hallucination, Specificity | Unverified | 0 |
| Enhancing Hallucination Detection through Noise Injection | Feb 6, 2025 | Hallucination | Unverified | 0 |
| Linear Correlation in LM's Compositional Generalization and Hallucination | Feb 6, 2025 | Hallucination | Code Available | 0 |
| TruthFlow: Truthful LLM Generation via Representation Flow Correction | Feb 6, 2025 | Hallucination, TruthfulQA | Unverified | 0 |
| A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Feb 5, 2025 | Hallucination, Spatial Reasoning | Unverified | 0 |
| Mitigating Object Hallucinations in Large Vision-Language Models via Attention Calibration | Feb 4, 2025 | Attribute, Hallucination | Unverified | 0 |
| Eliciting Language Model Behaviors with Investigator Agents | Feb 3, 2025 | Bayesian Inference, Hallucination | Unverified | 0 |
| Assessing the use of Diffusion models for motion artifact correction in brain MRI | Feb 3, 2025 | Diagnostic, Hallucination | Unverified | 0 |