| Confidence-aware Denoised Fine-tuning of Off-the-shelf Models for Certified Robustness | Nov 13, 2024 | Adversarial Robustness, Denoising | Code Available | 0 | 5 |
| LLMs and Memorization: On Quality and Specificity of Copyright Compliance | May 28, 2024 | Hallucination, Memorization | Code Available | 0 | 5 |
| Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant | Sep 17, 2024 | Hallucination, Instruction Following | Code Available | 0 | 5 |
| Logic Query of Thoughts: Guiding Large Language Models to Answer Complex Logic Queries with Knowledge Graphs | Mar 17, 2024 | Hallucination, Knowledge Graphs | Code Available | 0 | 5 |
| LLM Inference Enhanced by External Knowledge: A Survey | May 30, 2025 | Hallucination, Knowledge Graphs | Code Available | 0 | 5 |
| LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Sep 30, 2024 | Code Generation, Hallucination | Code Available | 0 | 5 |
| LLM Internal States Reveal Hallucination Risk Faced With a Query | Jul 3, 2024 | Hallucination, Response Generation | Code Available | 0 | 5 |
| LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | May 19, 2025 | Hallucination, Retrieval | Code Available | 0 | 5 |
| Linear Correlation in LM's Compositional Generalization and Hallucination | Feb 6, 2025 | Hallucination | Code Available | 0 | 5 |
| Learning with privileged information via adversarial discriminative modality distillation | Oct 19, 2018 | Action Recognition, Hallucination | Code Available | 0 | 5 |
| A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Jan 21, 2022 | Dialogue State Tracking, Hallucination | Code Available | 0 | 5 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 | 5 |
| Fine-tuning Large Language Models for Improving Factuality in Legal Question Answering | Jan 11, 2025 | Hallucination, Question Answering | Code Available | 0 | 5 |
| Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations | Sep 24, 2021 | Hallucination | Code Available | 0 | 5 |
| Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Feb 8, 2025 | Conformal Prediction, Decision Making | Code Available | 0 | 5 |
| Localizing and Mitigating Errors in Long-form Question Answering | Jul 16, 2024 | Form, Hallucination | Code Available | 0 | 5 |
| Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination | Apr 10, 2025 | Domain Generalization, Hallucination | Code Available | 0 | 5 |
| Fine-grained Contract NER using instruction based model | Jan 24, 2024 | Few-Shot Learning, Hallucination | Code Available | 0 | 5 |
| AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Mar 14, 2025 | Abstractive Text Summarization, Chunking | Code Available | 0 | 5 |
| Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts | Aug 21, 2023 | Articles, Hallucination | Code Available | 0 | 5 |
| Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Oct 23, 2023 | Abstractive Text Summarization, Dialogue Generation | Code Available | 0 | 5 |
| Language Models Hallucinate, but May Excel at Fact Verification | Oct 23, 2023 | Fact Verification, Hallucination | Code Available | 0 | 5 |
| Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Sep 3, 2024 | Benchmarking, Hallucination | Code Available | 0 | 5 |
| Few-shot learning via tensor hallucination | Apr 19, 2021 | Data Augmentation, Few-Shot Learning | Code Available | 0 | 5 |
| AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis | Apr 1, 2024 | Binary Classification, Hallucination | Code Available | 0 | 5 |