SOTAVerified

Hallucination

Papers

Showing 801825 of 1816 papers

TitleStatusHype
FRAG: A Flexible Modular Framework for Retrieval-Augmented Generation based on Knowledge Graphs0
ArxEval: Evaluating Retrieval and Generation in Language Models for Scientific Literature0
A Survey on Responsible LLMs: Inherent Risk, Malicious Use, and Mitigation Strategy0
HALoGEN: Fantastic LLM Hallucinations and Where to Find Them0
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective0
MedCT: A Clinical Terminology Graph for Generative AI Applications in Healthcare0
Fine-tuning Large Language Models for Improving Factuality in Legal Question AnsweringCode0
Hermit Kingdom Through the Lens of Multiple Perspectives: A Case Study of LLM Hallucination on North Korea0
Seeing with Partial Certainty: Conformal Prediction for Robotic Scene Recognition in Built Environments0
Feedback-Driven Vision-Language Alignment with Minimal Human Supervision0
RAG-Check: Evaluating Multimodal Retrieval Augmented Generation Performance0
EAGLE: Enhanced Visual Grounding Minimizes Hallucinations in Instructional Multimodal Models0
Socratic Questioning: Learn to Self-guide Multimodal Reasoning in the WildCode0
Foundations of GenIR0
FlippedRAG: Black-Box Opinion Manipulation Adversarial Attacks to Retrieval-Augmented Generation Models0
CHAIR -- Classifier of Hallucination as ImproverCode0
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries0
CarbonChat: Large Language Model-Based Corporate Carbon Emission Analysis and Climate Knowledge Q&A System0
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking0
Large Language Model-Enhanced Symbolic Reasoning for Knowledge Base Completion0
Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection0
RRHF-V: Ranking Responses to Mitigate Hallucinations in Multimodal Large Language Models with Human FeedbackCode0
IllusionBench: A Large-scale and Comprehensive Benchmark for Visual Illusion Understanding in Vision-Language Models0
POPEN: Preference-Based Optimization and Ensemble for LVLM-Based Reasoning Segmentation0
VL-RewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models0
Show:102550
← PrevPage 33 of 73Next →

No leaderboard results yet.