SOTAVerified

Hallucination

Papers

Showing 626 to 650 of 1816 papers

Title | Status | Hype
Learning to Generate and Evaluate Fact-checking Explanations with Transformers | - | 0
A Survey of Hallucination in Large Visual Language Models | - | 0
Hallucination Detox: Sensitivity Dropout (SenD) for Large Language Model Training | - | 0
Explaining Graph Neural Networks with Large Language Models: A Counterfactual Perspective for Molecular Property Prediction | Code | 0
Coarse-to-Fine Highlighting: Reducing Knowledge Hallucination in Large Language Models | - | 0
Good Parenting is all you need -- Multi-agentic LLM Hallucination Mitigation | - | 0
ELOQ: Resources for Enhancing LLM Detection of Out-of-Scope Questions | Code | 0
Paths-over-Graph: Knowledge Graph Empowered Large Language Model Reasoning | Code | 1
ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries | - | 0
From Single to Multi: How LLMs Hallucinate in Multi-Document Summarization | Code | 0
MCQG-SRefine: Multiple Choice Question Generation and Evaluation with Iterative Self-Critique, Correction, and Comparison Feedback | Code | 0
Mitigating Hallucinations in Large Vision-Language Models via Summary-Guided Decoding | Code | 1
Utilizing Large Language Models in an iterative paradigm with domain feedback for zero-shot molecule optimization | - | 0
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMs | Code | 1
RosePO: Aligning LLM-based Recommenders with Human Values | - | 0
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | Code | 3
On A Scale From 1 to 5: Quantifying Hallucination in Faithfulness Evaluation | - | 0
When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems | - | 0
Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning | - | 0
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models | Code | 3
What Do LLMs Need to Understand Graphs: A Survey of Parametric Representation of Graphs | - | 0
Controlled Automatic Task-Specific Synthetic Data Generation for Hallucination Detection | - | 0
A Claim Decomposition Benchmark for Long-form Answer Verification | Code | 0
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio | Code | 3
Search Engines in an AI Era: The False Promise of Factual and Verifiable Source-Cited Responses | Code | 1

No leaderboard results yet.