SOTAVerified

Hallucination

Papers

Showing 401425 of 1816 papers

TitleStatusHype
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesCode1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM OutputsCode1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion ModelsCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
FactAlign: Long-form Factuality Alignment of Large Language ModelsCode1
Face Hallucination via Split-Attention in Split-Attention NetworkCode1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic PapersCode1
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language ModelingCode1
Exploring the Transferability of Visual Prompting for Multimodal Large Language ModelsCode1
Extract Free Dense Misalignment from CLIPCode1
FAIR GPT: A virtual consultant for research data management in ChatGPTCode1
Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean DiscrepancyCode1
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and BeyondCode1
Evaluation and Analysis of Hallucination in Large Vision-Language ModelsCode1
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language GenerationCode1
Analyzing and Mitigating Object Hallucination in Large Vision-Language ModelsCode1
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for HallucinationsCode1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI FeedbackCode1
EventHallusion: Diagnosing Event Hallucinations in Video LLMsCode1
Benchmarking LLM Faithfulness in RAG with Evolving LeaderboardsCode1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and MitigationCode1
Detecting Hallucinated Content in Conditional Neural Sequence GenerationCode1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal RepresentationsCode1
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMsCode1
Evaluating Image Hallucination in Text-to-Image Generation with Question-AnsweringCode1
Show:102550
← PrevPage 17 of 73Next →

No leaderboard results yet.