SOTAVerified

Hallucination

Papers

Showing 651675 of 1816 papers

TitleStatusHype
KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise QuestionsCode0
From Single to Multi: How LLMs Hallucinate in Multi-Document SummarizationCode0
Joint stereo 3D object detection and implicit surface reconstructionCode0
AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media TextsCode0
JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated ImagesCode0
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span DetectionCode0
Iterative Teaching by Data HallucinationCode0
Assessing the Reliability of Large Language Model KnowledgeCode0
Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation ModelsCode0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) ModelsCode0
Confidence Estimation for LLM-Based Dialogue State TrackingCode0
Chain of Visual Perception: Harnessing Multimodal Large Language Models for Zero-shot Camouflaged Object DetectionCode0
Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systemsCode0
Characterizing Multimodal Long-form Summarization: A Case Study on Financial ReportsCode0
Instruction Makes a DifferenceCode0
Characterizing Context Influence and Hallucination in SummarizationCode0
Improving Factual Error Correction by Learning to Inject Factual ErrorsCode0
Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful ComparatorsCode0
Incorporating Task-specific Concept Knowledge into Script LearningCode0
Integrating Chemistry Knowledge in Large Language Models via Prompt EngineeringCode0
Im2Avatar: Colorful 3D Reconstruction from a Single ImageCode0
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMsCode0
Im2Flow: Motion Hallucination from Static Images for Action RecognitionCode0
How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the WildCode0
A Claim Decomposition Benchmark for Long-form Answer VerificationCode0
Show:102550
← PrevPage 27 of 73Next →

No leaderboard results yet.