SOTAVerified

Hallucination

Papers

Showing 176200 of 1816 papers

TitleStatusHype
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure PriorCode1
Hallucination Augmented Contrastive Learning for Multimodal Large Language ModelCode1
Hallucination Detection in LLMs Using Spectral Features of Attention MapsCode1
HallE-Control: Controlling Object Hallucination in Large Multimodal ModelsCode1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction DataCode1
BTR: Binary Token Representations for Efficient Retrieval Augmented Language ModelsCode1
ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic SegmentationCode1
Hallucinated Neural Radiance Fields in the WildCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language ModelsCode1
Generating Natural Language Proofs with Verifier-Guided SearchCode1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal RepresentationsCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language ModelsCode1
Analyzing and Mitigating Object Hallucination in Large Vision-Language ModelsCode1
FlySearch: Exploring how vision-language models exploreCode1
Phare: A Safety Probe for Large Language ModelsCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityCode1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language ModelsCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
Balanced Classification: A Unified Framework for Long-Tailed Object DetectionCode1
BachGAN: High-Resolution Image Synthesis from Salient Object LayoutCode1
FineSurE: Fine-grained Summarization Evaluation using LLMsCode1
Show:102550
← PrevPage 8 of 73Next →

No leaderboard results yet.