SOTAVerified

Hallucination

Papers

Showing 151200 of 1816 papers

TitleStatusHype
Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention LensCode2
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual CheckingCode2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationCode2
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference AlignmentCode2
Knowledge Graph-Guided Retrieval Augmented GenerationCode2
Calibrated Self-Rewarding Vision Language ModelsCode2
DeliLaw: A Chinese Legal Counselling System Based on a Large Language ModelCode2
PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric PruningCode2
Differential TransformerCode2
MLAgentBench: Evaluating Language Agents on Machine Learning ExperimentationCode2
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image DescriptionsCode2
MeMemo: On-device Retrieval Augmentation for Private and Personalized Text GenerationCode2
VideoAgent: Self-Improving Video GenerationCode2
High-resolution Face Swapping via Latent Semantics DisentanglementCode1
Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference ChallengesCode1
Harnessing GPT-4V(ision) for Insurance: A Preliminary ExplorationCode1
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-AugmentationCode1
How Language Model Hallucinations Can SnowballCode1
Antidote: A Unified Framework for Mitigating LVLM Hallucinations in Counterfactual Presupposition and Object PerceptionCode1
Adversarial Feature Hallucination Networks for Few-Shot LearningCode1
Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine TranslationCode1
BTR: Binary Token Representations for Efficient Retrieval Augmented Language ModelsCode1
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal ReasoningCode1
How well can a large language model explain business processes as perceived by users?Code1
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination EvaluationCode1
Advancing TTP Analysis: Harnessing the Power of Large Language Models with Retrieval Augmented GenerationCode1
Hallucination Detection in LLMs Using Spectral Features of Attention MapsCode1
3D Sketch-aware Semantic Scene Completion via Semi-supervised Structure PriorCode1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language ModelsCode1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction DataCode1
HallE-Control: Controlling Object Hallucination in Large Multimodal ModelsCode1
Hallucinated Neural Radiance Fields in the WildCode1
ADeLA: Automatic Dense Labeling with Attention for Viewpoint Adaptation in Semantic SegmentationCode1
Hallucination Augmented Contrastive Learning for Multimodal Large Language ModelCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language ModelsCode1
Generating Natural Language Proofs with Verifier-Guided SearchCode1
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal RepresentationsCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
A Data-Centric Approach To Generate Faithful and High Quality Patient Summaries with Large Language ModelsCode1
Analyzing and Mitigating Object Hallucination in Large Vision-Language ModelsCode1
FlySearch: Exploring how vision-language models exploreCode1
Phare: A Safety Probe for Large Language ModelsCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and InteractivityCode1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language ModelsCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
Balanced Classification: A Unified Framework for Long-Tailed Object DetectionCode1
BachGAN: High-Resolution Image Synthesis from Salient Object LayoutCode1
FineSurE: Fine-grained Summarization Evaluation using LLMsCode1
Show:102550
← PrevPage 4 of 37Next →

No leaderboard results yet.