SOTAVerified

Hallucination Papers

Showing 1–50 of 1816 papers

Title | Status | Hype
MiniCPM-V: A GPT-4V Level MLLM on Your Phone | Code | 12
Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models | Code | 11
RLAIF-V: Open-Source AI Feedback Leads to Super GPT-4V Trustworthiness | Code | 11
SWIFT: A Scalable lightWeight Infrastructure for Fine-Tuning | Code | 11
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models | Code | 7
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? | Code | 7
RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback | Code | 6
Gorilla: Large Language Model Connected with Massive APIs | Code | 6
Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean | Code | 5
DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning | Code | 5
Uncertainty Quantification for Language Models: A Suite of Black-Box, White-Box, LLM Judge, and Ensemble Scorers | Code | 5
Weakly Supervised Detection of Hallucinations in LLM Activations | Code | 5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model | Code | 5
Ferret: Refer and Ground Anything Anywhere at Any Granularity | Code | 5
UQLM: A Python Package for Uncertainty Quantification in Large Language Models | Code | 5
Retrieval-Augmented Generation for Large Language Models: A Survey | Code | 4
Hallucination of Multimodal Large Language Models: A Survey | Code | 4
LettuceDetect: A Hallucination Detection Framework for RAG Applications | Code | 4
ReAct: Synergizing Reasoning and Acting in Language Models | Code | 4
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models | Code | 4
LLM-Enhanced Data Management | Code | 4
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling | Code | 4
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese | Code | 4
Multimodal Chain-of-Thought Reasoning in Language Models | Code | 4
Prismatic VLMs: Investigating the Design Space of Visually-Conditioned Language Models | Code | 4
Halu-J: Critique-Based Hallucination Judge | Code | 4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Code | 4
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration | Code | 4
G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering | Code | 4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges | Code | 4
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding | Code | 4
Retrieval Head Mechanistically Explains Long-Context Factuality | Code | 3
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models | Code | 3
EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models | Code | 3
ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement | Code | 3
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing | Code | 3
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models | Code | 3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation | Code | 3
Evaluating Hallucinations in Chinese Large Language Models | Code | 3
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models | Code | 3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion | Code | 3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models | Code | 3
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models | Code | 3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | Code | 3
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models | Code | 3
Learning Dynamics of LLM Finetuning | Code | 3
Automated Hypothesis Validation with Agentic Sequential Falsifications | Code | 3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making | Code | 3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation | Code | 3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models | Code | 3
Page 1 of 37
