Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 1816 papers

Title	Date	Tasks	Status	Hype
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling	Nov 1, 2023	HallucinationKnowledge Distillation	CodeCode Available	4
Knowledge-tuning Large Language Models with Structured Medical Knowledge Bases for Reliable Response Generation in Chinese	Sep 8, 2023	Domain AdaptationHallucination	CodeCode Available	4
Do LLMs Possess a Personality? Making the MBTI Test an Amazing Evaluation for Large Language Models	Jul 30, 2023	HallucinationPrompt Engineering	CodeCode Available	4
Unleashing the Emergent Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration	Jul 11, 2023	HallucinationLogic Grid Puzzle	CodeCode Available	4
Multimodal Chain-of-Thought Reasoning in Language Models	Feb 2, 2023	HallucinationLanguage Modelling	CodeCode Available	4
ReAct: Synergizing Reasoning and Acting in Language Models	Oct 6, 2022	Decision MakingFact Verification	CodeCode Available	4
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language Models	May 22, 2025	BenchmarkingFairness	CodeCode Available	3
Verdict: A Library for Scaling Judge-Time Compute	Feb 25, 2025	Fact CheckingHallucination	CodeCode Available	3
Automated Hypothesis Validation with Agentic Sequential Falsifications	Feb 14, 2025	Decision MakingHallucination	CodeCode Available	3
VideoRoPE: What Makes for Good Video Rotary Position Embedding?	Feb 7, 2025	HallucinationPosition	CodeCode Available	3
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion	Dec 5, 2024	Contrastive LearningHallucination	CodeCode Available	3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent	Nov 5, 2024	BenchmarkingHallucination	CodeCode Available	3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems	Nov 5, 2024	HallucinationRAG	CodeCode Available	3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models	Oct 16, 2024	DiagnosticHallucination	CodeCode Available	3
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio	Oct 16, 2024	Hallucination	CodeCode Available	3
Graph-constrained Reasoning: Faithful Reasoning on Knowledge Graphs with Large Language Models	Oct 16, 2024	HallucinationKnowledge Graphs	CodeCode Available	3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Oct 9, 2024	BenchmarkingDecision Making	CodeCode Available	3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation	Aug 28, 2024	Computational EfficiencyHallucination	CodeCode Available	3
Graph Retrieval-Augmented Generation: A Survey	Aug 15, 2024	HallucinationRAG	CodeCode Available	3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework	Aug 2, 2024	BenchmarkingDataset Generation	CodeCode Available	3
Learning Dynamics of LLM Finetuning	Jul 15, 2024	Hallucination	CodeCode Available	3
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language Models	Jun 16, 2024	HallucinationHallucination Evaluation	CodeCode Available	3
CRAG -- Comprehensive RAG Benchmark	Jun 7, 2024	HallucinationLanguage Modelling	CodeCode Available	3
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models	May 23, 2024	HallucinationSentence	CodeCode Available	3
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language Processing	Apr 30, 2024	Computational EfficiencyHallucination	CodeCode Available	3

Show:10 25 50

← PrevPage 2 of 73Next →

No leaderboard results yet.