Hallucination

Papers

Showing 251–300 of 1816 papers

Title	Date	Tasks	Status	Hype
Automated Review Generation Method Based on Large Language Models	Jul 30, 2024	Articles, Hallucination	Code Available	1
Enhancing LLM's Cognition via Structurization	Jul 23, 2024	Hallucination, Hallucination Evaluation	Code Available	1
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal Reasoning	Jul 22, 2024	Benchmarking, Hallucination	Code Available	1
Synergistic Multi-Agent Framework with Trajectory Learning for Knowledge-Intensive Tasks	Jul 13, 2024	Hallucination, Navigate	Code Available	1
Multi-Object Hallucination in Vision-Language Models	Jul 8, 2024	Hallucination, Object Hallucination	Code Available	1
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?	Jul 5, 2024	Hallucination, Image Generation	Code Available	1
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context	Jul 3, 2024	Hallucination, Response Generation	Code Available	1
FineSurE: Fine-grained Summarization Evaluation using LLMs	Jul 1, 2024	Benchmarking, Hallucination	Code Available	1
Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models	Jun 30, 2024	Hallucination, multimodal interaction	Code Available	1
GraphArena: Benchmarking Large Language Models on Graph Computational Problems	Jun 29, 2024	Benchmarking, Hallucination	Code Available	1
ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models	Jun 28, 2024	Diagnostic, Hallucination	Code Available	1
Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models	Jun 24, 2024	Hallucination	Code Available	1
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models	Jun 24, 2024	Common Sense Reasoning, Hallucination	Code Available	1
Knowledge Graph-Enhanced Large Language Models via Path Selection	Jun 19, 2024	Hallucination, Knowledge Graphs	Code Available	1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding	Jun 18, 2024	Hallucination	Code Available	1
Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector	Jun 17, 2024	2k, Hallucination	Code Available	1
MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts	Jun 17, 2024	Hallucination, Mixture-of-Experts	Code Available	1
MMRel: A Relation Understanding Benchmark in the MLLM Era	Jun 13, 2024	Diversity, Hallucination	Code Available	1
We Have a Package for You! A Comprehensive Analysis of Package Hallucinations by Code Generating LLMs	Jun 12, 2024	Code Generation, Hallucination	Code Available	1
REAL Sampling: Boosting Factuality and Diversity of Open-Ended Generation via Asymptotic Entropy	Jun 11, 2024	Diversity, Hallucination	Code Available	1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation	Jun 9, 2024	Common Sense Reasoning, Denoising	Code Available	1
An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models	Jun 7, 2024	Hallucination, parameter-efficient fine-tuning	Code Available	1
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training	May 31, 2024	Hallucination, Multi-Task Learning	Code Available	1
TimeChara: Evaluating Point-in-Time Character Hallucination of Role-Playing Large Language Models	May 28, 2024	Hallucination	Code Available	1
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization	May 28, 2024	Hallucination	Code Available	1
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception	May 24, 2024	Hallucination	Code Available	1
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs	May 24, 2024	Hallucination, Response Generation	Code Available	1
The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG)	May 21, 2024	Hallucination, RAG	Code Available	1
Automated Multi-level Preference for MLLMs	May 18, 2024	Dataset Generation, Hallucination	Code Available	1
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling	May 16, 2024	Contrastive Learning, Hallucination	Code Available	1
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language Models	May 8, 2024	Attribute, Data Augmentation	Code Available	1
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification	Apr 30, 2024	Code Generation, Hallucination	Code Available	1
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented Generation	Apr 22, 2024	Hallucination, RAG	Code Available	1
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language Models	Apr 22, 2024	Hallucination, Informativeness	Code Available	1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback	Apr 22, 2024	Attribute, Hallucination	Code Available	1
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models	Apr 17, 2024	Hallucination, Multimodal Reasoning	Code Available	1
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory	Apr 17, 2024	Hallucination, Language Modeling	Code Available	1
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations	Apr 15, 2024	Benchmarking, Bias Detection	Code Available	1
Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration	Apr 15, 2024	Hallucination	Code Available	1
Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs	Apr 15, 2024	Hallucination, Language Modeling	Code Available	1
CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph Prompting	Apr 13, 2024	Hallucination, Knowledge Graphs	Code Available	1
Tackling Structural Hallucination in Image Translation with Local Diffusion	Apr 9, 2024	Hallucination, Image Generation	Code Available	1
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner	Mar 28, 2024	Hallucination	Code Available	1
Retrieval-enhanced Knowledge Editing in Language Models for Multi-Hop Question Answering	Mar 28, 2024	Hallucination, In-Context Learning	Code Available	1
JDocQA: Japanese Document Question Answering Dataset for Generative Language Models	Mar 28, 2024	Hallucination, Question Answering	Code Available	1
UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction	Mar 25, 2024	Hallucination, Text Generation	Code Available	1
Pensieve: Retrospect-then-Compare Mitigates Visual Hallucination	Mar 21, 2024	Hallucination, MME	Code Available	1
What if...?: Thinking Counterfactual Keywords Helps to Mitigate Hallucination in Large Multi-modal Models	Mar 20, 2024	counterfactual, Hallucination	Code Available	1
PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset	Mar 17, 2024	Attribute, Common Sense Reasoning	Code Available	1
Circuit Transformer: A Transformer That Preserves Logical Equivalence	Mar 14, 2024	Hallucination	Code Available	1

