Hallucination

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–675 of 1816 papers

Title	Date	Tasks	Status	Hype
LargePiG: Your Large Language Model is Secretly a Pointer Generator	Oct 15, 2024	HallucinationLanguage Modeling	—Unverified	0
ReDeEP: Detecting Hallucination in Retrieval-Augmented Generation via Mechanistic Interpretability	Oct 15, 2024	HallucinationRAG	—Unverified	0
MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation	Oct 15, 2024	HallucinationLanguage Modeling	CodeCode Available	2
Automatically Generating Visual Hallucination Test Cases for Multimodal Large Language Models	Oct 15, 2024	HallucinationLarge Language Model	CodeCode Available	0
Have the VLMs Lost Confidence? A Study of Sycophancy in VLMs	Oct 15, 2024	Hallucination	—Unverified	0
On the Capacity of Citation Generation by Large Language Models	Oct 15, 2024	AttributeHallucination	—Unverified	0
Magnifier Prompt: Tackling Multimodal Hallucination via Extremely Simple Instructions	Oct 15, 2024	Hallucination	—Unverified	0
AGENTiGraph: An Interactive Knowledge Graph Platform for LLM-based Chatbots Utilizing Private Data	Oct 15, 2024	HallucinationKnowledge Graphs	—Unverified	0
Can Structured Data Reduce Epistemic Uncertainty?	Oct 14, 2024	HallucinationRetrieval	—Unverified	0
Parenting: Optimizing Knowledge Selection of Retrieval-Augmented Language Models with Parameter Decoupling and Tailored Tuning	Oct 14, 2024	HallucinationRAG	—Unverified	0
SkillAggregation: Reference-free LLM-Dependent Aggregation	Oct 14, 2024	ChatbotHallucination	—Unverified	0
Medico: Towards Hallucination Detection and Correction with Multi-source Evidence Fusion	Oct 14, 2024	Hallucination	—Unverified	0
VideoAgent: Self-Improving Video Generation	Oct 14, 2024	HallucinationVideo Generation	CodeCode Available	2
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language Models	Oct 13, 2024	HallucinationHallucination Evaluation	CodeCode Available	0
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code	Oct 13, 2024	Code GenerationHallucination	—Unverified	0
Honest AI: Fine-Tuning "Small" Language Models to Say "I Don't Know", and Reducing Hallucination in RAG	Oct 13, 2024	HallucinationRAG	—Unverified	0
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment	Oct 12, 2024	DiversityHallucination	—Unverified	0
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding	Oct 11, 2024	HallucinationMoment Retrieval	CodeCode Available	1
A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency Validation	Oct 11, 2024	HallucinationRAG	CodeCode Available	0
Measuring the Inconsistency of Large Language Models in Preferential Ranking	Oct 11, 2024	DiagnosticHallucination	—Unverified	0
PublicHearingBR: A Brazilian Portuguese Dataset of Public Hearing Transcripts for Summarization of Long Documents	Oct 10, 2024	ArticlesDocument Summarization	—Unverified	0
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering	Oct 10, 2024	HallucinationKnowledge Graphs	—Unverified	0
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning	Oct 10, 2024	HallucinationLogical Reasoning	CodeCode Available	1
OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting	Oct 10, 2024	Entity LinkingFew-Shot Learning	CodeCode Available	1
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts	Oct 10, 2024	Hallucination	—Unverified	0

Show:10 25 50

← PrevPage 27 of 73Next →

No leaderboard results yet.