SOTAVerified

Hallucination

Papers

Showing 5175 of 1816 papers

TitleStatusHype
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language ModelsCode3
Graph Retrieval-Augmented Generation: A SurveyCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Automated Hypothesis Validation with Agentic Sequential FalsificationsCode3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative ModelsCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
When Large Language Models Meet Vector Databases: A SurveyCode3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsCode3
CRAG -- Comprehensive RAG BenchmarkCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
Evaluating Hallucinations in Chinese Large Language ModelsCode3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language ModelsCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingCode3
Learning Dynamics of LLM FinetuningCode3
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference AlignmentCode2
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step QuestionsCode2
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMsCode2
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language ModelsCode2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationCode2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
A Diffusion-Based Generative Equalizer for Music RestorationCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language ModelsCode2
Show:102550
← PrevPage 3 of 73Next →

No leaderboard results yet.