SOTAVerified

Hallucination

Papers

Showing 5175 of 1816 papers

TitleStatusHype
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingCode3
PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative ModelsCode3
VideoRoPE: What Makes for Good Video Rotary Position Embedding?Code3
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on WikipediaCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation FrameworkCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
CRAG -- Comprehensive RAG BenchmarkCode3
Learning Dynamics of LLM FinetuningCode3
Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language ModelsCode3
RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language ModelsCode3
InstructGraph: Boosting Large Language Models via Graph-centric Instruction Tuning and Preference AlignmentCode2
Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step QuestionsCode2
Self-Introspective Decoding: Alleviating Hallucinations for Large Vision-Language ModelsCode2
CHiP: Cross-modal Hierarchical Direct Preference Optimization for Multimodal LLMsCode2
A Diffusion-Based Generative Equalizer for Music RestorationCode2
Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image DescriptionsCode2
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination MitigationCode2
KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual CheckingCode2
Show:102550
← PrevPage 3 of 73Next →

No leaderboard results yet.