SOTAVerified

Hallucination

Papers

Showing 4150 of 1816 papers

TitleStatusHype
RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon GenerationCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language ModelsCode3
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language ModelsCode3
Learning Dynamics of LLM FinetuningCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
CRAG -- Comprehensive RAG BenchmarkCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
AudioTrust: Benchmarking the Multifaceted Trustworthiness of Audio Large Language ModelsCode3
Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning AgentCode3
Show:102550
← PrevPage 5 of 182Next →

No leaderboard results yet.