SOTAVerified

Hallucination

Papers

Showing 276300 of 1816 papers

TitleStatusHype
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge ReasoningCode1
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text GenerationCode1
Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed InputsCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction DataCode1
Harnessing Large Language Models for Knowledge Graph Question Answering via Adaptive Multi-Aspect Retrieval-AugmentationCode1
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination DetectionCode1
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene DescriptionsCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
FlySearch: Exploring how vision-language models exploreCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
FineSurE: Fine-grained Summarization Evaluation using LLMsCode1
Collaborative Large Language Model for Recommender SystemsCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
A Survey of Hallucination in Large Foundation ModelsCode1
Citation-Enhanced Generation for LLM-based ChatbotsCode1
Federated Recommendation via Hybrid Retrieval Augmented GenerationCode1
Circuit Transformer: A Transformer That Preserves Logical EquivalenceCode1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative DecodingCode1
Cognitive Mirage: A Review of Hallucinations in Large Language ModelsCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph CompletionCode1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information AssistantCode1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based ToolsCode1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic PapersCode1
Show:102550
← PrevPage 12 of 73Next →

No leaderboard results yet.