SOTAVerified

Hallucination

Papers

Showing 251275 of 1816 papers

TitleStatusHype
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language ModelsCode1
Automated Multi-level Preference for MLLMsCode1
Generating Natural Language Proofs with Verifier-Guided SearchCode1
Know Or Not: a library for evaluating out-of-knowledge base robustnessCode1
LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video ReconstructionCode1
Knowledge Graph-based Retrieval-Augmented Generation for Schema MatchingCode1
Knowledge Graph-Enhanced Large Language Models via Path SelectionCode1
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition BenchmarkCode1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale DatasetCode1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-GenerationCode1
All in an Aggregated Image for In-Image LearningCode1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented GenerationCode1
Mitigating Multilingual Hallucination in Large Vision-Language ModelsCode1
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based ReasoningCode1
Distinguishing Ignorance from Error in LLM HallucinationsCode1
Alleviating Hallucinations of Large Language Models through Induced HallucinationsCode1
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning ModelsCode1
Doc2Query--: When Less is MoreCode1
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination DetectionCode1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion ModelsCode1
JDocQA: Japanese Document Question Answering Dataset for Generative Language ModelsCode1
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference LettersCode1
Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean DiscrepancyCode1
Invoke Interfaces Only When Needed: Adaptive Invocation for Large Language Models in Question AnsweringCode1
Show:102550
← PrevPage 11 of 73Next →

No leaderboard results yet.