SOTAVerified

Hallucination

Papers

Showing 251-300 of 1816 papers

Title | Status | Hype
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Code | 1
Hallucination Augmented Contrastive Learning for Multimodal Large Language Model | Code | 1
No-Reference Image Quality Assessment by Hallucinating Pristine Features | Code | 1
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models | Code | 1
Entity-level Factual Consistency of Abstractive Text Summarization | Code | 1
All in an Aggregated Image for In-Image Learning | Code | 1
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Code | 1
Entity-Based Knowledge Conflicts in Question Answering | Code | 1
Mitigating Multilingual Hallucination in Large Vision-Language Models | Code | 1
LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Code | 1
Enhancing Noise Robustness of Retrieval-Augmented Language Models with Adaptive Adversarial Training | Code | 1
Alleviating Hallucinations of Large Language Models through Induced Hallucinations | Code | 1
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling | Code | 1
Enhancing LLM's Cognition via Structurization | Code | 1
Let there be a clock on the beach: Reducing Object Hallucination in Image Captioning | Code | 1
Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus | Code | 1
Improving Large Language Models in Event Relation Logical Prediction | Code | 1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | Code | 1
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1
EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models | Code | 1
EmbodiedAgent: A Scalable Hierarchical Approach to Overcome Practical Challenge in Multi-Robot Control | Code | 1
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner | Code | 1
AtomR: Atomic Operator-Empowered Large Language Models for Heterogeneous Knowledge Reasoning | Code | 1
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation | Code | 1
Hallu-PI: Evaluating Hallucination in Multi-modal Large Language Models within Perturbed Inputs | Code | 1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Code | 1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Code | 1
Learning to Automate Follow-up Question Generation using Process Knowledge for Depression Triage on Reddit Posts | Code | 1
LiDAR-based 4D Occupancy Completion and Forecasting | Code | 1
Look, Compare, Decide: Alleviating Hallucination in Large Vision-Language Models via Multi-View Multi-Path Reasoning | Code | 1
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Code | 1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1
K-QA: A Real-World Medical Q&A Benchmark | Code | 1
Label Hallucination for Few-Shot Classification | Code | 1
LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Code | 1
Know Or Not: a library for evaluating out-of-knowledge base robustness | Code | 1
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality | Code | 1
Doc2Query--: When Less is More | Code | 1
Distinguishing Ignorance from Error in LLM Hallucinations | Code | 1
KoLA: Carefully Benchmarking World Knowledge of Large Language Models | Code | 1
Large Language Models are Versatile Decomposers: Decompose Evidence and Questions for Table-based Reasoning | Code | 1
A Survey of Hallucination in Large Foundation Models | Code | 1
Citation-Enhanced Generation for LLM-based Chatbots | Code | 1
DiffFuSR: Super-Resolution of all Sentinel-2 Multispectral Bands using Diffusion Models | Code | 1
Circuit Transformer: A Transformer That Preserves Logical Equivalence | Code | 1
DiaHalu: A Dialogue-level Hallucination Evaluation Benchmark for Large Language Models | Code | 1
Knowledge Graph-based Retrieval-Augmented Generation for Schema Matching | Code | 1
Knowledge Graph-Enhanced Large Language Models via Path Selection | Code | 1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Code | 1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools | Code | 1

No leaderboard results yet.