SOTAVerified

Hallucination

Papers

Showing 376–400 of 1816 papers

Title | Status | Hype
Efficient Dynamic Clustering-Based Document Compression for Retrieval-Augmented-Generation | Code | 1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought Method | Code | 1
Automatic Curriculum Expert Iteration for Reliable LLM Reasoning | Code | 1
MedVH: Towards Systematic Evaluation of Hallucination for Large Vision Language Models in the Medical Context | Code | 1
MemLLM: Finetuning LLMs to Use An Explicit Read-Write Memory | Code | 1
EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset | Code | 1
Dataset Distillation via Factorization | Code | 1
MedChat: A Multi-Agent Framework for Multimodal Diagnosis with Large Language Models | Code | 1
ECBench: Can Multi-modal Foundation Models Understand the Egocentric World? A Holistic Embodied Cognition Benchmark | Code | 1
High-resolution Face Swapping via Latent Semantics Disentanglement | Code | 1
Lyra: Orchestrating Dual Correction in Automated Theorem Proving | Code | 1
Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges | Code | 1
How Language Model Hallucinations Can Snowball | Code | 1
Med-HALT: Medical Domain Hallucination Test for Large Language Models | Code | 1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded Hallucinations | Code | 1
DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception | Code | 1
BachGAN: High-Resolution Image Synthesis from Salient Object Layout | Code | 1
Balanced Classification: A Unified Framework for Long-Tailed Object Detection | Code | 1
DomainRAG: A Chinese Benchmark for Evaluating Domain-specific Retrieval-Augmented Generation | Code | 1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous Sources | Code | 1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs | Code | 1
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity | Code | 1
Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature Augmentation | Code | 1
Enhancing LLM's Cognition via Structurization | Code | 1
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information Flow | Code | 1

No leaderboard results yet.