SOTAVerified

Hallucination

Papers

Showing 351400 of 1816 papers

TitleStatusHype
LightLM: A Lightweight Deep and Narrow Language Model for Generative RecommendationCode1
FactCHD: Benchmarking Fact-Conflicting Hallucination DetectionCode1
LiDAR-based 4D Occupancy Completion and ForecastingCode1
Theory of Mind for Multi-Agent Collaboration via Large Language ModelsCode1
RegaVAE: A Retrieval-Augmented Gaussian Mixture Variational Auto-Encoder for Language ModelingCode1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic PapersCode1
Improving Large Language Models in Event Relation Logical PredictionCode1
"Kelly is a Warm Person, Joseph is a Role Model": Gender Biases in LLM-Generated Reference LettersCode1
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination DetectionCode1
Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic EnhancementCode1
OpsEval: A Comprehensive IT Operations Benchmark Suite for Large Language ModelsCode1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded HallucinationsCode1
AGIR: Automating Cyber Threat Intelligence Reporting with Natural Language GenerationCode1
HallE-Control: Controlling Object Hallucination in Large Multimodal ModelsCode1
BTR: Binary Token Representations for Efficient Retrieval Augmented Language ModelsCode1
LLM Lies: Hallucinations are not Bugs, but Features as Adversarial ExamplesCode1
Analyzing and Mitigating Object Hallucination in Large Vision-Language ModelsCode1
Robust 3D Object Detection from LiDAR-Radar Point Clouds via Cross-Modal Feature AugmentationCode1
Self-supervised Cross-view Representation Reconstruction for Change CaptioningCode1
Lyra: Orchestrating Dual Correction in Automated Theorem ProvingCode1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language ModelsCode1
Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data?Code1
Cognitive Mirage: A Review of Hallucinations in Large Language ModelsCode1
A Survey of Hallucination in Large Foundation ModelsCode1
Evaluation and Analysis of Hallucination in Large Vision-Language ModelsCode1
VIGC: Visual Instruction Generation and CorrectionCode1
PREFER: Prompt Ensemble Learning via Feedback-Reflect-RefineCode1
LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video ReconstructionCode1
Detecting and Preventing Hallucinations in Large Vision Language ModelsCode1
Balanced Classification: A Unified Framework for Long-Tailed Object DetectionCode1
Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training ModelCode1
Retrieval Augmented Generation and Representative Vector Summarization for large unstructured textual data in Medical EducationCode1
Transferable Decoding with Visual Entities for Zero-Shot Image CaptioningCode1
Med-HALT: Medical Domain Hallucination Test for Large Language ModelsCode1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based ToolsCode1
Selective Generation for Controllable Language ModelsCode1
Deficiency-Aware Masked Transformer for Video InpaintingCode1
Effective Prompt Extraction from Language ModelsCode1
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye ViewCode1
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and BeyondCode1
KoLA: Carefully Benchmarking World Knowledge of Large Language ModelsCode1
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene DescriptionsCode1
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
RefGPT: Dialogue Generation of GPT, by GPT, and for GPTCode1
Sources of Hallucination by Large Language Models on Inference TasksCode1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought MethodCode1
How Language Model Hallucinations Can SnowballCode1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesCode1
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene HallucinationCode1
Is ChatGPT a Good Causal Reasoner? A Comprehensive EvaluationCode1
Show:102550
← PrevPage 8 of 37Next →

No leaderboard results yet.