SOTAVerified

Hallucination

Papers

Showing 351400 of 1816 papers

TitleStatusHype
HaloQuest: A Visual Hallucination Dataset for Advancing Multimodal ReasoningCode1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information AssistantCode1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based ToolsCode1
Grounded Chain-of-Thought for Multimodal Large Language ModelsCode1
Detecting Hallucinated Content in Conditional Neural Sequence GenerationCode1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI FeedbackCode1
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short SummariesCode1
Mitigating Hallucination for Large Vision Language Model by Inter-Modality Correlation Calibration DecodingCode1
ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark DatasetCode1
GraphArena: Benchmarking Large Language Models on Graph Computational ProblemsCode1
JDocQA: Japanese Document Question Answering Dataset for Generative Language ModelsCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
FlySearch: Exploring how vision-language models exploreCode1
Generating Natural Language Proofs with Verifier-Guided SearchCode1
Mitigating Object Hallucinations via Sentence-Level Early InterventionCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQLCode1
Finding and Editing Multi-Modal Neurons in Pre-Trained TransformersCode1
Mitigating Open-Vocabulary Caption HallucinationsCode1
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph CompletionCode1
CuriousLLM: Elevating Multi-Document QA with Reasoning-Infused Knowledge Graph PromptingCode1
FineSurE: Fine-grained Summarization Evaluation using LLMsCode1
GeoBenchX: Benchmarking LLMs for Multistep Geospatial TasksCode1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative DecodingCode1
NOH-NMS: Improving Pedestrian Detection by Nearby Objects HallucinationCode1
DAMO: Data- and Model-aware Alignment of Multi-modal LLMsCode1
Automatic Curriculum Expert Iteration for Reliable LLM ReasoningCode1
FaithBench: A Diverse Hallucination Benchmark for Summarization by Modern LLMsCode1
Deficiency-Aware Masked Transformer for Video InpaintingCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
Dataset Distillation via FactorizationCode1
Federated Recommendation via Hybrid Retrieval Augmented GenerationCode1
DCR-Consistency: Divide-Conquer-Reasoning for Consistency Evaluation and Improvement of Large Language ModelsCode1
Doc2Query--: When Less is MoreCode1
FactAlign: Long-form Factuality Alignment of Large Language ModelsCode1
Selective Generation for Controllable Language ModelsCode1
Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird's Eye ViewCode1
Paths-over-Graph: Knowledge Graph Empowered Large Language Model ReasoningCode1
Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMsCode1
Chain of Natural Language Inference for Reducing Large Language Model Ungrounded HallucinationsCode1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic PapersCode1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesCode1
A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM OutputsCode1
BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language ModelsCode1
Face Hallucination via Split-Attention in Split-Attention NetworkCode1
FAIR GPT: A virtual consultant for research data management in ChatGPTCode1
Prevent the Language Model from being Overconfident in Neural Machine TranslationCode1
EventHallusion: Diagnosing Event Hallucinations in Video LLMsCode1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and MitigationCode1
Show:102550
← PrevPage 8 of 37Next →

No leaderboard results yet.