SOTAVerified

Hallucination

Papers

Showing 301–350 of 1816 papers

Title | Status | Hype
A Survey of Hallucination in Large Foundation Models | Code | 1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations | Code | 1
FlySearch: Exploring how vision-language models explore | Code | 1
Citation-Enhanced Generation for LLM-based Chatbots | Code | 1
Circuit Transformer: A Transformer That Preserves Logical Equivalence | Code | 1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Code | 1
FineSurE: Fine-grained Summarization Evaluation using LLMs | Code | 1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented Generation | Code | 1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information Assistant | Code | 1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based Tools | Code | 1
Federated Recommendation via Hybrid Retrieval Augmented Generation | Code | 1
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification | Code | 1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | Code | 1
Hallucinated Neural Radiance Fields in the Wild | Code | 1
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph Completion | Code | 1
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene Descriptions | Code | 1
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue | Code | 1
CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL | Code | 1
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short Summaries | Code | 1
ChartInsighter: An Approach for Mitigating Hallucination in Time-series Chart Summary Generation with A Benchmark Dataset | Code | 1
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text Generation | Code | 1
High-resolution Face Swapping via Latent Semantics Disentanglement | Code | 1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding | Code | 1
Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers | Code | 1
HyperPocket: Generative Point Cloud Completion | Code | 1
Generating Natural Language Proofs with Verifier-Guided Search | Code | 1
Harnessing GPT-4V(ision) for Insurance: A Preliminary Exploration | Code | 1
Extract Free Dense Misalignment from CLIP | Code | 1
Into the Unknown: Self-Learning Large Language Models | Code | 1
Introspective Planning: Aligning Robots' Uncertainty with Inherent Task Ambiguity | Code | 1
CR-LT-KGQA: A Knowledge Graph Question Answering Dataset Requiring Commonsense Reasoning and Long-Tail Knowledge | Code | 1
IterGen: Iterative Semantic-aware Structured LLM Generation with Backtracking | Code | 1
Joint Evaluation of Answer and Reasoning Consistency for Hallucination Detection in Large Reasoning Models | Code | 1
KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection | Code | 1
Face Hallucination via Split-Attention in Split-Attention Network | Code | 1
Exploring Hallucination of Large Multimodal Models in Video Understanding: Benchmark, Analysis and Mitigation | Code | 1
Context-aware Decoding Reduces Hallucination in Query-focused Summarization | Code | 1
Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning | Code | 1
Exploring the Transferability of Visual Prompting for Multimodal Large Language Models | Code | 1
FactAlign: Long-form Factuality Alignment of Large Language Models | Code | 1
Evaluation and Analysis of Hallucination in Large Vision-Language Models | Code | 1
Know Or Not: a library for evaluating out-of-knowledge base robustness | Code | 1
Evaluating the Quality of Hallucination Benchmarks for Large Vision-Language Models | Code | 1
Controllable Neural Dialogue Summarization with Personal Named Entity Planning | Code | 1
Label Hallucination for Few-Shot Classification | Code | 1
LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Code | 1
EventHallusion: Diagnosing Event Hallucinations in Video LLMs | Code | 1
Large Language Models for Multi-Robot Systems: A Survey | Code | 1
Evaluating Image Hallucination in Text-to-Image Generation with Question-Answering | Code | 1
Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers | Code | 1

No leaderboard results yet.