SOTAVerified

Hallucination

Papers

Showing 1351–1400 of 1816 papers

| Title | Status | Hype |
|---|---|---|
| Crafting In-context Examples according to LMs' Parametric Knowledge | Code | 0 |
| Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization | Code | 1 |
| How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities | Code | 0 |
| Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification | Code | 0 |
| Enhancing Emergency Decision-making with Knowledge Graphs and Large Language Models | | 0 |
| Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models | | 0 |
| Insights into Classifying and Mitigating LLMs' Hallucinations | | 0 |
| Predicting Text Preference Via Structured Comparative Reasoning | | 0 |
| Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision | Code | 1 |
| AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation | Code | 1 |
| Finding and Editing Multi-Modal Neurons in Pre-Trained Transformers | Code | 1 |
| GPT-4V(ision) as A Social Media Analysis Engine | | 0 |
| Hallucination Augmented Recitations for Language Models | | 0 |
| Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models | Code | 0 |
| Hallucination-minimized Data-to-answer Framework for Financial Decision-makers | | 0 |
| A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions | Code | 2 |
| CBSiMT: Mitigating Hallucination in Simultaneous Machine Translation with Weighted Prefix-to-Prefix Training | | 0 |
| Holistic Analysis of Hallucination in GPT-4V(ision): Bias and Interference Challenges | Code | 1 |
| ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models | | 0 |
| SAC3: Reliable Hallucination Detection in Black-Box Language Models via Semantic-aware Cross-check Consistency | Code | 1 |
| CRUSH4SQL: Collective Retrieval Using Schema Hallucination For Text2SQL | Code | 1 |
| Collaborative Large Language Model for Recommender Systems | Code | 1 |
| Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism | | 0 |
| Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling | Code | 4 |
| Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment | Code | 0 |
| Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization | Code | 0 |
| Sequence-Level Certainty Reduces Hallucination In Knowledge-Grounded Dialogue Generation | | 0 |
| N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics | | 0 |
| Virtual Accessory Try-On via Keypoint Hallucination | | 0 |
| LightLM: A Lightweight Deep and Narrow Language Model for Generative Recommendation | Code | 1 |
| Critic-Driven Decoding for Mitigating Hallucinations in Data-to-text Generation | Code | 0 |
| Correction with Backtracking Reduces Hallucination in Summarization | Code | 0 |
| Learned, uncertainty-driven adaptive acquisition for photon-efficient scanning microscopy | | 0 |
| Woodpecker: Hallucination Correction for Multimodal Large Language Models | Code | 2 |
| Hallucination Detection for Grounded Instruction Generation | | 0 |
| Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Code | 0 |
| Language Models Hallucinate, but May Excel at Fact Verification | Code | 0 |
| HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Code | 2 |
| Unleashing the potential of prompt engineering for large language models | | 0 |
| Chainpoll: A high efficacy method for LLM hallucination detection | Code | 0 |
| Long-Form Speech Translation through Segmentation with Finite-State Decoding Constraints on Large Language Models | | 0 |
| Reliable Academic Conference Question Answering: A Study Based on Large Language Model | Code | 0 |
| MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Code | 0 |
| ReEval: Automatic Hallucination Evaluation for Retrieval-Augmented Large Language Models via Transferable Adversarial Attacks | | 0 |
| Know Where to Go: Make LLM a Relevant, Responsible, and Trustworthy Searcher | | 0 |
| FactCHD: Benchmarking Fact-Conflicting Hallucination Detection | Code | 1 |
| LiDAR-based 4D Occupancy Completion and Forecasting | Code | 1 |
| Theory of Mind for Multi-Agent Collaboration via Large Language Models | Code | 1 |
| Towards reducing hallucination in extracting information from financial reports using Large Language Models | | 0 |
| Flow Dynamics Correction for Action Recognition | | 0 |

No leaderboard results yet.