SOTAVerified

Hallucination

Papers

Showing 601-650 of 1816 papers

Title | Status | Hype
MAVEN-Fact: A Large-scale Event Factuality Detection Dataset | Code | 0
MCiteBench: A Multimodal Benchmark for Generating Text with Citations | Code | 0
Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Code | 0
From Artificial Needles to Real Haystacks: Improving Retrieval Capabilities in LLMs by Finetuning on Synthetic Data | Code | 0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language Models | Code | 0
Low to High Dimensional Modality Hallucination using Aggregated Fields of View | Code | 0
LVLM-Compress-Bench: Benchmarking the Broader Impact of Large Vision-Language Model Compression | Code | 0
MAF: Multi-Aspect Feedback for Improving Reasoning in Large Language Models | Code | 0
MedScore: Factuality Evaluation of Free-Form Medical Answers | Code | 0
Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits Calibration | Code | 0
On hallucinations in tomographic image reconstruction | Code | 0
LLM Internal States Reveal Hallucination Risk Faced With a Query | Code | 0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation | Code | 0
LLM Inference Enhanced by External Knowledge: A Survey | Code | 0
LLM-based Query Expansion Fails for Unfamiliar and Ambiguous Queries | Code | 0
LLMs and Memorization: On Quality and Specificity of Copyright Compliance | Code | 0
A Comparative Study on Language Models for Task-Oriented Dialogue Systems | Code | 0
LightHouse: A Survey of AGI Hallucination | Code | 0
Linear Correlation in LM's Compositional Generalization and Hallucination | Code | 0
Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient Conversations | Code | 0
Fine-tuning Large Language Models for Improving Factuality in Legal Question Answering | Code | 0
Learning with privileged information via adversarial discriminative modality distillation | Code | 0
Localizing and Mitigating Errors in Long-form Question Answering | Code | 0
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Code | 0
Learning Fine-grained Domain Generalization via Hyperbolic State Space Hallucination | Code | 0
Learning on LLM Output Signatures for gray-box LLM Behavior Analysis | Code | 0
Fine-grained Contract NER using instruction based model | Code | 0
AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation | Code | 0
Fidelity-Enriched Contrastive Search: Reconciling the Faithfulness-Diversity Trade-Off in Text Generation | Code | 0
Multi-Source Knowledge Pruning for Retrieval-Augmented Generation: A Benchmark and Empirical Study | Code | 0
Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP Concepts | Code | 0
Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models | Code | 0
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks | Code | 0
Few-shot learning via tensor hallucination | Code | 0
AILS-NTUA at SemEval-2024 Task 6: Efficient model tuning for hallucination detection and analysis | Code | 0
CiteBART: Learning to Generate Citations for Local Citation Recommendation | Code | 0
Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations | Code | 0
KG-FPQ: Evaluating Factuality Hallucination in LLMs with Knowledge Graph-based False Premise Questions | Code | 0
keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection | Code | 0
AIGCs Confuse AI Too: Investigating and Explaining Synthetic Image-induced Hallucinations in Large Vision-Language Models | Code | 0
Iterative Teaching by Data Hallucination | Code | 0
Investigating the performance of Retrieval-Augmented Generation and fine-tuning for the development of AI-driven knowledge-based systems | Code | 0
AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts | Code | 0
Investigating Multi-Pivot Ensembling with Massively Multilingual Machine Translation Models | Code | 0
Joint stereo 3D object detection and implicit surface reconstruction | Code | 0
Integrating Chemistry Knowledge in Large Language Models via Prompt Engineering | Code | 0
Assessing the Reliability of Large Language Model Knowledge | Code | 0
Instruction Makes a Difference | Code | 0
Incorporating Task-specific Concept Knowledge into Script Learning | Code | 0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Code | 0

No leaderboard results yet.