SOTAVerified

Hallucination

Papers

Showing 451500 of 1816 papers

TitleStatusHype
Object Hallucination in Image CaptioningCode0
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-FlyCode0
A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity DetectionCode0
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language ModelCode0
Noise Augmented Fine Tuning for Mitigating Hallucinations in Large Language ModelsCode0
NoiseBoost: Alleviating Hallucination with Noise Perturbation for Multimodal Large Language ModelsCode0
NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SeflCheckGPTCode0
NGEP: A Graph-based Event Planning Framework for Story GenerationCode0
Adversarial Semantic Hallucination for Domain Generalized Semantic SegmentationCode0
Navigating Noisy Feedback: Enhancing Reinforcement Learning with Error-Prone Language ModelsCode0
Multimodal Preference Data Synthetic Alignment with Reward ModelCode0
Multimodal Survival Modeling in the Age of Foundation ModelsCode0
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM HallucinationsCode0
Embedding Hallucination for Few-Shot Language Fine-tuningCode0
Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScoreCode0
Elevating Legal LLM Responses: Harnessing Trainable Logical Structures and Semantic Knowledge with Legal ReasoningCode0
Modality Distillation with Multiple Stream Networks for Action RecognitionCode0
Anticipation-Free Training for Simultaneous Machine TranslationCode0
MMBoundary: Advancing MLLM Knowledge Boundary Awareness through Reasoning Step Confidence CalibrationCode0
Multi-party Goal Tracking with LLMs: Comparing Pre-training, Fine-tuning, and Prompt EngineeringCode0
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement LearningCode0
Effectively Enhancing Vision Language Large Models by Prompt Augmentation and Caption UtilizationCode0
Editing Factual Knowledge and Explanatory Ability of Medical Large Language ModelsCode0
An Inflectional Database for GitksanCode0
Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language ModelsCode0
Bridging the Visual Gap: Fine-Tuning Multimodal Models with Knowledge-Adapted CaptionsCode0
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language ModelsCode0
An Investigation of Evaluation Metrics for Automated Medical Note GenerationCode0
DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language ModelsCode0
Mitigating Hallucination of Large Vision-Language Models via Dynamic Logits CalibrationCode0
Mitigating Entity-Level Hallucination in Large Language ModelsCode0
ANHALTEN: Cross-Lingual Transfer for German Token-Level Reference-Free Hallucination DetectionCode0
Brain MRI Image Super Resolution using Phase Stretch Transform and Transfer LearningCode0
DO-RAG: A Domain-Specific QA Framework Using Knowledge Graph-Enhanced Retrieval-Augmented GenerationCode0
A New Benchmark and Reverse Validation Method for Passage-level Hallucination DetectionCode0
Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward AlignmentCode0
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented GenerationCode0
"Merge Conflicts!" Exploring the Impacts of External Distractors to Parametric Knowledge GraphsCode0
Mitigating Hallucination in Abstractive Summarization with Domain-Conditional Mutual InformationCode0
MedTSS: transforming abstractive summarization of scientific articles with linguistic analysis and concept reinforcementCode0
Do Language Models Know When They're Hallucinating References?Code0
MedScore: Factuality Evaluation of Free-Form Medical AnswersCode0
MELO: Enhancing Model Editing with Neuron-Indexed Dynamic LoRACode0
BioKGBench: A Knowledge Graph Checking Benchmark of AI Agent for Biomedical ScienceCode0
Diving Deep into Modes of Fact Hallucinations in Dialogue SystemsCode0
Mechanistic Understanding and Mitigation of Language Model Non-Factual HallucinationsCode0
MedHallTune: An Instruction-Tuning Benchmark for Mitigating Medical Hallucination in Vision-Language ModelsCode0
Mitigating Hallucination in Fictional Character Role-PlayCode0
Addressing Topic Granularity and Hallucination in Large Language Models for Topic ModellingCode0
Machine Translation Hallucination Detection for Low and High Resource Languages using Large Language ModelsCode0
Show:102550
← PrevPage 10 of 37Next →

No leaderboard results yet.