SOTAVerified

Hallucination

Papers

Showing 10011050 of 1816 papers

TitleStatusHype
LongHalQA: Long-Context Hallucination Evaluation for MultiModal Large Language ModelsCode0
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment0
Measuring the Inconsistency of Large Language Models in Preferential Ranking0
A Methodology for Evaluating RAG Systems: A Case Study On Configuration Dependency ValidationCode0
LatteCLIP: Unsupervised CLIP Fine-Tuning via LMM-Synthetic Texts0
PublicHearingBR: A Brazilian Portuguese Dataset of Public Hearing Transcripts for Summarization of Long Documents0
Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering0
Utilize the Flow before Stepping into the Same River Twice: Certainty Represented Knowledge Flow for Refusal-Aware Instruction TuningCode0
From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models0
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning0
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models0
Listening to Patients: A Framework of Detecting and Mitigating Patient Misreport for Medical Dialogue Generation0
EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment0
AI-Enhanced Ethical Hacking: A Linux-Focused Experiment0
TLDR: Token-Level Detective Reward Model for Large Vision Language Models0
DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination0
Mitigating Hallucinations Using Ensemble of Knowledge Graph and Vector Store in Large Language Models to Enhance Mental Health Support0
DiDOTS: Knowledge Distillation from Large-Language-Models for Dementia Obfuscation in Transcribed Speech0
TUBench: Benchmarking Large Vision-Language Models on Trustworthiness with Unanswerable QuestionsCode0
Auto-GDA: Automatic Domain Adaptation for Efficient Grounding Verification in Retrieval Augmented Generation0
SAG: Style-Aligned Article Generation via Model Collaboration0
Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) ModelsCode0
FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs0
Salient Information Prompting to Steer Content in Prompt-based Abstractive SummarizationCode0
Characterizing Context Influence and Hallucination in SummarizationCode0
Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration0
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs0
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented GenerationCode0
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event UnderstandingCode0
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty DecodingCode0
Contrastive Token Learning with Similarity Decay for Repetition Suppression in Machine Translation0
Ingest-And-Ground: Dispelling Hallucinations from Continually-Pretrained LLMs with RAG0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and MitigationCode0
MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models0
DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning0
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination DetectionCode0
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated TextsCode0
RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems0
Enhancing Guardrails for Safe and Secure Healthcare AI0
A Unified Hallucination Mitigation Framework for Large Vision-Language ModelsCode0
Enhancing Text-to-SQL Capabilities of Large Language Models via Domain Database Knowledge Injection0
Long-horizon Embodied Planning with Implicit Logical Inference and Hallucination Mitigation0
AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts0
A Preliminary Study of o1 in Medicine: Are We Closer to an AI Doctor?0
Parse Trees Guided LLM Prompt CompressionCode0
Enhancing Scientific Reproducibility Through Automated BioCompute Object Creation Using Retrieval-Augmented Generation from Publications0
Show:102550
← PrevPage 21 of 37Next →

No leaderboard results yet.