SOTAVerified

Hallucination

Papers

Showing 701725 of 1816 papers

TitleStatusHype
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented GenerationCode0
Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs0
FactAlign: Long-form Factuality Alignment of Large Language ModelsCode1
LMOD: A Large Multimodal Ophthalmology Dataset and Benchmark for Large Vision-Language Models0
VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models0
ScVLM: Enhancing Vision-Language Model for Safety-Critical Event UnderstandingCode0
FaithEval: Can Your Language Model Stay Faithful to Context, Even If "The Moon is Made of Marshmallows"Code2
Ingest-And-Ground: Dispelling Hallucinations from Continually-Pretrained LLMs with RAG0
HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty DecodingCode0
Contrastive Token Learning with Similarity Decay for Repetition Suppression in Machine Translation0
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and MitigationCode0
MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models0
DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning0
HaloScope: Harnessing Unlabeled LLM Generations for Hallucination DetectionCode0
Enhancing Guardrails for Safe and Secure Healthcare AI0
Pre-trained Language Models Return Distinguishable Probability Distributions to Unfaithfully Hallucinated TextsCode0
RoleBreak: Character Hallucination as a Jailbreak Attack in Role-Playing Systems0
EventHallusion: Diagnosing Event Hallucinations in Video LLMsCode1
A Unified Hallucination Mitigation Framework for Large Vision-Language ModelsCode0
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting FrameworkCode0
XTRUST: On the Multilingual Trustworthiness of Large Language ModelsCode1
Planning in the Dark: LLM-Symbolic Planning Pipeline without Experts0
AsthmaBot: Multi-modal, Multi-Lingual Retrieval Augmented Generation For Asthma Patient Support0
Long-horizon Embodied Planning with Implicit Logical Inference and Hallucination Mitigation0
Show:102550
← PrevPage 29 of 73Next →

No leaderboard results yet.