SOTAVerified

Hallucination

Papers

Showing 16011625 of 1816 papers

TitleStatusHype
Leveraging Pretrained Models for Automatic Summarization of Doctor-Patient ConversationsCode0
NormSAGE: Multi-Lingual Multi-Cultural Norm Discovery from Conversations On-the-FlyCode0
DAHL: Domain-specific Automated Hallucination Evaluation of Long-Form Text through a Benchmark Dataset in BiomedicineCode0
NVP-HRI: Zero Shot Natural Voice and Posture-based Human-Robot Interaction via Large Language ModelCode0
Learning with privileged information via adversarial discriminative modality distillationCode0
Self-Rationalization in the Wild: A Large Scale Out-of-Distribution Evaluation on NLI-related tasksCode0
Object Hallucination in Image CaptioningCode0
DAFNet: Dynamic Auxiliary Fusion for Sequential Model Editing in Large Language ModelsCode0
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMsCode0
Localizing and Mitigating Errors in Long-form Question AnsweringCode0
Self-training Large Language Models through Knowledge DetectionCode0
Fine-grained Contract NER using instruction based modelCode0
Learning on LLM Output Signatures for gray-box LLM Behavior AnalysisCode0
Semantic Noise Matters for Neural Natural Language GenerationCode0
Learning Fine-grained Domain Generalization via Hyperbolic State Space HallucinationCode0
Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language ModelsCode0
Large Language Models on Wikipedia-Style Survey Generation: an Evaluation in NLP ConceptsCode0
Cross-modal Learning by Hallucinating Missing Modalities in RGB-D VisionCode0
SemEval-2025 Task 3: Mu-SHROOM, the Multilingual Shared Task on Hallucinations and Related Observable Overgeneration MistakesCode0
Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak AttacksCode0
Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward AlignmentCode0
BordIRlines: A Dataset for Evaluating Cross-lingual Retrieval-Augmented GenerationCode0
On hallucinations in tomographic image reconstructionCode0
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language ModelsCode0
On Large Language Models' Hallucination with Regard to Known FactsCode0
Show:102550
← PrevPage 65 of 73Next →

No leaderboard results yet.