SOTAVerified

Hallucination

Papers

Showing 15261550 of 1816 papers

TitleStatusHype
Sources of Hallucination by Large Language Models on Inference TasksCode1
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on WikipediaCode3
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language ModelsCode0
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuningCode0
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations0
How Language Model Hallucinations Can SnowballCode1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought MethodCode1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesCode1
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene HallucinationCode1
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought0
Appraising the Potential Uses and Harms of LLMs for Medical Systematic ReviewsCode0
Evaluating Object Hallucination in Large Vision-Language ModelsCode2
Is ChatGPT a Good Causal Reasoner? A Comprehensive EvaluationCode1
Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Image Segmentation0
Simple Token-Level Confidence Improves Caption Correctness0
Exploring Human-Like Translation Strategy with Large Language ModelsCode2
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short SummariesCode1
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation OncologyCode0
The Dark Side of ChatGPT: Legal and Ethical Challenges from Stochastic Parrots and Hallucination0
Using Mobile Data and Deep Models to Assess Auditory Verbal Hallucinations0
GPT-NER: Named Entity Recognition via Large Language ModelsCode2
Dual Stage Stylization Modulation for Domain Generalized Semantic Segmentation0
OVTrack: Open-Vocabulary Multiple Object TrackingCode1
Show:102550
← PrevPage 62 of 73Next →

No leaderboard results yet.