SOTAVerified

Hallucination

Papers

Showing 15011550 of 1816 papers

TitleStatusHype
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language ModelCode5
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction TuningCode2
Evidence for Reduced Sensory Precision and Increased Reliance on Priors in Hallucination-Prone Individuals in a General Population Sample0
IERL: Interpretable Ensemble Representation Learning -- Combining CrowdSourced Knowledge and Distributed Semantic Representations0
ToolQA: A Dataset for LLM Question Answering with External ToolsCode2
A Survey on Multimodal Large Language Models0
Hallucination is the last thing you need0
Vision Transformer with Attention Map Hallucination and FFN Compaction0
Are Large Language Models Really Good Logical Reasoners? A Comprehensive Evaluation and BeyondCode1
Pushing the Limits of ChatGPT on NLP Tasks0
Explaining Legal Concepts with Augmented Large Language Models (GPT-4)0
KoLA: Carefully Benchmarking World Knowledge of Large Language ModelsCode1
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language ModelsCode2
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene DescriptionsCode1
Trapping LLM Hallucinations Using Tagged Context Prompts0
Defocus to focus: Photo-realistic bokeh rendering by fusing defocus and radiance priors0
Efficient and Interpretable Compressive Text Summarisation with Unsupervised Dual-Agent Reinforcement LearningCode0
Do Language Models Know When They're Hallucinating References?Code0
An Investigation of Evaluation Metrics for Automated Medical Note GenerationCode0
AdaPlanner: Adaptive Planning from Feedback with Language ModelsCode1
Getting Sick After Seeing a Doctor? Diagnosing and Mitigating Knowledge Conflicts in Event Temporal ReasoningCode0
Enabling Large Language Models to Generate Text with CitationsCode2
Lawyer LLaMA Technical ReportCode2
Gorilla: Large Language Model Connected with Massive APIsCode6
RefGPT: Dialogue Generation of GPT, by GPT, and for GPTCode1
Sources of Hallucination by Large Language Models on Inference TasksCode1
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on WikipediaCode3
The Knowledge Alignment Problem: Bridging Human and External Knowledge for Large Language ModelsCode0
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuningCode0
mmT5: Modular Multilingual Pre-Training Solves Source Language Hallucinations0
How Language Model Hallucinations Can SnowballCode1
Element-aware Summarization with Large Language Models: Expert-aligned Evaluation and Chain-of-Thought MethodCode1
Chain-of-Knowledge: Grounding Large Language Models via Dynamic Knowledge Adapting over Heterogeneous SourcesCode1
Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene HallucinationCode1
HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language ModelsCode2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine TranslationCode2
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought0
Appraising the Potential Uses and Harms of LLMs for Medical Systematic ReviewsCode0
Evaluating Object Hallucination in Large Vision-Language ModelsCode2
Is ChatGPT a Good Causal Reasoner? A Comprehensive EvaluationCode1
Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Image Segmentation0
Simple Token-Level Confidence Improves Caption Correctness0
Exploring Human-Like Translation Strategy with Large Language ModelsCode2
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short SummariesCode1
Benchmarking ChatGPT-4 on ACR Radiation Oncology In-Training (TXIT) Exam and Red Journal Gray Zone Cases: Potentials and Challenges for AI-Assisted Medical Education and Decision Making in Radiation OncologyCode0
The Dark Side of ChatGPT: Legal and Ethical Challenges from Stochastic Parrots and Hallucination0
Using Mobile Data and Deep Models to Assess Auditory Verbal Hallucinations0
GPT-NER: Named Entity Recognition via Large Language ModelsCode2
Dual Stage Stylization Modulation for Domain Generalized Semantic Segmentation0
OVTrack: Open-Vocabulary Multiple Object TrackingCode1
Show:102550
← PrevPage 31 of 37Next →

No leaderboard results yet.