SOTAVerified

Hallucination

Papers

Showing 476500 of 1816 papers

TitleStatusHype
Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation0
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling0
A Comparative Study of DSPy Teleprompter Algorithms for Aligning Large Language Models Evaluation Metrics to Human Evaluation0
Query pipeline optimization for cancer patient question answering systems0
Dehallucinating Parallel Context Extension for Retrieval-Augmented Generation0
Cracking the Code of Hallucination in LVLMs with Vision-aware Head Divergence0
Are LLMs Good Literature Review Writers? Evaluating the Literature Review Writing Ability of Large Language Models0
ReXTrust: A Model for Fine-Grained Hallucination Detection in AI-Generated Radiology Reports0
A MapReduce Approach to Effectively Utilize Long Context Information in Retrieval Augmented Language Models0
When to Speak, When to Abstain: Contrastive Decoding with Abstention0
What External Knowledge is Preferred by LLMs? Characterizing and Exploring Chain of Evidence in Imperfect Context0
A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity DetectionCode0
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial ReasoningCode2
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding0
Task-Oriented Dialog Systems for the Senegalese Wolof Language0
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models0
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning0
Accelerating Retrieval-Augmented Generation0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries0
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data0
TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering0
Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts0
Benchmarking large language models for materials synthesis: the case of atomic layer deposition0
Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning0
Filter-then-Generate: Large Language Models with Structure-Text Adapter for Knowledge Graph CompletionCode1
Show:102550
← PrevPage 20 of 73Next →

No leaderboard results yet.