SOTAVerified

Hallucination

Papers

Showing 13011325 of 1816 papers

TitleStatusHype
The Tower of Babel Revisited: Multilingual Jailbreak Prompts on Closed-Source Large Language Models0
The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations0
The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs0
The UniMelb Submission to the SIGMORPHON 2020 Shared Task 0: Typologically Diverse Morphological Inflection0
Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling0
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data0
Think More, Hallucinate Less: Mitigating Hallucinations via Dual Process of Fast and Slow Thinking0
Think Twice Before Trusting: Self-Detection for Large Language Models through Comprehensive Answer Reflection0
Thutmose Tagger: Single-pass neural model for Inverse Text Normalization0
Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice0
TLDR: Token-Level Detective Reward Model for Large Vision Language Models0
TN-Eval: Rubric and Evaluation Protocols for Measuring the Quality of Behavioral Therapy Notes0
Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation0
Tomographic Foundation Model -- FORCE: Flow-Oriented Reconstruction Conditioning Engine0
Comprehensive Evaluation of Large Language Models for Topic Modeling0
Toward Personalizing Quantum Computing Education: An Evolutionary LLM-Powered Approach0
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage0
Towards Analyzing and Mitigating Sycophancy in Large Vision-Language Models0
Towards a Reliable Offline Personal AI Assistant for Long Duration Spaceflight0
CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks0
Towards Clinical Encounter Summarization: Learning to Compose Discharge Summaries from Prior Notes0
Towards Detecting LLMs Hallucination via Markov Chain-based Multi-agent Debate Framework0
Towards Mitigating Hallucination in Large Language Models via Self-Reflection0
Towards Multi-Source Retrieval-Augmented Generation via Synergizing Reasoning and Preference-Driven Retrieval0
Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method0
Show:102550
← PrevPage 53 of 73Next →

No leaderboard results yet.