SOTAVerified

Hallucination

Papers

Showing 941950 of 1816 papers

TitleStatusHype
Exploring the Knowledge Mismatch Hypothesis: Hallucination Propensity in Small Models Fine-tuned on Data from Larger Models0
Improbable Bigrams Expose Vulnerabilities of Incomplete Tokens in Byte-Level Tokenizers0
VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning0
EF-LLM: Energy Forecasting LLM with AI-assisted Automation, Enhanced Sparse Prediction, Hallucination Detection0
Unified Triplet-Level Hallucination Evaluation for Large Vision-Language ModelsCode0
Beyond Ontology in Dialogue State Tracking for Goal-Oriented ChatbotCode0
FactBench: A Dynamic Benchmark for In-the-Wild Language Model Factuality Evaluation0
MARCO: Multi-Agent Real-time Chat Orchestration0
A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges0
A Debate-Driven Experiment on LLM Hallucinations and Accuracy0
Show:102550
← PrevPage 95 of 182Next →

No leaderboard results yet.