SOTAVerified

Hallucination

Papers

Showing 13811390 of 1816 papers

TitleStatusHype
Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM ResponsesCode0
GROUNDHOG: Grounding Large Language Models to Holistic Segmentation0
Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models0
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation0
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMsCode0
Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware0
Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models0
CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean0
UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language ModelsCode0
Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer0
Show:102550
← PrevPage 139 of 182Next →

No leaderboard results yet.