SOTAVerified

Hallucination

Papers

Showing 1376–1400 of 1816 papers

| Title | Status | Hype |
| Navigating Hallucinations for Reasoning of Unintentional Activities | | 0 |
| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Code | 0 |
| Collaborative decoding of critical tokens for boosting factuality of large language models | | 0 |
| Multi-FAct: Assessing Factuality of Multilingual LLMs using FActScore | Code | 0 |
| Securing Reliability: A Brief Overview on Enhancing In-Context Learning for Foundation Models | | 0 |
| Re-Ex: Revising after Explanation Reduces the Factual Errors in LLM Responses | Code | 0 |
| GROUNDHOG: Grounding Large Language Models to Holistic Segmentation | | 0 |
| Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models | | 0 |
| AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation | | 0 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Code | 0 |
| Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware | | 0 |
| Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models | | 0 |
| CARBD-Ko: A Contextually Annotated Review Benchmark Dataset for Aspect-Level Sentiment Classification in Korean | | 0 |
| UFO: a Unified and Flexible Framework for Evaluating Factuality of Large Language Models | Code | 0 |
| Does the Generator Mind its Contexts? An Analysis of Generative Model Faithfulness under Context Transfer | | 0 |
| DualFocus: Integrating Macro and Micro Perspectives in Multi-modal Large Language Models | Code | 0 |
| Science Checker Reloaded: A Bidirectional Paradigm for Transparency and Logical Reasoning | Code | 0 |
| OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data | | 0 |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | | 0 |
| Emergence and dynamics of delusions and hallucinations across stages in early psychosis | | 0 |
| GOOD: Towards Domain Generalized Orientated Object Detection | | 0 |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | | 0 |
| Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations | | 0 |
| Enabling Weak LLMs to Judge Response Reliability via Meta Ranking | | 0 |
| M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation | | 0 |

No leaderboard results yet.