SOTAVerified

Hallucination

Papers

Showing 851875 of 1816 papers

TitleStatusHype
A Benchmark and Robustness Study of In-Context-Learning with Large Language Models in Music Entity DetectionCode0
CG-Bench: Clue-grounded Question Answering Benchmark for Long Video Understanding0
RAC3: Retrieval-Augmented Corner Case Comprehension for Autonomous Driving with Vision-Language Models0
Task-Oriented Dialog Systems for the Senegalese Wolof Language0
Combating Multimodal LLM Hallucination via Bottom-Up Holistic Reasoning0
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data0
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries0
Accelerating Retrieval-Augmented Generation0
Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Unanswerable Questions and Ambiguous Prompts0
Benchmarking large language models for materials synthesis: the case of atomic layer deposition0
TACOMORE: Leveraging the Potential of LLMs in Corpus-based Discourse Analysis with Prompt Engineering0
Multi-Task Learning with LLMs for Implicit Sentiment Analysis: Data-level and Task-level Automatic Weight Learning0
Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic ScenariosCode0
HalluCana: Fixing LLM Hallucination with A Canary Lookahead0
Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language ModelsCode0
Methods for Legal Citation Prediction in the Age of LLMs: An Australian Law Case Study0
Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent0
LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs0
Verb Mirage: Unveiling and Assessing Verb Concept Hallucinations in Multimodal Large Language Models0
Steps are all you need: Rethinking STEM Education with Prompt Engineering0
Multi-Objective Alignment of Large Language Models Through Hypervolume Maximization0
100% Elimination of Hallucinations on RAGTruth for GPT-4 and GPT-3.5 Turbo0
Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling0
TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG0
Deep priors for satellite image restoration with accurate uncertainties0
Show:102550
← PrevPage 35 of 73Next →

No leaderboard results yet.