SOTAVerified

Hallucination

Papers

Showing 10011050 of 1816 papers

TitleStatusHype
The 2nd FutureDial Challenge: Dialog Systems with Retrieval Augmented Generation (FutureDial-RAG)Code1
Retrieval-Augmented Language Model for Extreme Multi-Label Knowledge Graph Link PredictionCode0
CT-Eval: Benchmarking Chinese Text-to-Table Performance in Large Language Models0
Automated Multi-level Preference for MLLMsCode1
Evaluating Text-to-Speech Synthesis from a Large Discrete Token-based Speech Language Model0
Enhancing Semantics in Multimodal Chain of Thought via Soft Negative SamplingCode1
Spurious reconstruction from brain activityCode0
A Comprehensive Survey of Hallucination in Large Language, Image, Video and Audio Foundation Models0
Word Alignment as Preference for Machine Translation0
Navigating LLM Ethics: Advancements, Challenges, and Future Directions0
ALMol: Aligned Language-Molecule Translation LLMs through Offline Preference Contrastive Optimisation0
Control Token with Dense Passage Retrieval0
Benchmarking Retrieval-Augmented Large Language Models in Biomedical NLP: Application, Robustness, and Self-Awareness0
Mitigating Hallucinations in Large Language Models via Self-Refinement-Enhanced Knowledge Retrieval0
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought0
THRONE: An Object-based Hallucination Benchmark for the Free-form Generations of Large Vision-Language ModelsCode1
Is the House Ready For Sleeptime? Generating and Evaluating Situational Queries for Embodied Question Answering0
SUTRA: Scalable Multilingual Language Model Architecture0
Sora Detector: A Unified Hallucination Detection for Large Text-to-Video ModelsCode0
Deception in Reinforced Autonomous Agents0
Quantifying the Capabilities of LLMs across Scale and Precision0
Score-based Generative Priors Guided Model-driven Network for MRI Reconstruction0
R4: Reinforced Retriever-Reorder-Responder for Retrieval-Augmented Large Language Models0
Attribution in Scientific Literature: New Benchmark and Methods0
FLAME: Factuality-Aware Alignment for Large Language Models0
Can a Hallucinating Model help in Reducing Human "Hallucination"?0
Addressing Topic Granularity and Hallucination in Large Language Models for Topic ModellingCode0
What Makes for Good Image Captions?0
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based VerificationCode1
RAG and RAU: A Survey on Retrieval-Augmented Language Model in Natural Language ProcessingCode3
Harmonic LLMs are Trustworthy0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation0
A robust and scalable framework for hallucination detection in virtual tissue staining and digital pathology0
Hallucination of Multimodal Large Language Models: A SurveyCode4
MMAC-Copilot: Multi-modal Agent Collaboration Operating Copilot0
SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models0
Fake Artificial Intelligence Generated Contents (FAIGC): A Survey of Theories, Detection Methods, and Opportunities0
Can Foundational Large Language Models Assist with Conducting Pharmaceuticals Manufacturing Investigations?0
Retrieval Head Mechanistically Explains Long-Context FactualityCode3
KS-LLM: Knowledge Selection of Large Language Models with Evidence Document for Question Answering0
Student Data Paradox and Curious Case of Single Student-Tutor Model: Regressive Side Effects of Training LLMs for Personalized Learning0
FINEMATCH: Aspect-based Fine-grained Image and Text Mismatch Detection and Correction0
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models0
Generate-on-Graph: Treat LLM as both Agent and KG in Incomplete Knowledge Graph Question AnsweringCode2
Integrating Chemistry Knowledge in Large Language Models via Prompt EngineeringCode0
LLMs Know What They Need: Leveraging a Missing Information Guided Framework to Empower Retrieval-Augmented GenerationCode1
VALOR-EVAL: Holistic Coverage and Faithfulness Evaluation of Large Vision-Language ModelsCode1
Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI FeedbackCode1
Single-sample image-fusion upsampling of fluorescence lifetime images0
TextSquare: Scaling up Text-Centric Visual Instruction Tuning0
Show:102550
← PrevPage 21 of 37Next →

No leaderboard results yet.