SOTAVerified

Hallucination

Papers

Showing 751-775 of 1816 papers

| Title | Status | Hype |
| --- | --- | --- |
| HalluShift: Measuring Distribution Shifts towards Hallucination Detection in LLMs | Code | 0 |
| Handling Ontology Gaps in Semantic Parsing | Code | 0 |
| SmurfCat at SemEval-2024 Task 6: Leveraging Synthetic Data for Hallucination Detection | Code | 0 |
| Enhancing Retrieval Processes for Language Generation with Augmented Queries | | 0 |
| Enhancing RAG with Active Learning on Conversation Records: Reject Incapables and Answer Capables | | 0 |
| Can We Catch the Elephant? A Survey of the Evolvement of Hallucination Evaluation on Natural Language Generation | | 0 |
| From Training-Free to Adaptive: Empirical Insights into MLLMs' Understanding of Detection Information | | 0 |
| Can Structured Data Reduce Epistemic Uncertainty? | | 0 |
| Enhancing Multi-Agent Consensus through Third-Party LLM Integration: Analyzing Uncertainty and Mitigating Hallucinations in Large Language Models | | 0 |
| Enhancing Mathematical Reasoning in Large Language Models with Self-Consistency-Based Hallucination Detection | | 0 |
| Can Open-source LLMs Enhance Data Synthesis for Toxic Detection?: An Experimental Study | | 0 |
| Enhancing LLM Generation with Knowledge Hypergraph for Evidence-Based Medicine | | 0 |
| Can LLMs Detect Intrinsic Hallucinations in Paraphrasing and Machine Translation? | | 0 |
| Applying RLAIF for Code Generation with API-usage in Lightweight LLMs | | 0 |
| Enhancing Hallucination Detection through Noise Injection | | 0 |
| Enhancing Guardrails for Safe and Secure Healthcare AI | | 0 |
| Enhancing Emergency Decision-making with Knowledge Graphs and Large Language Models | | 0 |
| Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | | 0 |
| Applications of Large Language Model Reasoning in Feature Generation | | 0 |
| Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation | | 0 |
| Enhanced document retrieval with topic embeddings | | 0 |
| Can Large Language Models Play Games? A Case Study of A Self-Play Approach | | 0 |
| Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation | | 0 |
| Can Knowledge Graphs Make Large Language Models More Trustworthy? An Empirical Study over Open-ended Question Answering | | 0 |
| A Perspective for Adapting Generalist AI to Specialized Medical AI Applications and Their Challenges | | 0 |

No leaderboard results yet.