SOTAVerified

Hallucination

Papers

Showing 301325 of 1816 papers

TitleStatusHype
HallE-Control: Controlling Object Hallucination in Large Multimodal ModelsCode1
A Survey of Hallucination in Large Foundation ModelsCode1
Citation-Enhanced Generation for LLM-based ChatbotsCode1
FlySearch: Exploring how vision-language models exploreCode1
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & HallucinationsCode1
Circuit Transformer: A Transformer That Preserves Logical EquivalenceCode1
Finetune-RAG: Fine-Tuning Language Models to Resist Hallucination in Retrieval-Augmented GenerationCode1
Finding and Editing Multi-Modal Neurons in Pre-Trained TransformersCode1
FineSurE: Fine-grained Summarization Evaluation using LLMsCode1
PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language ModelCode1
AssistRAG: Boosting the Potential of Large Language Models with an Intelligent Information AssistantCode1
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based VerificationCode1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language ModelsCode1
Hallucinated Neural Radiance Fields in the WildCode1
CHATREPORT: Democratizing Sustainability Disclosure Analysis through LLM-based ToolsCode1
Aladdin: Zero-Shot Hallucination of Stylized 3D Assets from Abstract Scene DescriptionsCode1
Federated Recommendation via Hybrid Retrieval Augmented GenerationCode1
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative DecodingCode1
Controllable Neural Dialogue Summarization with Personal Named Entity PlanningCode1
CriSPO: Multi-Aspect Critique-Suggestion-guided Automatic Prompt Optimization for Text GenerationCode1
A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text GenerationCode1
High-resolution Face Swapping via Latent Semantics DisentanglementCode1
Contrastive Learning Reduces Hallucination in ConversationsCode1
FaithDial: A Faithful Benchmark for Information-Seeking DialogueCode1
ChartSumm: A Comprehensive Benchmark for Automatic Chart Summarization of Long and Short SummariesCode1
Show:102550
← PrevPage 13 of 73Next →

No leaderboard results yet.