SOTAVerified

World Knowledge

Papers

Showing 151-175 of 818 papers

| Title | Status | Hype |
|---|---|---|
| Head-to-Tail: How Knowledgeable are Large Language Models (LLMs)? A.K.A. Will LLMs Replace Knowledge Graphs? | Code | 1 |
| Infusing Disease Knowledge into BERT for Health Question Answering, Medical Inference and Disease Name Recognition | Code | 1 |
| Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization | Code | 1 |
| Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Code | 1 |
| Imagine This! Scripts to Compositions to Videos | Code | 1 |
| KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs | Code | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | Code | 1 |
| A Unified Encoder-Decoder Framework with Entity Memory | Code | 1 |
| Combo of Thinking and Observing for Outside-Knowledge VQA | Code | 1 |
| F-ViTA: Foundation Model Guided Visible to Thermal Translation | Code | 1 |
| CogIE: An Information Extraction Toolkit for Bridging Texts and CogNet | Code | 1 |
| Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Code | 1 |
| Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU | Code | 1 |
| Counterfactual reasoning: Do language models need world knowledge for causal understanding? | Code | 1 |
| Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios | Code | 1 |
| Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | Code | 1 |
| FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification | Code | 1 |
| Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language | Code | 1 |
| CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Code | 1 |
| Better Together: Enhancing Generative Knowledge Graph Completion with Language Models and Neighborhood Information | Code | 1 |
| Beyond Embeddings: The Promise of Visual Table in Visual Reasoning | Code | 1 |
| LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Code | 1 |
| GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains | Code | 1 |
| Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection | Code | 1 |
| Exploiting Diffusion Prior for Real-World Image Dehazing with Unpaired Training | Code | 1 |

No leaderboard results yet.