SOTAVerified

World Knowledge

Papers

Showing 301-325 of 818 papers

Title | Status | Hype
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Code | 0
Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Code | 0
Language models show human-like content effects on reasoning tasks | Code | 0
Geographical Erasure in Language Generation | Code | 0
Language Model Behavior: A Comprehensive Survey | Code | 0
Contextual Knowledge Pursuit for Faithful Visual Synthesis | Code | 0
Knowledge Graph Completion with Mixed Geometry Tensor Factorization | Code | 0
LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Code | 0
Memory-Modular Classification: Learning to Generalize with Memory Replacement | Code | 0
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Code | 0
GrowOVER: How Can LLMs Adapt to Growing Real-World Knowledge? | Code | 0
LoFTI: Localization and Factuality Transfer to Indian Locales | Code | 0
Logic Attention Based Neighborhood Aggregation for Inductive Knowledge Graph Embedding | Code | 0
ComDensE : Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph Completion | Code | 0
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Code | 0
Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition | Code | 0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | Code | 0
Combining Analogy with Language Models for Knowledge Extraction | Code | 0
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | Code | 0
Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Code | 0
DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Code | 0
Investigating associative, switchable and negatable Winograd items on renewed French data sets | Code | 0
Augmenting Neural Networks with First-order Logic | Code | 0
Intrinsic Knowledge Evaluation on Chinese Language Models | Code | 0

No leaderboard results yet.