SOTAVerified

World Knowledge

Papers

Showing 726750 of 818 papers

TitleStatusHype
Language models show human-like content effects on reasoning tasksCode0
Scope Ambiguities in Large Language ModelsCode0
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask QuestionsCode0
Language Model Behavior: A Comprehensive SurveyCode0
Contextual Knowledge Pursuit for Faithful Visual SynthesisCode0
ComDensE : Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph CompletionCode0
Figurative Language in Recognizing Textual EntailmentCode0
Combining Analogy with Language Models for Knowledge ExtractionCode0
On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme ClassificationCode0
COFAR: Commonsense and Factual Reasoning in Image SearchCode0
Knowledge Graph Completion with Mixed Geometry Tensor FactorizationCode0
Knowledge Generation -- Variational Bayes on Knowledge GraphsCode0
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information AccessCode0
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop ReasoningCode0
Open-World Knowledge Graph CompletionCode0
Knowledge Boundary and Persona Dynamic Shape A Better Social Media AgentCode0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model RecommendersCode0
Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity RecognitionCode0
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based LearningCode0
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language ModelsCode0
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language ModelsCode0
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition AlignmentCode0
Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related QueriesCode0
PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in EnglishCode0
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language ModelsCode0
Show:102550
← PrevPage 30 of 33Next →

No leaderboard results yet.