SOTAVerified

World Knowledge

Papers

Showing 701–750 of 818 papers

Title | Status | Hype
Morph Call: Probing Morphosyntactic Content of Multilingual Transformers | Code | 0
Who Relies More on World Knowledge and Bias for Syntactic Ambiguity Resolution: Humans or LLMs? | Code | 0
BiasKG: Adversarial Knowledge Graphs to Induce Bias in Large Language Models | Code | 0
Does Commonsense help in detecting Sarcasm? | Code | 0
AKEW: Assessing Knowledge Editing in the Wild | Code | 0
Walk-and-Relate: A Random-Walk-based Algorithm for Representation Learning on Sparse Knowledge Graphs | Code | 0
Arrows are the Verbs of Diagrams | Code | 0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment | Code | 0
TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language Models | Code | 0
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Code | 0
My Teacher Thinks The World Is Flat! Interpreting Automatic Essay Scoring Mechanism | Code | 0
LLMTreeRec: Unleashing the Power of Large Language Models for Cold-Start Recommendations | Code | 0
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and Challenges | Code | 0
NLITrans at SemEval-2018 Task 12: Transfer of Semantic Knowledge for Argument Comprehension | Code | 0
CoRTEx: Contrastive Learning for Representing Terms via Explanations with Applications on Constructing Biomedical Knowledge Graphs | Code | 0
FLAME: Self-Supervised Low-Resource Taxonomy Expansion using Large Language Models | Code | 0
Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models | Code | 0
LitCQD: Multi-Hop Reasoning in Incomplete Knowledge Graphs with Numeric Literals | Code | 0
ObjCAViT: Improving Monocular Depth Estimation Using Natural Language Models And Image-Object Cross-Attention | Code | 0
Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams | Code | 0
Large Language Models Need Consultants for Reasoning: Becoming an Expert in a Complex Human System Through Behavior Simulation | Code | 0
Scaling Autoregressive Models for Content-Rich Text-to-Image Generation | Code | 0
Finding Motifs in Knowledge Graphs using Compression | Code | 0
Advancing and Benchmarking Personalized Tool Invocation for LLMs | Code | 0
Video Summarization: Towards Entity-Aware Captions | Code | 0
Language models show human-like content effects on reasoning tasks | Code | 0
Scope Ambiguities in Large Language Models | Code | 0
Filling the Image Information Gap for VQA: Prompting Large Language Models to Proactively Ask Questions | Code | 0
Language Model Behavior: A Comprehensive Survey | Code | 0
Contextual Knowledge Pursuit for Faithful Visual Synthesis | Code | 0
ComDensE : Combined Dense Embedding of Relation-aware and Common Features for Knowledge Graph Completion | Code | 0
Figurative Language in Recognizing Textual Entailment | Code | 0
Combining Analogy with Language Models for Knowledge Extraction | Code | 0
On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification | Code | 0
COFAR: Commonsense and Factual Reasoning in Image Search | Code | 0
Knowledge Graph Completion with Mixed Geometry Tensor Factorization | Code | 0
Knowledge Generation -- Variational Bayes on Knowledge Graphs | Code | 0
Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access | Code | 0
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning | Code | 0
Open-World Knowledge Graph Completion | Code | 0
Knowledge Boundary and Persona Dynamic Shape A Better Social Media Agent | Code | 0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model Recommenders | Code | 0
Knowledge-Augmented Language Model and its Application to Unsupervised Named-Entity Recognition | Code | 0
StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning | Code | 0
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Code | 0
KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Code | 0
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment | Code | 0
Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries | Code | 0
PCR4ALL: A Comprehensive Evaluation Benchmark for Pronoun Coreference Resolution in English | Code | 0
Is Incoherence Surprising? Targeted Evaluation of Coherence Prediction from Language Models | Code | 0

No leaderboard results yet.