SOTAVerified

World Knowledge

Papers

Showing 201-250 of 818 papers

Title | Status | Hype
How Does Code Pretraining Affect Language Model Task Performance? | - | 0
Physical Rule-Guided Convolutional Neural Network | - | 0
CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding | - | 0
Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning | - | 0
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis | - | 0
Text2SQL is Not Enough: Unifying AI and Databases with TAG | Code | 4
Exploring the Potential of Large Language Models for Heterophilic Graphs | - | 0
AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Code | 1
To Code, or Not To Code? Exploring Impact of Code in Pre-training | - | 0
Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models | - | 0
CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation | - | 0
CoDi: Conversational Distillation for Grounded Question Answering | - | 0
BLADE: Benchmarking Language Model Agents for Data-Driven Science | Code | 1
On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme Classification | Code | 0
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models | - | 0
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data Uncertainty | Code | 0
Prompt Tuning as User Inherent Profile Inference Machine | - | 0
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description | Code | 0
Better Alignment with Instruction Back-and-Forth Translation | - | 0
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Code | 2
Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation | - | 0
CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge | - | 0
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages | Code | 2
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models | - | 0
DYNAMICQA: Tracing Internal Knowledge Conflicts in Language Models | Code | 0
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models | Code | 0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data | - | 0
LoFTI: Localization and Factuality Transfer to Indian Locales | Code | 0
VISA: Reasoning Video Object Segmentation via Large Language Models | Code | 3
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Code | 1
VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | - | 0
Language Representations Can be What Recommenders Need: Findings and Potentials | Code | 2
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization | - | 0
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Code | 3
Scaling Synthetic Data Creation with 1,000,000,000 Personas | Code | 11
Mental Modeling of Reinforcement Learning Agents by Language Models | - | 0
LABOR-LLM: Language-Based Occupational Representations with Large Language Models | - | 0
Mitigating Hallucination in Fictional Character Role-Play | Code | 0
Exploring Factual Entailment with NLI: A News Media Study | - | 0
Evaluating the Ability of Large Language Models to Reason about Cardinal Directions | - | 0
On the Role of Long-tail Knowledge in Retrieval Augmented Large Language Models | - | 0
LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments | Code | 2
OCALM: Object-Centric Assessment with Language Models | - | 0
What Teaches Robots to Walk, Teaches Them to Trade too -- Regime Adaptive Execution using Informed Data and LLMs | - | 0
Locating and Extracting Relational Concepts in Large Language Models | Code | 0
WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia | - | 0
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning | Code | 0
Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams | Code | 0
A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences | Code | 0
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Code | 2

No leaderboard results yet.