SOTAVerified

World Knowledge

Papers

Showing 351400 of 818 papers

TitleStatusHype
All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing0
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic0
Roadmap towards Superhuman Speech Understanding using Large Language Models0
Understanding the Role of LLMs in Multimodal Evaluation BenchmarksCode0
Comprehending Knowledge Graphs with Large Language Models for Recommender Systems0
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities0
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with EntitiesCode0
TVBench: Redesigning Video-Language Evaluation0
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?Code0
SEAL: SEmantic-Augmented Imitation Learning via Language Model0
Intent Detection in the Age of LLMs0
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models0
"Why" Has the Least Side Effect on Model Editing0
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative CriterionCode0
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering0
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment BenchmarkingCode0
The X Types -- Mapping the Semantics of the Twitter Sphere0
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models0
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration0
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time0
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark0
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles0
How Does Code Pretraining Affect Language Model Task Performance?0
Physical Rule-Guided Convolutional Neural Network0
CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding0
Novel-WD: Exploring acquisition of Novel World Knowledge in LLMs Using Prefix-Tuning0
Zero-Shot Visual Reasoning by Vision-Language Models: Benchmarking and Analysis0
Exploring the Potential of Large Language Models for Heterophilic Graphs0
To Code, or Not To Code? Exploring Impact of Code in Pre-training0
Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models0
CoRA: Collaborative Information Perception by Large Language Model's Weights for Recommendation0
CoDi: Conversational Distillation for Grounded Question Answering0
On the Necessity of World Knowledge for Mitigating Missing Labels in Extreme ClassificationCode0
A Mechanistic Interpretation of Syllogistic Reasoning in Auto-Regressive Language Models0
Prompt Tuning as User Inherent Profile Inference Machine0
MAQA: Evaluating Uncertainty Quantification in LLMs Regarding Data UncertaintyCode0
LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial DescriptionCode0
Better Alignment with Instruction Back-and-Forth Translation0
Lifelong Personalized Low-Rank Adaptation of Large Language Models for Recommendation0
CLR-Fact: Evaluating the Complex Logical Reasoning Capability of Large Language Models over Factual Knowledge0
Visual Riddles: a Commonsense and World Knowledge Challenge for Large Vision and Language Models0
DYNAMICQA: Tracing Internal Knowledge Conflicts in Language ModelsCode0
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language ModelsCode0
Generalization v.s. Memorization: Tracing Language Models' Capabilities Back to Pretraining Data0
LoFTI: Localization and Factuality Transfer to Indian LocalesCode0
VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving0
BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization0
Mental Modeling of Reinforcement Learning Agents by Language Models0
LABOR-LLM: Language-Based Occupational Representations with Large Language Models0
Mitigating Hallucination in Fictional Character Role-PlayCode0
Show:102550
← PrevPage 8 of 17Next →

No leaderboard results yet.