SOTAVerified

World Knowledge

Papers

Showing 351375 of 818 papers

TitleStatusHype
All Entities are Not Created Equal: Examining the Long Tail for Fine-Grained Entity Typing0
Rulebreakers Challenge: Revealing a Blind Spot in Large Language Models' Reasoning with Formal Logic0
Roadmap towards Superhuman Speech Understanding using Large Language Models0
Comprehending Knowledge Graphs with Large Language Models for Recommender Systems0
Understanding the Role of LLMs in Multimodal Evaluation BenchmarksCode0
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities0
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with EntitiesCode0
TVBench: Redesigning Video-Language Evaluation0
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?Code0
SEAL: SEmantic-Augmented Imitation Learning via Language Model0
Intent Detection in the Age of LLMs0
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models0
"Why" Has the Least Side Effect on Model Editing0
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative CriterionCode0
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering0
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment BenchmarkingCode0
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models0
The X Types -- Mapping the Semantics of the Twitter Sphere0
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration0
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time0
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark0
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles0
How Does Code Pretraining Affect Language Model Task Performance?0
Physical Rule-Guided Convolutional Neural Network0
CV-Probes: Studying the interplay of lexical and world knowledge in visually grounded verb understanding0
Show:102550
← PrevPage 15 of 33Next →

No leaderboard results yet.