SOTAVerified

World Knowledge

Papers

Showing 176200 of 818 papers

TitleStatusHype
KITTEN: A Knowledge-Intensive Evaluation of Image Generation on Visual Entities0
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers ContentCode1
DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with EntitiesCode0
TVBench: Redesigning Video-Language Evaluation0
LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-ShiftsCode1
Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance?Code0
SEAL: SEmantic-Augmented Imitation Learning via Language Model0
Intent Detection in the Age of LLMs0
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
"Why" Has the Least Side Effect on Model Editing0
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language ModelsCode1
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models0
Pioneering Reliable Assessment in Text-to-Image Knowledge Editing: Leveraging a Fine-Grained Dataset and an Innovative CriterionCode0
60 Data Points are Sufficient to Fine-Tune LLMs for Question-Answering0
Style Outweighs Substance: Failure Modes of LLM Judges in Alignment BenchmarkingCode0
Can-Do! A Dataset and Neuro-Symbolic Grounded Framework for Embodied Planning with Large Multimodal Models0
The X Types -- Mapping the Semantics of the Twitter Sphere0
Relevance-driven Decision Making for Safer and More Efficient Human Robot Collaboration0
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time0
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
Diversify and Conquer: Diversity-Centric Data Selection with Iterative RefinementCode1
Visual Language Tracking with Multi-modal Interaction: A Robust Benchmark0
Synthetic continued pretrainingCode2
Multimodal Large Language Model Driven Scenario Testing for Autonomous Vehicles0
Can OOD Object Detectors Learn from Foundation Models?Code1
Show:102550
← PrevPage 8 of 33Next →

No leaderboard results yet.