World Knowledge

Papers

Showing 11–20 of 818 papers

Title	Date	Tasks	Status	Hype
VILA: On Pre-training for Visual Language Models	Dec 12, 2023	In-Context Learning, Language Modelling	Code Available	4
Text2SQL is Not Enough: Unifying AI and Databases with TAG	Aug 27, 2024	RAG, Retrieval-augmented Generation	Code Available	4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation	Nov 7, 2024	Contrastive Learning, Image Captioning	Code Available	4
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks	May 22, 2020	Fact Verification, Question Answering	Code Available	4
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs	Jan 1, 2024	Visual Grounding, World Knowledge	Code Available	4
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning	Jun 16, 2025	Action Generation, Autonomous Driving	Code Available	3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation	Apr 15, 2024	Contrastive Learning, Descriptive	Code Available	3
Are We on the Right Way for Evaluating Large Vision-Language Models?	Mar 29, 2024	World Knowledge	Code Available	3
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy	Jun 28, 2024	Vision-Language-Action, World Knowledge	Code Available	3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Jan 24, 2025	Autonomous Driving, Language Modeling	Code Available	3

