SOTAVerified

World Knowledge

Papers

Showing 2650 of 818 papers

TitleStatusHype
Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning0
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving0
Alchemist: Turning Public Text-to-Image Data into Generative Gold0
GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning ChainsCode1
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image GenerationCode0
Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question AnsweringCode1
TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language ModelsCode0
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets0
UniErase: Unlearning Token as a Universal Erasure Primitive for Language ModelsCode0
Table Foundation Models: on knowledge pre-training for tabular learning0
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge InjectionCode0
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and ChallengesCode0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution0
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive DiagnosisCode0
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration0
Advancing and Benchmarking Personalized Tool Invocation for LLMsCode0
Evaluating Contrastive Feedback for Effective User SimulationsCode0
WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers0
Towards Automated Scoping of AI for Social Good Projects0
Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models0
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba DiffusionCode1
Show:102550
← PrevPage 2 of 33Next →

No leaderboard results yet.