SOTAVerified

World Knowledge

Papers

Showing 150 of 818 papers

TitleStatusHype
HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation0
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
A Semi-supervised Scalable Unified Framework for E-commerce Query Classification0
From 2D to 3D Cognition: A Brief Survey of General World Models0
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided ConversationsCode0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference AlignmentCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
ConTextTab: A Semantics-Aware Tabular In-Context LearnerCode2
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning0
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
Serendipitous Recommendation with Multimodal LLM0
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving0
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation0
Quantifying Cross-Modality Memorization in Vision-Language Models0
TIIF-Bench: How Does Your T2I Model Follow Your Instructions?0
From Words to Waves: Analyzing Concept Formation in Speech and Text-Based Foundation Models0
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering TasksCode0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model RecommendersCode0
SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA0
MOVi: Training-free Text-conditioned Multi-Object Video Generation0
Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model0
Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning0
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving0
Alchemist: Turning Public Text-to-Image Data into Generative Gold0
GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning ChainsCode1
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image GenerationCode0
Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
O^2-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question AnsweringCode1
TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language ModelsCode0
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets0
UniErase: Unlearning Token as a Universal Erasure Primitive for Language ModelsCode0
Table Foundation Models: on knowledge pre-training for tabular learning0
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge InjectionCode0
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and ChallengesCode0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution0
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive DiagnosisCode0
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration0
Advancing and Benchmarking Personalized Tool Invocation for LLMsCode0
Evaluating Contrastive Feedback for Effective User SimulationsCode0
WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation0
Grokking in the Wild: Data Augmentation for Real-World Multi-Hop Reasoning with Transformers0
Towards Automated Scoping of AI for Social Good Projects0
Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models0
WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba DiffusionCode1
Show:102550
← PrevPage 1 of 17Next →

No leaderboard results yet.