SOTAVerified

World Knowledge

Papers

Showing 125 of 818 papers

TitleStatusHype
HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation0
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
A Semi-supervised Scalable Unified Framework for E-commerce Query Classification0
From 2D to 3D Cognition: A Brief Survey of General World Models0
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided ConversationsCode0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference AlignmentCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
ConTextTab: A Semantics-Aware Tabular In-Context LearnerCode2
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning0
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
Serendipitous Recommendation with Multimodal LLM0
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving0
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation0
Quantifying Cross-Modality Memorization in Vision-Language Models0
TIIF-Bench: How Does Your T2I Model Follow Your Instructions?0
From Words to Waves: Analyzing Concept Formation in Speech and Text-Based Foundation Models0
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering TasksCode0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model RecommendersCode0
SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA0
MOVi: Training-free Text-conditioned Multi-Object Video Generation0
Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model0
Show:102550
← PrevPage 1 of 33Next →

No leaderboard results yet.