SOTAVerified

World Knowledge

Papers

Showing 201250 of 818 papers

TitleStatusHype
Unsupervised Commonsense Question Answering with Self-TalkCode1
REALM: Retrieval-Augmented Language Model Pre-TrainingCode1
ASER: A Large-scale Eventuality Knowledge GraphCode1
Analyzing Knowledge Graph Embedding Methods from a Multi-Embedding Interaction PerspectiveCode1
CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeCode1
Breaking NLI Systems with Sentences that Require Simple Lexical InferencesCode1
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language InferenceCode1
Imagine This! Scripts to Compositions to VideosCode1
Off-Policy General Value Functions to Represent Dynamic Role Assignments in RoboCup 3D Soccer SimulationCode1
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes0
HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
A Semi-supervised Scalable Unified Framework for E-commerce Query Classification0
From 2D to 3D Cognition: A Brief Survey of General World Models0
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided ConversationsCode0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference AlignmentCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning0
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving0
Serendipitous Recommendation with Multimodal LLM0
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation0
Quantifying Cross-Modality Memorization in Vision-Language Models0
TIIF-Bench: How Does Your T2I Model Follow Your Instructions?0
From Words to Waves: Analyzing Concept Formation in Speech and Text-Based Foundation Models0
Probing the Geometry of Truth: Consistency and Generalization of Truth Directions in LLMs Across Logical Transformations and Question Answering TasksCode0
MOVi: Training-free Text-conditioned Multi-Object Video Generation0
SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA0
Augment or Not? A Comparative Study of Pure and Augmented Large Language Model RecommendersCode0
Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model0
DriveX: Omni Scene Modeling for Learning Generalizable World Knowledge in Autonomous Driving0
Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning0
Alchemist: Turning Public Text-to-Image Data into Generative Gold0
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image GenerationCode0
Do BERT-Like Bidirectional Models Still Perform Better on Text Classification in the Era of LLMs?0
DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation0
TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language ModelsCode0
Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets0
UniErase: Unlearning Token as a Universal Erasure Primitive for Language ModelsCode0
Table Foundation Models: on knowledge pre-training for tabular learning0
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge InjectionCode0
Benchmarking Spatiotemporal Reasoning in LLMs and Reasoning Models: Capabilities and ChallengesCode0
Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation0
LODGE: Joint Hierarchical Task Planning and Learning of Domain Models with Grounded Execution0
LLM4CD: Leveraging Large Language Models for Open-World Knowledge Augmented Cognitive DiagnosisCode0
Enhancing Cache-Augmented Generation (CAG) with Adaptive Contextual Compression for Scalable Knowledge Integration0
Advancing and Benchmarking Personalized Tool Invocation for LLMsCode0
Evaluating Contrastive Feedback for Effective User SimulationsCode0
WorldGenBench: A World-Knowledge-Integrated Benchmark for Reasoning-Driven Text-to-Image Generation0
Show:102550
← PrevPage 5 of 17Next →

No leaderboard results yet.