SOTAVerified

World Knowledge

Papers

Showing 201225 of 818 papers

TitleStatusHype
Unsupervised Commonsense Question Answering with Self-TalkCode1
REALM: Retrieval-Augmented Language Model Pre-TrainingCode1
ASER: A Large-scale Eventuality Knowledge GraphCode1
Analyzing Knowledge Graph Embedding Methods from a Multi-Embedding Interaction PerspectiveCode1
CommonsenseQA: A Question Answering Challenge Targeting Commonsense KnowledgeCode1
Breaking NLI Systems with Sentences that Require Simple Lexical InferencesCode1
On the Evaluation of Semantic Phenomena in Neural Machine Translation Using Natural Language InferenceCode1
Imagine This! Scripts to Compositions to VideosCode1
Off-Policy General Value Functions to Represent Dynamic Role Assignments in RoboCup 3D Soccer SimulationCode1
HRSeg: High-Resolution Visual Perception and Enhancement for Reasoning Segmentation0
Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes0
KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection0
Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models0
A Semi-supervised Scalable Unified Framework for E-commerce Query Classification0
MIRAGE: A Benchmark for Multimodal Information-Seeking and Reasoning in Agricultural Expert-Guided ConversationsCode0
From 2D to 3D Cognition: A Brief Survey of General World Models0
Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference AlignmentCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning0
RoCA: Robust Cross-Domain End-to-End Autonomous Driving0
ReCogDrive: A Reinforced Cognitive Framework for End-to-End Autonomous Driving0
Serendipitous Recommendation with Multimodal LLM0
Vid2Sim: Generalizable, Video-based Reconstruction of Appearance, Geometry and Physics for Mesh-free Simulation0
Quantifying Cross-Modality Memorization in Vision-Language Models0
TIIF-Bench: How Does Your T2I Model Follow Your Instructions?0
Show:102550
← PrevPage 9 of 33Next →

No leaderboard results yet.