SOTAVerified

World Knowledge

Papers

Showing 126150 of 818 papers

TitleStatusHype
VLABench: A Large-Scale Benchmark for Language-Conditioned Robotics Manipulation with Long-Horizon Reasoning Tasks0
An Automatic Graph Construction Framework based on Large Language Models for RecommendationCode1
Knowledge Editing through Chain-of-ThoughtCode1
Interweaving Memories of a Siamese Large Language ModelCode0
Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models0
Logical Consistency of Large Language Models in Fact-checking0
Fietje: An open, efficient LLM for DutchCode2
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering0
MMLU-CF: A Contamination-free Multi-task Language Understanding BenchmarkCode2
Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language ModelsCode1
AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World KnowledgeCode0
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning0
HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction0
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMsCode0
GaGA: Towards Interactive Global Geolocation Assistant0
AltFS: Agency-light Feature Selection with Large Language Models in Deep Recommender Systems0
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge GraphsCode1
Balancing Efficiency and Effectiveness: An LLM-Infused Approach for Optimized CTR Prediction0
Exploring Critical Testing Scenarios for Decision-Making Policies: An LLM Approach0
World knowledge-enhanced Reasoning Using Instruction-guided Interactor in Autonomous Driving0
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] TokenCode1
Retrieval-Augmented Machine Translation with Unstructured KnowledgeCode1
A surprisal oracle for when every layer countsCode0
SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model0
Realistic Corner Case Generation for Autonomous Vehicles with Multimodal Large Language Model0
Show:102550
← PrevPage 6 of 33Next →

No leaderboard results yet.