SOTAVerified

World Knowledge

Papers

Showing 76–100 of 818 papers

Title | Status | Hype
FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification | Code | 1
Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement | Code | 1
VoxEval: Benchmarking the Knowledge Understanding Capabilities of End-to-End Spoken Language Models | Code | 1
An Automatic Graph Construction Framework based on Large Language Models for Recommendation | Code | 1
Knowledge Editing through Chain-of-Thought | Code | 1
Bridging the User-side Knowledge Gap in Knowledge-aware Recommendations with Large Language Models | Code | 1
Adapting to Non-Stationary Environments: Multi-Armed Bandit Enhanced Retrieval-Augmented Generation on Knowledge Graphs | Code | 1
I Don't Know: Explicit Modeling of Uncertainty with an [IDK] Token | Code | 1
Retrieval-Augmented Machine Translation with Unstructured Knowledge | Code | 1
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content | Code | 1
LLM Embeddings Improve Test-time Adaptation to Tabular Y|X-Shifts | Code | 1
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models | Code | 1
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement | Code | 1
Can OOD Object Detectors Learn from Foundation Models? | Code | 1
AgentMove: Predicting Human Mobility Anywhere Using Large Language Model based Agentic Framework | Code | 1
BLADE: Benchmarking Language Model Agents for Data-Driven Science | Code | 1
Flooding Spread of Manipulated Knowledge in LLM-Based Multi-Agent Communities | Code | 1
Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs | Code | 1
Large Scale Knowledge Washing | Code | 1
Everything is Editable: Extend Knowledge Editing to Unstructured Data in Large Language Models | Code | 1
Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models | Code | 1
PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning | Code | 1
Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias | Code | 1
LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | Code | 1
A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models | Code | 1

No leaderboard results yet.