SOTAVerified

World Knowledge

Papers

Showing 150 of 818 papers

TitleStatusHype
Scaling Synthetic Data Creation with 1,000,000,000 PersonasCode11
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language ModelsCode7
Mistral 7BCode6
Direct Preference Optimization: Your Language Model is Secretly a Reward ModelCode6
ShareGPT4Video: Improving Video Understanding and Generation with Better CaptionsCode5
ChatDBG: Augmenting Debugging with Large Language ModelsCode5
PointVLA: Injecting the 3D World into Vision-Language-Action ModelsCode4
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image GenerationCode4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual RepresentationCode4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User ModelingCode4
Text2SQL is Not Enough: Unifying AI and Databases with TAGCode4
V?: Guided Visual Search as a Core Mechanism in Multimodal LLMsCode4
VILA: On Pre-training for Visual Language ModelsCode4
LISA: Reasoning Segmentation via Large Language ModelCode4
Retrieval-Augmented Generation for Knowledge-Intensive NLP TasksCode4
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-TuningCode3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image GenerationCode3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and GenerationCode3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and RoadmapCode3
VISA: Reasoning Video Object Segmentation via Large Language ModelsCode3
LLaRA: Supercharging Robot Learning Data for Vision-Language PolicyCode3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity RepresentationCode3
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo ViewsCode3
Are We on the Right Way for Evaluating Large Vision-Language Models?Code3
Unified Source-Free Domain AdaptationCode3
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language ModelsCode3
How Can Recommender Systems Benefit from Large Language Models: A SurveyCode3
ConTextTab: A Semantics-Aware Tabular In-Context LearnerCode2
Free-form language-based robotic reasoning and graspingCode2
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual PerceiverCode2
Fietje: An open, efficient LLM for DutchCode2
MMLU-CF: A Contamination-free Multi-task Language Understanding BenchmarkCode2
HyperSeg: Towards Universal Visual Segmentation with Large Language ModelCode2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
Synthetic continued pretrainingCode2
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon TasksCode2
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian LanguagesCode2
Language Representations Can be What Recommenders Need: Findings and PotentialsCode2
LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text EnvironmentsCode2
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language ModelsCode2
On Softmax Direct Preference Optimization for RecommendationCode2
A Synthetic Dataset for Personal Attribute InferenceCode2
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuningCode2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World KnowledgeCode2
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision ModelsCode2
Agent Planning with World Knowledge ModelCode2
Learnable Item Tokenization for Generative RecommendationCode2
Understanding Long Videos with Multimodal Language ModelsCode2
Embodied LLM Agents Learn to Cooperate in Organized TeamsCode2
Show:102550
← PrevPage 1 of 17Next →

No leaderboard results yet.