SOTAVerified

World Knowledge

Papers

Showing 1–50 of 818 papers

Title | Status | Hype
Scaling Synthetic Data Creation with 1,000,000,000 Personas | Code | 11
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models | Code | 7
Mistral 7B | Code | 6
Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Code | 6
ChatDBG: Augmenting Debugging with Large Language Models | Code | 5
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions | Code | 5
PointVLA: Injecting the 3D World into Vision-Language-Action Models | Code | 4
VILA: On Pre-training for Visual Language Models | Code | 4
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation | Code | 4
LLM2CLIP: Powerful Language Model Unlocks Richer Visual Representation | Code | 4
LISA: Reasoning Segmentation via Large Language Model | Code | 4
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Code | 4
Text2SQL is Not Enough: Unifying AI and Databases with TAG | Code | 4
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks | Code | 4
HLLM: Enhancing Sequential Recommendations via Hierarchical Large Language Models for Item and User Modeling | Code | 4
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy | Code | 3
GS2Mesh: Surface Reconstruction from Gaussian Splatting via Novel Stereo Views | Code | 3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and Roadmap | Code | 3
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Code | 3
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Code | 3
How Can Recommender Systems Benefit from Large Language Models: A Survey | Code | 3
Tokenization, Fusion, and Augmentation: Towards Fine-grained Multi-modal Entity Representation | Code | 3
Unified Source-Free Domain Adaptation | Code | 3
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models | Code | 3
VISA: Reasoning Video Object Segmentation via Large Language Models | Code | 3
Are We on the Right Way for Evaluating Large Vision-Language Models? | Code | 3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge | Code | 3
GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation | Code | 3
Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks | Code | 2
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning | Code | 2
PlanBench: An Extensible Benchmark for Evaluating Large Language Models on Planning and Reasoning about Change | Code | 2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos | Code | 2
ConTextTab: A Semantics-Aware Tabular In-Context Learner | Code | 2
On Softmax Direct Preference Optimization for Recommendation | Code | 2
Agent Planning with World Knowledge Model | Code | 2
MMLU-CF: A Contamination-free Multi-task Language Understanding Benchmark | Code | 2
MeaCap: Memory-Augmented Zero-shot Image Captioning | Code | 2
Measuring Massive Multitask Language Understanding | Code | 2
Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models | Code | 2
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit | Code | 2
LangSuitE: Planning, Controlling and Interacting with Large Language Models in Embodied Text Environments | Code | 2
HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Code | 2
HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Code | 2
Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents | Code | 2
KG-FIT: Knowledge Graph Fine-Tuning Upon Open-World Knowledge | Code | 2
ChatPLUG: Open-Domain Generative Dialogue System with Internet-Augmented Instruction Tuning for Digital Human | Code | 2
GreaseLM: Graph REASoning Enhanced Language Models for Question Answering | Code | 2
Grasp-Anything: Large-scale Grasp Dataset from Foundation Models | Code | 2
Language Representations Can be What Recommenders Need: Findings and Potentials | Code | 2
A Synthetic Dataset for Personal Attribute Inference | Code | 2

No leaderboard results yet.