SOTAVerified

Decision Making

Papers

Showing 101–150 of 12,311 papers

| Title | Status | Hype |
|---|---|---|
| DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Code | 2 |
| Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic Response | Code | 2 |
| Concept Bottleneck Language Models For protein design | Code | 2 |
| LLM-PySC2: Starcraft II learning environment for Large Language Models | Code | 2 |
| AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-Making | Code | 2 |
| A Survey of Financial AI: Architectures, Advances and Open Challenges | Code | 2 |
| ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting | Code | 2 |
| Context is Key: A Benchmark for Forecasting with Essential Textual Information | Code | 2 |
| Literature Meets Data: A Synergistic Approach to Hypothesis Generation | Code | 2 |
| Improving Causal Reasoning in Large Language Models: A Survey | Code | 2 |
| A New Approach to Solving SMAC Task: Generating Decision Tree Code from Large Language Models | Code | 2 |
| Local Off-Grid Weather Forecasting with Multi-Modal Earth Observation Data | Code | 2 |
| Process Reward Model with Q-Value Rankings | Code | 2 |
| ForecastBench: A Dynamic Benchmark of AI Forecasting Capabilities | Code | 2 |
| Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework | Code | 2 |
| Strategist: Learning Strategic Skills by LLMs via Bi-Level Tree Search | Code | 2 |
| MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning | Code | 2 |
| CoD, Towards an Interpretable Medical Agent using Chain of Diagnosis | Code | 2 |
| UrbanWorld: An Urban World Model for 3D City Generation | Code | 2 |
| FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making | Code | 2 |
| AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents | Code | 2 |
| ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Code | 2 |
| MacroHFT: Memory Augmented Context-aware Reinforcement Learning On High Frequency Trading | Code | 2 |
| PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers | Code | 2 |
| CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making | Code | 2 |
| LVBench: An Extreme Long Video Understanding Benchmark | Code | 2 |
| Predictive Dynamic Fusion | Code | 2 |
| XRec: Large Language Models for Explainable Recommendation | Code | 2 |
| OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving | Code | 2 |
| Can Graph Learning Improve Planning in LLM-based Agents? | Code | 2 |
| Safe Multi-Agent Reinforcement Learning with Bilevel Optimization in Autonomous Driving | Code | 2 |
| Position: Foundation Agents as the Paradigm Shift for Decision Making | Code | 2 |
| Diffusion Actor-Critic with Entropy Regulator | Code | 2 |
| iVideoGPT: Interactive VideoGPTs are Scalable World Models | Code | 2 |
| Continuously Learning, Adapting, and Improving: A Dual-Process Approach to Autonomous Driving | Code | 2 |
| LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks | Code | 2 |
| MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection | Code | 2 |
| OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition | Code | 2 |
| A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model | Code | 2 |
| PLAYER*: Enhancing LLM-based Multi-Agent Communication and Interaction in Murder Mystery Games | Code | 2 |
| Cooperate or Collapse: Emergence of Sustainable Cooperation in a Society of LLM Agents | Code | 2 |
| Mitigating Hallucinations in Large Vision-Language Models with Instruction Contrastive Decoding | Code | 2 |
| Embodied LLM Agents Learn to Cooperate in Organized Teams | Code | 2 |
| BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation | Code | 2 |
| How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments | Code | 2 |
| Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning | Code | 2 |
| LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments | Code | 2 |
| A Cognitive-Based Trajectory Prediction Approach for Autonomous Driving | Code | 2 |
| DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning | Code | 2 |
| MACRec: a Multi-Agent Collaboration Framework for Recommendation | Code | 2 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified |