SOTAVerified

Decision Making

Papers

Showing 201250 of 12311 papers

TitleStatusHype
Distributional Soft Actor-Critic with Three RefinementsCode2
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place RecognitionCode2
AdaSociety: An Adaptive Environment with Social Structures for Multi-Agent Decision-MakingCode2
Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph MatchingCode2
Deep Generative Models for Offline Policy Learning: Tutorial, Survey, and Perspectives on Future DirectionsCode2
PlanT: Explainable Planning Transformers via Object-Level RepresentationsCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
ADAPT: Action-aware Driving Caption TransformerCode2
Predictive Dynamic FusionCode2
Preserving Causal Constraints in Counterfactual Explanations for Machine Learning ClassifiersCode2
A Review of Safe Reinforcement Learning: Methods, Theory and ApplicationsCode2
Process Reward Model with Q-Value RankingsCode2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based PoliciesCode2
Can Graph Learning Improve Planning in LLM-based Agents?Code2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian SplattingCode2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference LearningCode2
Dungeons and Data: A Large-Scale NetHack DatasetCode2
CausalPFN: Amortized Causal Effect Estimation via In-Context LearningCode2
GPT-Driver: Learning to Drive with GPTCode2
Short-Term Density Forecasting of Low-Voltage Load using Bernstein-Polynomial Normalizing FlowsCode2
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction TuningCode2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the WildCode2
Natural Language Reinforcement LearningCode2
CryptoPulse: Short-Term Cryptocurrency Forecasting with Dual-Prediction and Cross-Correlated Market IndicatorsCode1
Curating a COVID-19 data repository and forecasting county-level death counts in the United StatesCode1
COVIDNet-CT: A Tailored Deep Convolutional Neural Network Design for Detection of COVID-19 Cases from Chest CT ImagesCode1
COVID-Net CT-2: Enhanced Deep Neural Networks for Detection of COVID-19 from Chest CT Images Through Bigger, More Diverse LearningCode1
Cryptocurrency Portfolio Management with Deep Reinforcement LearningCode1
Curriculum-based Reinforcement Learning for Distribution System Critical Load RestorationCode1
Counterfactual Explainable RecommendationCode1
ContrXT: Generating Contrastive Explanations from any Text ClassifierCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
Continuation Path Learning for Homotopy OptimizationCode1
Conformal Time-series ForecastingCode1
Constraint-Adaptive Policy Switching for Offline Safe Reinforcement LearningCode1
CUTS+: High-dimensional Causal Discovery from Irregular Time-seriesCode1
Conditioning Sparse Variational Gaussian Processes for Online Decision-makingCode1
Conditional-Flow NeRF: Accurate 3D Modelling with Reliable Uncertainty QuantificationCode1
Conformal Alignment: Knowing When to Trust Foundation Models with GuaranteesCode1
ComTraQ-MPC: Meta-Trained DQN-MPC Integration for Trajectory Tracking with Limited Active Localization UpdatesCode1
ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource AllocationCode1
Concept-level Debugging of Part-Prototype NetworksCode1
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Controlling Neural Networks with Rule RepresentationsCode1
Conformal Inference of Counterfactuals and Individual Treatment EffectsCode1
CoSense3D: an Agent-based Efficient Learning Framework for Collective PerceptionCode1
Abstracting Deep Neural Networks into Concept Graphs for Concept Level InterpretabilityCode1
COVID-CXNet: Detecting COVID-19 in Frontal Chest X-ray Images using Deep LearningCode1
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
Modelling uncertainty in coupled electricity and gas systems -- is it worth the effort?Code1
Show:102550
← PrevPage 5 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified