SOTAVerified

Decision Making

Papers

Showing 951-1000 of 12311 papers

Title | Status | Hype
Scaling Pareto-Efficient Decision Making Via Offline Multi-Objective RL | Code | 1
ScheduleNet: Learn to solve multi-agent scheduling problems with reinforcement learning | Code | 1
False Correlation Reduction for Offline Reinforcement Learning | Code | 1
SCOUTER: Slot Attention-based Classifier for Explainable Image Recognition | Code | 1
Augmenting Reinforcement Learning with Transformer-based Scene Representation Learning for Decision-making of Autonomous Driving | Code | 1
Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Code | 1
Selection by Prediction with Conformal p-values | Code | 1
Self-Calibrating Conformal Prediction | Code | 1
Sequential Voting with Relational Box Fields for Active Object Detection | Code | 1
Sequential Planning in Large Partially Observable Environments guided by LLMs | Code | 1
Bayesian SegNet: Model Uncertainty in Deep Convolutional Encoder-Decoder Architectures for Scene Understanding | Code | 1
A Survey on Session-based Recommender Systems | Code | 1
Simplified Temporal Consistency Reinforcement Learning | Code | 1
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation | Code | 1
Skillful Precipitation Nowcasting using Deep Generative Models of Radar | Code | 1
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills | Code | 1
Adversarial Attacks on Probabilistic Autoregressive Forecasting Models | Code | 1
SMART: A Decision-Making Framework with Multi-modality Fusion for Autonomous Driving Based on Reinforcement Learning | Code | 1
A SWAT-based Reinforcement Learning Framework for Crop Management | Code | 1
A Survey on Interpretable Cross-modal Reasoning | Code | 1
An Empirical Characterization of Fair Machine Learning For Clinical Risk Prediction | Code | 1
An empirical evaluation of active inference in multi-armed bandits | Code | 1
Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor | Code | 1
SSL-SoilNet: A Hybrid Transformer-based Framework with Self-Supervised Learning for Large-scale Soil Organic Carbon Prediction | Code | 1
Spatio-Temporal-Categorical Graph Neural Networks for Fine-Grained Multi-Incident Co-Prediction | Code | 1
A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulation | Code | 1
CAVE: Cerebral Artery-Vein Segmentation in Digital Subtraction Angiography | Code | 1
Split Q Learning: Reinforcement Learning with Two-Stream Rewards | Code | 1
STeCa: Step-level Trajectory Calibration for LLM Agent Learning | Code | 1
An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Code | 1
A survey on datasets for fairness-aware machine learning | Code | 1
ECG-Image-Kit: A Synthetic Image Generation Toolbox to Facilitate Deep Learning-Based Electrocardiogram Digitization | Code | 1
A Survey of Medical Vision-and-Language Applications and Their Techniques | Code | 1
Super-resolution Probabilistic Rain Prediction from Satellite Data Using 3D U-Nets and EarthFormers | Code | 1
survex: an R package for explaining machine learning survival models | Code | 1
Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4 | Code | 1
Adversarial Robustness of Representation Learning for Knowledge Graphs | Code | 1
Sym-Q: Adaptive Symbolic Regression via Sequential Decision-Making | Code | 1
Synthesizing Event-centric Knowledge Graphs of Daily Activities Using Virtual Space | Code | 1
TabFairGAN: Fair Tabular Data Generation with Generative Adversarial Networks | Code | 1
Abstracting Deep Neural Networks into Concept Graphs for Concept Level Interpretability | Code | 1
TAPAS: a Toolbox for Adversarial Privacy Auditing of Synthetic Data | Code | 1
Targeted-BEHRT: Deep learning for observational causal inference on longitudinal electronic health records | Code | 1
Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes | Code | 1
AdViCE: Aggregated Visual Counterfactual Explanations for Machine Learning Model Validation | Code | 1
TE2Rules: Explaining Tree Ensembles using Rules | Code | 1
TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection | Code | 1
TELLER: A Trustworthy Framework for Explainable, Generalizable and Controllable Fake News Detection | Code | 1
Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines | Code | 1
A Study of Situational Reasoning for Traffic Understanding | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified