| Epistemic Exploration for Generalizable Planning and Learning in Non-Stationary Settings | Feb 13, 2024 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate | Sep 7, 2023 | Decision MakingFairness | CodeCode Available | 0 |
| Agent-Specific Effects: A Causal Effect Propagation Analysis in Multi-Agent MDPs | Oct 17, 2023 | counterfactualDecision Making | CodeCode Available | 0 |
| Did we personalize? Assessing personalization by an online reinforcement learning algorithm using resampling | Apr 11, 2023 | Decision MakingReinforcement Learning (RL) | CodeCode Available | 0 |
| Learning to Reach Goals via Diffusion | Oct 4, 2023 | Computational EfficiencyDecision Making | CodeCode Available | 0 |
| Online Decision Making with History-Average Dependent Costs (Extended) | Dec 11, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Evolutionary Multi-Armed Bandits with Genetic Thompson Sampling | Apr 26, 2022 | Decision MakingEvolutionary Algorithms | CodeCode Available | 0 |
| Online Learning with Costly Features in Non-stationary Environments | Jul 18, 2023 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| SOPE: Spectrum of Off-Policy Estimators | Nov 6, 2021 | Decision MakingOff-policy evaluation | CodeCode Available | 0 |
| Explainable Knowledge Graph Embedding: Inference Reconciliation for Knowledge Inferences Supporting Robot Actions | May 4, 2022 | Decision MakingGraph Embedding | CodeCode Available | 0 |
| Learning to Solve the Min-Max Mixed-Shelves Picker-Routing Problem via Hierarchical and Parallel Decoding | Feb 14, 2025 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation | May 20, 2025 | Computational Efficiencycontinuous-control | CodeCode Available | 0 |
| Sound Heuristic Search Value Iteration for Undiscounted POMDPs with Reachability Objectives | Jun 5, 2024 | Decision MakingEfficient Exploration | CodeCode Available | 0 |
| Learning Versatile Skills with Curriculum Masking | Oct 23, 2024 | Decision MakingOffline RL | CodeCode Available | 0 |
| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning | May 29, 2020 | Autonomous VehiclesBoard Games | CodeCode Available | 0 |
| TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration | Nov 26, 2023 | Decision MakingDecoder | CodeCode Available | 0 |
| Lifelong Learning with a Changing Action Set | Jun 5, 2019 | Decision MakingLifelong learning | CodeCode Available | 0 |
| TextAtari: 100K Frames Game Playing with Language Agents | Jun 4, 2025 | Atari GamesDecision Making | CodeCode Available | 0 |
| A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Mar 18, 2025 | Decision MakingSequential Decision Making | CodeCode Available | 0 |
| Adversarially Robust Decision Transformer | Jul 25, 2024 | Adversarial RobustnessSequential Decision Making | CodeCode Available | 0 |
| Exploring the Inquiry-Diagnosis Relationship with Advanced Patient Simulators | Jan 16, 2025 | DiagnosticSequential Decision Making | CodeCode Available | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate | Nov 13, 2024 | Decision MakingGPU | CodeCode Available | 0 |
| Universal Off-Policy Evaluation | Apr 26, 2021 | counterfactualDecision Making | CodeCode Available | 0 |