| Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning | Apr 20, 2023 | Decision MakingMulti-agent Reinforcement Learning | —Unverified | 0 |
| Inference of Utilities and Time Preference in Sequential Decision-Making | May 24, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Information Avoidance and Overvaluation in Sequential Decision Making under Epistemic Constraints | Jun 9, 2021 | Decision MakingManagement | —Unverified | 0 |
| Information Directed Sampling for Linear Partial Monitoring | Feb 25, 2020 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 |
| Information-Theoretic Safe Bayesian Optimization | Feb 23, 2024 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| InfraLib: Enabling Reinforcement Learning and Decision-Making for Large-Scale Infrastructure Management | Sep 5, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Integrated Sensing and Communications for Low-Altitude Economy: A Deep Reinforcement Learning Approach | Dec 5, 2024 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| Integrating Policy Summaries with Reward Decomposition for Explaining Reinforcement Learning Agents | Oct 21, 2022 | Decision Makingreinforcement-learning | —Unverified | 0 |
| Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection | Apr 25, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Interactions between dynamic team composition and coordination: An agent-based modeling approach | Jan 11, 2024 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration | Jun 4, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Interpretability in Action: Exploratory Analysis of VPT, a Minecraft Agent | Jul 16, 2024 | Decision MakingMinecraft | —Unverified | 0 |
| Intrinsically Motivated Hierarchical Policy Learning in Multi-objective Markov Decision Processes | Aug 18, 2023 | Decision MakingMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Invariant Lipschitz Bandits: A Side Observation Approach | Dec 14, 2022 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Inverse Design of Photonic Crystal Surface Emitting Lasers is a Sequence Modeling Problem | Mar 8, 2024 | Decision MakingReinforcement Learning (RL) | —Unverified | 0 |
| Inverse Policy Evaluation for Value-based Sequential Decision-making | Aug 26, 2020 | Decision MakingQ-Learning | —Unverified | 0 |
| Inverse-RLignment: Large Language Model Alignment from Demonstrations through Inverse Reinforcement Learning | May 24, 2024 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Investigating Order Effects in Multidimensional Relevance Judgment using Query Logs | Jul 14, 2018 | Decision MakingSequential Decision Making | —Unverified | 0 |
| Is Behavior Cloning All You Need? Understanding Horizon in Imitation Learning | Jul 20, 2024 | AllAutonomous Driving | —Unverified | 0 |
| Is Conditional Generative Modeling all you need for Decision-Making? | Nov 28, 2022 | AllDecision Making | —Unverified | 0 |
| Is Reinforcement Learning More Difficult Than Bandits? A Near-optimal Algorithm Escaping the Curse of Horizon | Sep 28, 2020 | Decision MakingMulti-Armed Bandits | —Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents | Aug 28, 2022 | Action GenerationCommon Sense Reasoning | —Unverified | 0 |
| Joint AP Probing and Scheduling: A Contextual Bandit Approach | Aug 6, 2021 | Decision MakingScheduling | —Unverified | 0 |
| Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control | Oct 17, 2023 | continuous-controlContinuous Control | —Unverified | 0 |