| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | DiagnosticMuJoCo | —Unverified | 0 |
| Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error | Jan 28, 2022 | Value prediction | —Unverified | 0 |
| CoRGi: Content-Rich Graph Neural Networks with Attention | Oct 10, 2021 | ImputationValue prediction | —Unverified | 0 |
| X-model: Improving Data Efficiency in Deep Learning with A Minimax Model | Oct 9, 2021 | Age Estimationmodel | —Unverified | 0 |
| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroid door-clonedAdroid door-human | CodeCode Available | 1 |
| Why Should I Trust You, Bellman? Evaluating the Bellman Objective with Off-Policy Data | Sep 29, 2021 | Deep Reinforcement LearningOff-policy evaluation | —Unverified | 0 |
| Understanding and Leveraging Overparameterization in Recursive Value Estimation | Sep 29, 2021 | Reinforcement Learning (RL)Value prediction | —Unverified | 0 |
| On the Estimation Bias in Double Q-Learning | Sep 29, 2021 | Q-LearningValue prediction | CodeCode Available | 0 |
| Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis | Jun 23, 2021 | Domain AdaptationImage Generation | —Unverified | 0 |
| RCURRENCY: Live Digital Asset Trading Using a Recurrent Neural Network-based Forecasting System | Jun 13, 2021 | Value prediction | —Unverified | 0 |