| Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble | Oct 4, 2021 | Adroit door-cloned, Adroit door-human | Code Available | 1 |
| Reinforcement Learning from Passive Data via Latent Intentions | Apr 10, 2023 | reinforcement-learning, Reinforcement Learning | Code Available | 1 |
| CompilerDream: Learning a Compiler World Model for General Code Optimization | Apr 24, 2024 | Diversity, Model-based Reinforcement Learning | Code Available | 1 |
| WorldValuesBench: A Large-Scale Benchmark Dataset for Multi-Cultural Value Awareness of Language Models | Apr 25, 2024 | Value prediction | Code Available | 1 |
| Learning, Fast and Slow: A Goal-Directed Memory-Based Approach for Dynamic Environments | Jan 31, 2023 | Reinforcement Learning (RL), Retrieval | Code Available | 1 |
| ExtremeCast: Boosting Extreme Value Prediction for Global Weather Forecast | Feb 2, 2024 | Prediction, Value prediction | Code Available | 1 |
| PIVEN: A Deep Neural Network for Prediction Intervals with Specific Value Prediction | Jun 9, 2020 | Prediction, Prediction Intervals | Code Available | 1 |
| Uncertainty-Aware Probabilistic Graph Neural Networks for Road-Level Traffic Accident Prediction | Sep 10, 2023 | Graph Neural Network, Prediction | Code Available | 1 |
| A Multi-Granularity-Aware Aspect Learning Model for Multi-Aspect Dense Retrieval | Dec 5, 2023 | Language Modelling, Retrieval | Code Available | 1 |
| DATE: Dual Attentive Tree-aware Embedding for Customs Fraud Detection | Aug 23, 2020 | Fraud Detection, Multi-target regression | Code Available | 1 |
| Spatial Action Maps for Mobile Manipulation | Apr 20, 2020 | Q-Learning, Value prediction | Code Available | 1 |
| Billion-user Customer Lifetime Value Prediction: An Industrial-scale Solution from Kuaishou | Aug 29, 2022 | Value prediction | Unverified | 0 |
| An Optimal Tightness Bound for the Simulation Lemma | Jun 24, 2024 | LEMMA, Value prediction | Unverified | 0 |
| Biology-inspired joint distribution neurons based on Hierarchical Correlation Reconstruction allowing for multidirectional neural networks | May 8, 2024 | Tensor Decomposition, Value prediction | Unverified | 0 |
| Avoiding Confusion between Predictors and Inhibitors in Value Function Approximation | Dec 19, 2013 | Decision Making, Reinforcement Learning | Unverified | 0 |
| DiffSTOCK: Probabilistic relational Stock Market Predictions using Diffusion Models | Mar 21, 2024 | Denoising, Management | Unverified | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | Diagnostic, MuJoCo | Unverified | 0 |
| AIRI: Predicting Retention Indices and their Uncertainties using Artificial Intelligence | Jan 3, 2024 | Value prediction | Unverified | 0 |
| Auction Design using Value Prediction with Hallucinations | Feb 12, 2025 | Prediction, Value prediction | Unverified | 0 |
| Customer Lifetime Value Prediction with Uncertainty Estimation Using Monte Carlo Dropout | Nov 24, 2024 | Value prediction | Unverified | 0 |
| Customer Lifetime Value Prediction Using Embeddings | Mar 7, 2017 | Marketing, Prediction | Unverified | 0 |
| Deep Structural Knowledge Exploitation and Synergy for Estimating Node Importance Value on Heterogeneous Information Networks | Feb 19, 2024 | Informativeness, Value prediction | Unverified | 0 |
| Attentive Continuous Generative Self-training for Unsupervised Domain Adaptive Medical Image Translation | May 23, 2023 | Domain Adaptation, Pseudo Label | Unverified | 0 |
| Digital Twin Synchronization: Bridging the Sim-RL Agent to a Real-Time Robotic Additive Manufacturing Control | Jan 29, 2025 | Deep Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| A Meta-learning based Stacked Regression Approach for Customer Lifetime Value Prediction | Aug 7, 2023 | Meta-Learning, regression | Unverified | 0 |