| RTBAgent: A LLM-based Agent System for Real-Time Bidding | Feb 2, 2025 | Decision Making | CodeCode Available | 1 |
| Vintix: Action Model via In-Context Reinforcement Learning | Jan 31, 2025 | Decision MakingIn-Context Reinforcement Learning | CodeCode Available | 1 |
| Harnessing Diverse Perspectives: A Multi-Agent Framework for Enhanced Error Detection in Knowledge Graphs | Jan 27, 2025 | Decision MakingKnowledge Graphs | CodeCode Available | 1 |
| A Survey of World Models for Autonomous Driving | Jan 20, 2025 | Anomaly DetectionAutonomous Driving | CodeCode Available | 1 |
| MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking | Jan 20, 2025 | Decision MakingGSM8K | CodeCode Available | 1 |
| NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes | Jan 16, 2025 | Decision Making | CodeCode Available | 1 |
| O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning | Jan 11, 2025 | Decision MakingDiagnostic | CodeCode Available | 1 |
| ICFNet: Integrated Cross-modal Fusion Network for Survival Prediction | Jan 6, 2025 | Decision MakingSurvival Prediction | CodeCode Available | 1 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| MIRAGE: Exploring How Large Language Models Perform in Complex Social Interactive Environments | Jan 3, 2025 | Decision Making | CodeCode Available | 1 |