| A Diffusion-Based Framework for Occluded Object Movement | Apr 2, 2025 | ObjectWorld Knowledge | —Unverified | 0 |
| Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure Priors | Mar 26, 2025 | Depth EstimationWorld Knowledge | CodeCode Available | 1 |
| LLM-based Agent Simulation for Maternal Health Interventions: Uncertainty Estimation and Decision-focused Evaluation | Mar 25, 2025 | counterfactualDecision Making | CodeCode Available | 0 |
| Test-Time Reasoning Through Visual Human Preferences with VLMs and Soft Rewards | Mar 25, 2025 | World Knowledge | —Unverified | 0 |
| Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics | Mar 24, 2025 | Human-Object Interaction DetectionLanguage Modeling | —Unverified | 0 |
| Instructing the Architecture Search for Spatial-temporal Sequence Forecasting with LLM | Mar 23, 2025 | Neural Architecture SearchPrompt Engineering | —Unverified | 0 |
| A Study into Investigating Temporal Robustness of LLMs | Mar 21, 2025 | Question AnsweringWorld Knowledge | —Unverified | 0 |
| Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI | Mar 20, 2025 | World Knowledge | —Unverified | 0 |
| World Knowledge from AI Image Generation for Robot Control | Mar 20, 2025 | Image GenerationWorld Knowledge | —Unverified | 0 |
| JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | Mar 20, 2025 | Decision MakingImitation Learning | —Unverified | 0 |