| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future prediction, Large Language Model | Unverified | 0 |
| Distributed Poisson multi-Bernoulli filtering via generalised covariance intersection | Jun 23, 2025 | Future prediction | Unverified | 0 |
| DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos | Jun 11, 2025 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning | Jun 9, 2025 | Future prediction, Question Answering | Code Available | 0 |
| LETS Forecast: Learning Embedology for Time Series Forecasting | Jun 6, 2025 | Future prediction, Time Series | Code Available | 1 |
| Are Statistical Methods Obsolete in the Era of Deep Learning? | May 27, 2025 | Deep Learning, Epidemiology | Unverified | 0 |
| ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos | May 24, 2025 | Action Generation, Autonomous Driving | Unverified | 0 |
| Learning from Streaming Video with Orthogonal Gradients | Apr 2, 2025 | Future prediction, Representation Learning | Unverified | 0 |
| AdaWorld: Learning Adaptable World Models with Latent Actions | Mar 24, 2025 | Future prediction | Code Available | 3 |
| PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos | Mar 23, 2025 | 4D reconstruction, Deformable Object Manipulation | Code Available | 3 |
| Seeing the Future, Perceiving the Future: A Unified Driving World Model for Future Generation and Perception | Mar 17, 2025 | Future prediction, Scene Generation | Code Available | 2 |
| FluidNexus: 3D Fluid Reconstruction and Prediction from a Single Video | Mar 6, 2025 | Future prediction, Novel View Synthesis | Unverified | 0 |
| A Survey of World Models for Autonomous Driving | Jan 20, 2025 | Anomaly Detection, Autonomous Driving | Code Available | 1 |
| Advancing Semantic Future Prediction through Multimodal Visual Sequence Transformers | Jan 14, 2025 | Future prediction, Prediction | Code Available | 1 |
| Navigating Tomorrow: Reliably Assessing Large Language Models Performance on Future Event Prediction | Jan 10, 2025 | Articles, counterfactual | Unverified | 0 |
| Back To The Future: A Hybrid Transformer-XGBoost Model for Action-oriented Future-proofing Nowcasting | Dec 21, 2024 | Future prediction | Unverified | 0 |
| A Novel Machine Learning Classifier Based on Genetic Algorithms and Data Importance Reformatting | Dec 17, 2024 | Future prediction | Unverified | 0 |
| Imagine-2-Drive: Leveraging High-Fidelity World Models via Multi-Modal Diffusion Policies | Nov 15, 2024 | Autonomous Driving, Autonomous Vehicles | Unverified | 0 |
| TrajAgent: An Agent Framework for Unified Trajectory Modelling | Oct 27, 2024 | Future prediction, Language Modeling | Code Available | 1 |
| From Cognition to Precognition: A Future-Aware Framework for Social Navigation | Sep 20, 2024 | Future prediction, Navigate | Code Available | 2 |
| The 2023/24 VIEWS Prediction Challenge: Predicting the Number of Fatalities in Armed Conflict, with Uncertainty | Jul 8, 2024 | Future prediction, Prediction | Unverified | 0 |
| Latent Intuitive Physics: Learning to Transfer Hidden Physics from A 3D Video | Jun 18, 2024 | Future prediction, Transfer Learning | Unverified | 0 |
| MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos | Jun 12, 2024 | counterfactual, Future prediction | Code Available | 1 |
| Identifying latent state transition in non-linear dynamical systems | Jun 5, 2024 | Future prediction, Representation Learning | Unverified | 0 |
| Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering | Jun 2, 2024 | counterfactual, Counterfactual Reasoning | Code Available | 1 |