| MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos | Jun 12, 2024 | counterfactualFuture prediction | CodeCode Available | 1 |
| MultiPath++: Efficient Information Fusion and Trajectory Aggregation for Behavior Prediction | Nov 29, 2021 | Autonomous DrivingFuture prediction | CodeCode Available | 1 |
| EgoTaskQA: Understanding Human Tasks in Egocentric Videos | Oct 8, 2022 | Action Localizationcounterfactual | CodeCode Available | 1 |
| FIERY: Future Instance Prediction in Bird's-Eye View from Surround Monocular Cameras | Apr 21, 2021 | Autonomous DrivingBird's-Eye View Semantic Segmentation | CodeCode Available | 1 |
| PiP: Planning-informed Trajectory Prediction for Autonomous Driving | Mar 25, 2020 | Autonomous DrivingFuture prediction | CodeCode Available | 1 |
| Preserving Dynamic Attention for Long-Term Spatial-Temporal Prediction | Jun 16, 2020 | Future predictionPrediction | CodeCode Available | 1 |
| HelmFluid: Learning Helmholtz Dynamics for Interpretable Fluid Prediction | Oct 16, 2023 | Future predictionPrediction | CodeCode Available | 1 |
| Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering | Jun 2, 2024 | counterfactualCounterfactual Reasoning | CodeCode Available | 1 |
| Peeking into the Future: Predicting Future Person Activities and Locations in Videos | Feb 11, 2019 | Activity PredictionFuture prediction | CodeCode Available | 1 |
| TrajAgent: An Agent Framework for Unified Trajectory Modelling | Oct 27, 2024 | Future predictionLanguage Modeling | CodeCode Available | 1 |