| Temporal Aggregate Representations for Long-Range Video Understanding | Jun 1, 2020 | Action AnticipationAction Recognition | CodeCode Available | 1 |
| PiP: Planning-informed Trajectory Prediction for Autonomous Driving | Mar 25, 2020 | Autonomous DrivingFuture prediction | CodeCode Available | 1 |
| Peeking into the Future: Predicting Future Person Activities and Locations in Videos | Feb 11, 2019 | Activity PredictionFuture prediction | CodeCode Available | 1 |
| Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models | Jul 8, 2025 | Future predictionLarge Language Model | —Unverified | 0 |
| Distributed Poisson multi-Bernoulli filtering via generalised covariance intersection | Jun 23, 2025 | Future prediction | —Unverified | 0 |
| DySS: Dynamic Queries and State-Space Learning for Efficient 3D Object Detection from Multi-Camera Videos | Jun 11, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Looking Beyond Visible Cues: Implicit Video Question Answering via Dual-Clue Reasoning | Jun 9, 2025 | Future predictionQuestion Answering | CodeCode Available | 0 |
| Are Statistical Methods Obsolete in the Era of Deep Learning? | May 27, 2025 | Deep LearningEpidemiology | —Unverified | 0 |
| ProphetDWM: A Driving World Model for Rolling Out Future Actions and Videos | May 24, 2025 | Action GenerationAutonomous Driving | —Unverified | 0 |
| Learning from Streaming Video with Orthogonal Gradients | Apr 2, 2025 | Future predictionRepresentation Learning | —Unverified | 0 |