| StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation | May 2, 2024 | motion predictionStory Generation | CodeCode Available | 9 |
| CoTracker: It is Better to Track Together | Jul 14, 2023 | GPUmotion prediction | CodeCode Available | 4 |
| SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation | May 30, 2024 | AttributeAutonomous Driving | CodeCode Available | 4 |
| End-to-end Autonomous Driving: Challenges and Frontiers | Jun 29, 2023 | Autonomous Drivingmotion prediction | CodeCode Available | 4 |
| GenAD: Generative End-to-End Autonomous Driving | Feb 18, 2024 | Autonomous DrivingBench2Drive | CodeCode Available | 3 |
| SIMPL: A Simple and Efficient Multi-agent Motion Prediction Baseline for Autonomous Driving | Feb 4, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 3 |
| MotionGPT: Human Motion as a Foreign Language | Jun 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer | Apr 4, 2024 | motion predictionNeRF | CodeCode Available | 2 |
| Open-Source Ground-based Sky Image Datasets for Very Short-term Solar Forecasting, Cloud Analysis and Modeling: A Comprehensive Survey | Nov 27, 2022 | motion prediction | CodeCode Available | 2 |
| Shifts 2.0: Extending The Dataset of Real Distributional Shifts | Jun 30, 2022 | Autonomous Drivingimage-classification | CodeCode Available | 2 |
| MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge -- Motion Prediction | Sep 20, 2022 | motion predictionPrediction | CodeCode Available | 2 |
| MTR++: Multi-Agent Motion Prediction with Symmetric Scene Modeling and Guided Intention Querying | Jun 30, 2023 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| Query-Centric Trajectory Prediction | Jan 1, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 2 |
| Robust Motion In-betweening | Feb 9, 2021 | Human Pose Forecastingmotion in-betweening | CodeCode Available | 2 |
| Large Trajectory Models are Scalable Motion Predictors and Planners | Oct 30, 2023 | Autonomous DrivingLanguage Modeling | CodeCode Available | 2 |
| Joint Perception and Prediction for Autonomous Driving: A Survey | Dec 18, 2024 | Autonomous Drivingmotion prediction | CodeCode Available | 2 |
| Motion Transformer with Global Intention Localization and Local Movement Refinement | Sep 27, 2022 | motion predictionPrediction | CodeCode Available | 2 |
| FlexCloud: Direct, Modular Georeferencing and Drift-Correction of Point Cloud Maps | Feb 1, 2025 | Autonomous Drivingmotion prediction | CodeCode Available | 2 |
| BEVerse: Unified Perception and Prediction in Birds-Eye-View for Vision-Centric Autonomous Driving | May 19, 2022 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| HumanMAC: Masked Motion Completion for Human Motion Prediction | Feb 7, 2023 | DenoisingHuman motion prediction | CodeCode Available | 2 |
| GPD-1: Generative Pre-training for Driving | Dec 11, 2024 | Autonomous DrivingDecision Making | CodeCode Available | 2 |
| BEVWorld: A Multimodal World Model for Autonomous Driving via Unified BEV Latent Space | Jul 8, 2024 | Autonomous DrivingDecoder | CodeCode Available | 2 |
| DiffBEV: Conditional Diffusion Model for Bird's Eye View Perception | Mar 15, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| DriveMLLM: A Benchmark for Spatial Understanding with Multimodal Large Language Models in Autonomous Driving | Nov 20, 2024 | Autonomous Drivingmotion prediction | CodeCode Available | 2 |
| ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Oct 12, 2024 | motion predictionPose Tracking | CodeCode Available | 2 |