| Mamba4D: Efficient 4D Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | Jan 1, 2025 | Action RecognitionAction Segmentation | CodeCode Available | 1 | 5 |
| Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos | Aug 18, 2023 | point cloud video understandingSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| Point Primitive Transformer for Long-Term 4D Point Cloud Video Understanding | Jul 30, 2022 | point cloud video understandingVideo Understanding | CodeCode Available | 1 | 5 |
| X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos through Cross-modal Knowledge Transfer | Dec 12, 2023 | Action RecognitionAction Segmentation | CodeCode Available | 0 | 5 |
| MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models | May 23, 2024 | Action RecognitionAction Segmentation | —Unverified | 0 | 0 |
| A Unified Framework for Human-centric Point Cloud Video Understanding | Mar 29, 2024 | 3D Pose EstimationAction Recognition | —Unverified | 0 | 0 |
| CrossVideo: Self-supervised Cross-modal Contrastive Learning for Point Cloud Video Understanding | Jan 17, 2024 | Contrastive Learningpoint cloud video understanding | —Unverified | 0 | 0 |
| Adapting Pre-trained 3D Models for Point Cloud Video Understanding via Cross-frame Spatio-temporal Perception | Jan 1, 2025 | Autonomous DrivingGesture Recognition | —Unverified | 0 | 0 |