| MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Jul 2, 2024 | AttributeImage Generation | CodeCode Available | 4 | 5 |
| GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction | May 27, 2024 | 3D Semantic Occupancy PredictionAutonomous Driving | CodeCode Available | 4 | 5 |
| ElasTST: Towards Robust Varied-Horizon Forecasting with Elastic Time-Series Transformer | Nov 4, 2024 | PositionTime Series | CodeCode Available | 3 | 5 |
| Rotary Position Embedding for Vision Transformer | Mar 20, 2024 | Position | CodeCode Available | 3 | 5 |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Jul 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 3 | 5 |
| RoFormer: Enhanced Transformer with Rotary Position Embedding | Apr 20, 2021 | PositionSemantic Text Matching | CodeCode Available | 3 | 5 |
| Scaling Diffusion Transformers to 16 Billion Parameters | Jul 16, 2024 | AttributeConditional Image Generation | CodeCode Available | 3 | 5 |
| PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Jun 2, 2022 | 3D Lane Detection3D Object Detection | CodeCode Available | 3 | 5 |
| PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Mar 10, 2022 | 3D Object DetectionObject | CodeCode Available | 3 | 5 |
| Position: Graph Foundation Models are Already Here | Feb 3, 2024 | Position | CodeCode Available | 3 | 5 |