| Sapiens: Foundation for Human Vision Models | Aug 22, 2024 | 2D Human Pose Estimation, 2D Pose Estimation | Code Available | 9 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth Estimation, Surface Normal Estimation | Code Available | 7 |
| Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Sep 17, 2024 | Conditional Image Generation, Depth Estimation | Code Available | 4 |
| Rethinking Inductive Biases for Surface Normal Estimation | Mar 1, 2024 | Surface Normal Estimation | Code Available | 4 |
| What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? | Mar 10, 2024 | Depth Estimation, Image Matting | Code Available | 3 |
| iDisc: Internal Discretization for Monocular Depth Estimation | Apr 13, 2023 | Autonomous Driving, Depth Estimation | Code Available | 3 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography Estimation, Surface Normal Estimation | Code Available | 2 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary Detection, Human Parsing | Code Available | 2 |
| LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Mar 11, 2025 | Autonomous Driving, Domain Adaptation | Code Available | 1 |
| TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding | Nov 6, 2023 | Boundary Detection, Depth Estimation | Code Available | 1 |