| Sapiens: Foundation for Human Vision Models | Aug 22, 2024 | 2D Human Pose Estimation, 2D Pose Estimation | Code Available | 9 | 5 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth Estimation, Surface Normal Estimation | Code Available | 7 | 5 |
| Rethinking Inductive Biases for Surface Normal Estimation | Mar 1, 2024 | Surface Normal Estimation | Code Available | 4 | 5 |
| Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Sep 17, 2024 | Conditional Image Generation, Depth Estimation | Code Available | 4 | 5 |
| What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? | Mar 10, 2024 | Depth Estimation, Image Matting | Code Available | 3 | 5 |
| iDisc: Internal Discretization for Monocular Depth Estimation | Apr 13, 2023 | Autonomous Driving, Depth Estimation | Code Available | 3 | 5 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography Estimation, Surface Normal Estimation | Code Available | 2 | 5 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary Detection, Human Parsing | Code Available | 2 | 5 |
| Egocentric Scene Understanding via Multimodal Spatial Rectifier | Jul 14, 2022 | Scene Understanding, Surface Normal Estimation | Code Available | 1 | 5 |
| Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation | Sep 20, 2021 | Decoder, Prediction | Code Available | 1 | 5 |