| Sapiens: Foundation for Human Vision Models | Aug 22, 2024 | 2D Human Pose Estimation, 2D Pose Estimation | Code Available | 9 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth Estimation, Surface Normal Estimation | Code Available | 7 |
| Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Sep 17, 2024 | Conditional Image Generation, Depth Estimation | Code Available | 4 |
| Rethinking Inductive Biases for Surface Normal Estimation | Mar 1, 2024 | Surface Normal Estimation | Code Available | 4 |
| What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? | Mar 10, 2024 | Depth Estimation, Image Matting | Code Available | 3 |
| iDisc: Internal Discretization for Monocular Depth Estimation | Apr 13, 2023 | Autonomous Driving, Depth Estimation | Code Available | 3 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography Estimation, Surface Normal Estimation | Code Available | 2 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary Detection, Human Parsing | Code Available | 2 |
| LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Mar 11, 2025 | Autonomous Driving, Domain Adaptation | Code Available | 1 |
| TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding | Nov 6, 2023 | Boundary Detection, Depth Estimation | Code Available | 1 |
| FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data | Oct 27, 2023 | Outlier Detection, Surface Normal Estimation | Code Available | 1 |
| Prompt Guided Transformer for Multi-Task Dense Prediction | Jul 28, 2023 | Boundary Detection, Decoder | Code Available | 1 |
| D2NT: A High-Performing Depth-to-Normal Translator | Apr 24, 2023 | Surface Normal Estimation, Vocal Bursts Intensity Prediction | Code Available | 1 |
| High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF | Oct 21, 2022 | RGB-D Reconstruction, Surface Normal Estimation | Code Available | 1 |
| Perspective Phase Angle Model for Polarimetric 3D Reconstruction | Jul 20, 2022 | 3D Reconstruction, Surface Normal Estimation | Code Available | 1 |
| Egocentric Scene Understanding via Multimodal Spatial Rectifier | Jul 14, 2022 | Scene Understanding, Surface Normal Estimation | Code Available | 1 |
| DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning | Jun 17, 2022 | 2D Semantic Segmentation, Depth Estimation | Code Available | 1 |
| MulT: An End-to-End Multitask Learning Transformer | May 17, 2022 | Decoder, Depth Estimation | Code Available | 1 |
| GRIT: General Robust Image Task Benchmark | Apr 28, 2022 | Instance Segmentation, Keypoint Detection | Code Available | 1 |
| Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans | Oct 11, 2021 | Depth Estimation, Surface Normal Estimation | Code Available | 1 |
| Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation | Sep 20, 2021 | Decoder, Prediction | Code Available | 1 |
| Human Pose and Shape Estimation from Single Polarization Images | Aug 15, 2021 | Surface Normal Estimation | Code Available | 1 |
| Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction | Mar 22, 2021 | Decoder, Depth Estimation | Code Available | 1 |
| GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation | Dec 13, 2020 | 3D Reconstruction, Depth Estimation | Code Available | 1 |
| HoliCity: A City-Scale Data Platform for Learning Holistic 3D Structures | Aug 7, 2020 | Surface Normal Estimation | Code Available | 1 |