| Sapiens: Foundation for Human Vision Models | Aug 22, 2024 | 2D Human Pose Estimation2D Pose Estimation | CodeCode Available | 9 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth EstimationSurface Normal Estimation | CodeCode Available | 7 |
| Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think | Sep 17, 2024 | Conditional Image GenerationDepth Estimation | CodeCode Available | 4 |
| Rethinking Inductive Biases for Surface Normal Estimation | Mar 1, 2024 | Surface Normal Estimation | CodeCode Available | 4 |
| What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? | Mar 10, 2024 | Depth EstimationImage Matting | CodeCode Available | 3 |
| iDisc: Internal Discretization for Monocular Depth Estimation | Apr 13, 2023 | Autonomous DrivingDepth Estimation | CodeCode Available | 3 |
| A Large Scale Homography Benchmark | Feb 20, 2023 | Homography EstimationSurface Normal Estimation | CodeCode Available | 2 |
| InvPT: Inverted Pyramid Multi-task Transformer for Dense Scene Understanding | Mar 15, 2022 | Boundary DetectionHuman Parsing | CodeCode Available | 2 |
| LiSu: A Dataset and Method for LiDAR Surface Normal Estimation | Mar 11, 2025 | Autonomous DrivingDomain Adaptation | CodeCode Available | 1 |
| TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding | Nov 6, 2023 | Boundary DetectionDepth Estimation | CodeCode Available | 1 |
| FOUND: Foot Optimization with Uncertain Normals for Surface Deformation Using Synthetic Data | Oct 27, 2023 | Outlier DetectionSurface Normal Estimation | CodeCode Available | 1 |
| Prompt Guided Transformer for Multi-Task Dense Prediction | Jul 28, 2023 | Boundary DetectionDecoder | CodeCode Available | 1 |
| D2NT: A High-Performing Depth-to-Normal Translator | Apr 24, 2023 | Surface Normal EstimationVocal Bursts Intensity Prediction | CodeCode Available | 1 |
| High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF | Oct 21, 2022 | RGB-D ReconstructionSurface Normal Estimation | CodeCode Available | 1 |
| Perspective Phase Angle Model for Polarimetric 3D Reconstruction | Jul 20, 2022 | 3D ReconstructionSurface Normal Estimation | CodeCode Available | 1 |
| Egocentric Scene Understanding via Multimodal Spatial Rectifier | Jul 14, 2022 | Scene UnderstandingSurface Normal Estimation | CodeCode Available | 1 |
| DenseMTL: Cross-task Attention Mechanism for Dense Multi-task Learning | Jun 17, 2022 | 2D Semantic SegmentationDepth Estimation | CodeCode Available | 1 |
| MulT: An End-to-End Multitask Learning Transformer | May 17, 2022 | DecoderDepth Estimation | CodeCode Available | 1 |
| GRIT: General Robust Image Task Benchmark | Apr 28, 2022 | Instance SegmentationKeypoint Detection | CodeCode Available | 1 |
| Omnidata: A Scalable Pipeline for Making Multi-Task Mid-Level Vision Datasets from 3D Scans | Oct 11, 2021 | Depth EstimationSurface Normal Estimation | CodeCode Available | 1 |
| Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation | Sep 20, 2021 | DecoderPrediction | CodeCode Available | 1 |
| Human Pose and Shape Estimation from Single Polarization Images | Aug 15, 2021 | Surface Normal Estimation | CodeCode Available | 1 |
| Transformer-Based Attention Networks for Continuous Pixel-Wise Prediction | Mar 22, 2021 | DecoderDepth Estimation | CodeCode Available | 1 |
| GeoNet++: Iterative Geometric Neural Network with Edge-Aware Refinement for Joint Depth and Surface Normal Estimation | Dec 13, 2020 | 3D ReconstructionDepth Estimation | CodeCode Available | 1 |
| HoliCity: A City-Scale Data Platform for Learning Holistic 3D Structures | Aug 7, 2020 | Surface Normal Estimation | CodeCode Available | 1 |
| Surface Normal Estimation of Tilted Images via Spatial Rectifier | Jul 17, 2020 | Data AugmentationSurface Normal Estimation | CodeCode Available | 1 |
| SharinGAN: Combining Synthetic and Real Data for Unsupervised Geometry Estimation | Jun 7, 2020 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 1 |
| IRS: A Large Naturalistic Indoor Robotics Stereo Dataset to Train Deep Models for Disparity and Surface Normal Estimation | Dec 20, 2019 | Disparity EstimationScene Understanding | CodeCode Available | 1 |
| GeoNet: Geometric Neural Network for Joint Depth and Surface Normal Estimation | Jun 1, 2018 | Depth EstimationSurface Normal Estimation | CodeCode Available | 1 |
| Pixel-wise Attentional Gating for Parsimonious Pixel Labeling | May 3, 2018 | Boundary DetectionSemantic Segmentation | CodeCode Available | 1 |
| Probabilistic Online Event Downsampling | Jun 3, 2025 | object-detectionObject Detection | —Unverified | 0 |
| Robust and Real-time Surface Normal Estimation from Stereo Disparities using Affine Transformations | Apr 21, 2025 | GPUSurface Normal Estimation | —Unverified | 0 |
| NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors | Apr 15, 2025 | Surface Normal Estimation | —Unverified | 0 |
| Image Gradient-Aided Photometric Stereo Network | Dec 16, 2024 | regressionSurface Normal Estimation | —Unverified | 0 |
| Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images | Nov 4, 2024 | Multi-Task LearningScene Understanding | CodeCode Available | 0 |
| Elite360M: Efficient 360 Multi-task Learning via Bi-projection Fusion and Cross-task Collaboration | Aug 18, 2024 | 3D geometryERP | —Unverified | 0 |
| StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal | Jun 24, 2024 | Surface Normal EstimationSurface Reconstruction | —Unverified | 0 |
| Enabling Visual Recognition at Radio Frequency | May 29, 2024 | object-detectionObject Detection | —Unverified | 0 |
| PanoNormal: Monocular Indoor 360° Surface Normal Estimation | May 29, 2024 | Surface Normal Estimation | —Unverified | 0 |
| Region-aware Distribution Contrast: A Novel Approach to Multi-Task Partially Supervised Learning | Mar 15, 2024 | Depth EstimationSemantic Segmentation | —Unverified | 0 |
| Surface Normal Estimation with Transformers | Jan 11, 2024 | Surface Normal Estimation | —Unverified | 0 |
| Event-based Shape from Polarization with Spiking Neural Networks | Dec 26, 2023 | Surface Normal Estimation | —Unverified | 0 |
| RFTrans: Leveraging Refractive Flow of Transparent Objects for Surface Normal Estimation and Manipulation | Nov 21, 2023 | global-optimizationSurface Normal Estimation | —Unverified | 0 |
| PolyMaX: General Dense Prediction with Mask Transformer | Nov 9, 2023 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 0 |
| Decodable and Sample Invariant Continuous Object Encoder | Oct 31, 2023 | ObjectSurface Normal Estimation | CodeCode Available | 0 |
| CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement | Oct 21, 2023 | Depth Estimationimage-classification | —Unverified | 0 |
| Large-scale Monocular Depth Estimation in the Wild | Sep 18, 2023 | Depth EstimationDepth Prediction | —Unverified | 0 |
| TransNet: Transparent Object Manipulation Through Category-Level Pose Estimation | Jul 23, 2023 | Depth CompletionObject | —Unverified | 0 |
| Independent Component Alignment for Multi-Task Learning | May 30, 2023 | Depth EstimationInstance Segmentation | CodeCode Available | 0 |
| Pointersect: Neural Rendering with Cloud-Ray Intersection | Apr 24, 2023 | Inverse RenderingNeural Rendering | —Unverified | 0 |