SOTAVerified

Depth Estimation

Depth Estimation is the task of estimating, for each pixel, the distance from the camera to the scene point it images. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods estimate depth directly by minimizing a regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated with a root-mean-square (RMS) error metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
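
For the stereo case, the multi-view geometry mentioned above reduces to a simple relation once the image pair is rectified: depth equals focal length times baseline divided by disparity. The following is a minimal sketch under that assumption (pinhole cameras, rectified pair). The function name `disparity_to_depth` and the numbers in the usage line are illustrative only and do not come from any dataset or model listed on this page.

```python
import math


def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Convert a stereo disparity to metric depth for a rectified pair.

    disparity_px: horizontal pixel shift between the left and right views
    focal_px:     focal length expressed in pixels (assumed equal for both cameras)
    baseline_m:   distance between the two camera centres, in metres
    """
    if disparity_px <= 0:
        # Zero disparity corresponds to a point at infinity;
        # negative values are treated as invalid matches here.
        return math.inf
    return focal_px * baseline_m / disparity_px


# Illustrative values: a 36 px disparity, 720 px focal length, 0.5 m baseline.
depth_m = disparity_to_depth(36.0, 720.0, 0.5)
print(depth_m)
```

Because depth is inversely proportional to disparity, a fixed matching error of one pixel costs far more accuracy for distant points than for near ones, which is one reason learned regression methods are attractive at long range.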

Papers

Showing 1551–1600 of 2454 papers

| Title | Status | Hype |
|---|---|---|
| HandsOff: Labeled Dataset Generation With No Additional Human Annotations | | 0 |
| Hand tracking for clinical applications: validation of the Google MediaPipe Hand (GMH) and the depth-enhanced GMH-D frameworks | | 0 |
| Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization | | 0 |
| HC-Search for Structured Prediction in Computer Vision | | 0 |
| HDPV-SLAM: Hybrid Depth-augmented Panoramic Visual SLAM for Mobile Mapping System with Tilted LiDAR and Panoramic Visual Camera | | 0 |
| HeadPosr: End-to-end Trainable Head Pose Estimation using Transformer Encoders | | 0 |
| Heterogeneous Light Fields | | 0 |
| Hidden in plain sight: VLMs overlook their visual representations | | 0 |
| Hierarchical Normalization for Robust Monocular Depth Estimation | | 0 |
| High-Accuracy Facial Depth Models derived from 3D Synthetic Data | | 0 |
| High-Accuracy RGB-D Face Recognition via Segmentation-Aware Face Depth Estimation and Mask-Guided Attention Network | | 0 |
| High-fidelity Endoscopic Image Synthesis by Utilizing Depth-guided Neural Surfaces | | 0 |
| High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior | | 0 |
| High Quality Structure From Small Motion for Rolling Shutter Cameras | | 0 |
| High-Resolution Depth Estimation for 360-degree Panoramas through Perspective and Panoramic Depth Images Registration | | 0 |
| High-Resolution Synthetic RGB-D Datasets for Monocular Depth Estimation | | 0 |
| Hi-Map: Hierarchical Factorized Radiance Field for High-Fidelity Monocular Dense Mapping | | 0 |
| HiMODE: A Hybrid Monocular Omnidirectional Depth Estimation Model | | 0 |
| HMOR: Hierarchical Multi-Person Ordinal Relations for Monocular Multi-Person 3D Pose Estimation | | 0 |
| H-Net: Unsupervised Attention-based Stereo Depth Estimation Leveraging Epipolar Geometry | | 0 |
| HoloDrive: Holistic 2D-3D Multi-Modal Street Scene Generation for Autonomous Driving | | 0 |
| How do Cross-View and Cross-Modal Alignment Affect Representations in Contrastive Learning? | | 0 |
| How do neural networks see depth in single images? | | 0 |
| How Much Depth Information can Radar Contribute to a Depth Estimation Model? | | 0 |
| How to deal with glare for improved perception of Autonomous Vehicles | | 0 |
| HRDFuse: Monocular 360deg Depth Estimation by Collaboratively Learning Holistic-With-Regional Depth Distributions | | 0 |
| HRDFuse: Monocular 360°Depth Estimation by Collaboratively Learning Holistic-with-Regional Depth Distributions | | 0 |
| HUSH: Holistic Panoramic 3D Scene Understanding using Spherical Harmonics | | 0 |
| Hybrid Light Field Imaging for Improved Spatial Resolution and Depth Range | | 0 |
| EndoPerfect: High-Accuracy Monocular Depth Estimation and 3D Reconstruction for Endoscopic Surgery via NeRF-Stereo Fusion | | 0 |
| Hybridnet for depth estimation and semantic segmentation | | 0 |
| Hybrid Skip: A Biologically Inspired Skip Connection for the UNet Architecture | | 0 |
| Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation | | 0 |
| I2P-Rec: Recognizing Images on Large-scale Point Cloud Maps through Bird's Eye View Projections | | 0 |
| IAFA: Instance-aware Feature Aggregation for 3D Object Detection from a Single Image | | 0 |
| ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo | | 0 |
| iConFormer: Dynamic Parameter-Efficient Tuning with Input-Conditioned Adaptation | | 0 |
| iELAS: An ELAS-Based Energy-Efficient Accelerator for Real-Time Stereo Matching on FPGA Platform | | 0 |
| IFAST: Weakly Supervised Interpretable Face Anti-spoofing from Single-shot Binocular NIR Images | | 0 |
| IGAF: Incremental Guided Attention Fusion for Depth Super-Resolution | | 0 |
| Image and Depth from a Single Defocused Image Using Coded Aperture Photography | | 0 |
| Image-to-Image Translation for Autonomous Driving from Coarsely-Aligned Image Pairs | | 0 |
| Implicit and Explicit Language Guidance for Diffusion-based Visual Perception | | 0 |
| Improved and efficient inter-vehicle distance estimation using road gradients of both ego and target vehicles | | 0 |
| Improved Monocular Depth Prediction Using Distance Transform Over Pre-semantic Contours with Self-supervised Neural Networks | | 0 |
| Improved Multiple-Image-Based Reflection Removal Algorithm Using Deep Neural Networks | | 0 |
| Improved Neural Radiance Fields Using Pseudo-depth and Fusion | | 0 |
| Improved Noise and Attack Robustness for Semantic Segmentation by Using Multi-Task Training with Self-Supervised Depth Estimation | | 0 |
| Improving 2D face recognition via fine-level facial depth generation and RGB-D complementary feature learning | | 0 |
| Improving 2D Feature Representations by 3D-Aware Fine-Tuning | | 0 |

Benchmark Results

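| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | OmniDepth | RMSE | 0.62 | | Unverified |
| 2 | SphereDepth | RMSE | 0.45 | | Unverified |
| 3 | Jin et al. | RMSE | 0.42 | | Unverified |
| 4 | BiFuse with fusion | RMSE | 0.41 | | Unverified |
| 5 | HoHoNet (ResNet-101) | RMSE | 0.38 | | Unverified |
| 6 | PanoDepth | RMSE | 0.37 | | Unverified |
| 7 | BiFuse++ | RMSE | 0.37 | | Unverified |
| 8 | UniFuse with fusion | RMSE | 0.37 | | Unverified |
| 9 | DisConv | RMSE | 0.37 | | Unverified |
| 10 | SliceNet | RMSE | 0.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | A2J | mAP | 8.61 | | Unverified |
| 2 | PAD-Net | RMS | 0.79 | | Unverified |
| 3 | MS-CRF | RMS | 0.59 | | Unverified |
| 4 | DORN | RMS | 0.51 | | Unverified |
| 5 | Freeform | RMS | 0.43 | | Unverified |
| 6 | Optimized, freeform | RMS | 0.43 | | Unverified |
| 7 | VNL | RMS | 0.42 | | Unverified |
| 8 | BTS | RMS | 0.41 | | Unverified |
| 9 | TransDepth (AGD+ ViT) | RMS | 0.37 | | Unverified |
| 10 | AdaBins | RMS | 0.36 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | T2Net | Abs Rel | 0.35 | | Unverified |
| 2 | MIDAS | Abs Rel | 0.31 | | Unverified |
| 3 | Bhattacharjee et al. | Abs Rel | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | T2Net | Abs Rel | 0.49 | | Unverified |
| 2 | MIDAS | Abs Rel | 0.42 | | Unverified |
| 3 | Bhattacharjee et al. | Abs Rel | 0.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LeReS | absolute relative error | 0.1 | | Unverified |
| 2 | DELTAS | absolute relative error | 0.09 | | Unverified |
| 3 | Distill Any Depth | absolute relative error | 0.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SDC-Depth | RMSE | 6.92 | | Unverified |
| 2 | SwinMTL | RMSE | 6.35 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AIP-Brown | Delta < 1.25 | 0.36 | | Unverified |
| 2 | LeRes | Delta < 1.25 | 0.23 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | H-Net (Ours) | Absolute relative error (AbsRel) | 0.09 | | Unverified |
| 2 | H-Net (Ours) Full Eigen | Absolute relative error (AbsRel) | 0.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GLPDepth | Delta < 1.25 | 0.43 | | Unverified |
| 2 | SRDINET (Model A) | Delta < 1.25 | 0.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Atlas (finetuned) | RMSE | 0.17 | | Unverified |
| 2 | Atlas (plain) | RMSE | 0.17 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LFattNet | BadPix(0.01) | 17.23 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | LightDepth | Number of parameters (M) | 42.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UniFuse | Abs Rel | 0.11 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | X-TC (Cross-Task Consistency) | L1 error | 1.63 | | Unverified |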
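
Most of the results above are reported as RMSE (RMS), Abs Rel, or Delta < 1.25. As a rough guide to reading them, here is a minimal sketch of how these three metrics are commonly computed. The function name `depth_metrics`, the flat-list input format, and the rule of skipping non-positive values are assumptions made for illustration; individual benchmarks may crop, cap, or mask depths differently, so this is not the evaluation protocol of any specific leaderboard listed here.

```python
import math


def depth_metrics(pred, gt, threshold=1.25):
    """Compute RMSE, Abs Rel, and threshold accuracy over valid pixels.

    pred, gt: flat sequences of depths in the same unit.
    Pixels where either value is non-positive are treated as missing
    and skipped (an assumption; protocols vary between benchmarks).
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if p > 0 and g > 0]
    if not pairs:
        raise ValueError("no valid pixels to evaluate")
    n = len(pairs)

    # Root-mean-square error: lower is better, same unit as the depths.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)

    # Absolute relative error: mean of |pred - gt| / gt, lower is better.
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n

    # Threshold accuracy: fraction of pixels with
    # max(pred/gt, gt/pred) < threshold, higher is better.
    delta = sum(1 for p, g in pairs if max(p / g, g / p) < threshold) / n

    return {"rmse": rmse, "abs_rel": abs_rel, "delta": delta}


# Toy example: two exact pixels, one overestimated by 2x, one missing label.
scores = depth_metrics([1.0, 2.0, 4.0, 3.0], [1.0, 2.0, 2.0, 0.0])
print(scores)
```

Note that RMSE and Abs Rel are errors (lower is better), while Delta < 1.25 is an accuracy (higher is better), so rankings across tables with different metrics are not directly comparable.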