SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel from the camera. Depth is extracted from either monocular (single-view) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to relate the images to one another. Newer methods estimate depth directly, either by minimizing a regression loss or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated with an RMS error metric.
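The metrics named on this page (RMS error, absolute relative error, and the "Delta < 1.25" threshold accuracy) can be sketched as follows. This is a minimal illustration using the commonly cited definitions, not the evaluation code of any benchmark listed here. The function name `depth_metrics`, the flat-list input format, and the rule of skipping pixels with non-positive ground truth are assumptions made for the example.

```python
import math


def depth_metrics(pred, gt, threshold=1.25):
    """Compute RMSE, absolute relative error, and threshold accuracy.

    pred, gt: equal-length sequences of predicted and ground-truth depths.
    Pixels whose ground-truth depth is not positive are treated as invalid
    and skipped (an assumption; benchmarks define their own validity masks).
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must have the same length")

    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth pixels")
    n = len(pairs)

    # Root mean squared error over valid pixels.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)

    # Absolute relative error: mean of |pred - gt| / gt.
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n

    # Threshold accuracy: fraction of pixels with max(pred/gt, gt/pred)
    # below the threshold. Non-positive predictions never count as accurate.
    within = sum(1 for p, g in pairs if p > 0 and max(p / g, g / p) < threshold)
    delta = within / n

    return {"rmse": rmse, "abs_rel": abs_rel, "delta": delta}


# Hypothetical toy values, in arbitrary depth units.
example = depth_metrics(pred=[2.0, 4.5, 3.0, 10.0], gt=[2.0, 5.0, 3.0, 5.0])
print(example)
```

Lower is better for RMSE and absolute relative error, while higher is better for the threshold accuracy, so values for the two kinds of metric are ranked in opposite directions.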

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 201–250 of 2454 papers

| Title | Status | Hype |
| --- | --- | --- |
| Disparity Estimation Using a Quad-Pixel Sensor | Code | 1 |
| DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Code | 1 |
| InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth | Code | 1 |
| Structure-preserving Image Translation for Depth Estimation in Colonoscopy Video | Code | 1 |
| SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition | Code | 1 |
| Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height Estimation | Code | 1 |
| BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications | Code | 1 |
| BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation | Code | 1 |
| SINDER: Repairing the Singular Defects of DINOv2 | Code | 1 |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Code | 1 |
| ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion | Code | 1 |
| SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning | Code | 1 |
| Depth-Aware Endoscopic Video Inpainting | Code | 1 |
| Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation | Code | 1 |
| CaFNet: A Confidence-Driven Framework for Radar Camera Depth Estimation | Code | 1 |
| Scale-Invariant Monocular Depth Estimation via SSI Depth | Code | 1 |
| Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks | Code | 1 |
| EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian Splatting | Code | 1 |
| Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation Models | Code | 1 |
| Underwater Variable Zoom: Depth-Guided Perception Network for Underwater Image Enhancement | Code | 1 |
| Mining Supervision for Dynamic Regions in Self-Supervised Monocular Depth Estimation | Code | 1 |
| Digging into contrastive learning for robust depth estimation with diffusion models | Code | 1 |
| Stereo-LiDAR Depth Estimation with Deformable Propagation and Learned Disparity-Depth Conversion | Code | 1 |
| WorDepth: Variational Language Prior for Monocular Depth Estimation | Code | 1 |
| BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression Tasks | Code | 1 |
| Flare-Free Vision: Empowering Uformer with Depth Insights | Code | 1 |
| VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection | Code | 1 |
| ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition | Code | 1 |
| MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field | Code | 1 |
| SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images | Code | 1 |
| SM4Depth: Seamless Monocular Metric Depth Estimation across Multiple Cameras and Scenes by One Model | Code | 1 |
| Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation | Code | 1 |
| Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Code | 1 |
| Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving | Code | 1 |
| NeRF-Det++: Incorporating Semantic Cues and Perspective-aware Depth Supervision for Indoor Multi-View 3D Detection | Code | 1 |
| GAM-Depth: Self-Supervised Indoor Depth Estimation Leveraging a Gradient-Aware Mask and Semantic Constraints | Code | 1 |
| Depth-aware Volume Attention for Texture-less Stereo Matching | Code | 1 |
| RIDERS: Radar-Infrared Depth Estimation for Robust Sensing | Code | 1 |
| A Concise but High-performing Network for Image Guided Depth Completion in Autonomous Driving | Code | 1 |
| Range-Agnostic Multi-View Depth Estimation With Keyframe Selection | Code | 1 |
| InverseMatrixVT3D: An Efficient Projection Matrix-Based Approach for 3D Occupancy Prediction | Code | 1 |
| A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal Endoscopy | Code | 1 |
| Global and Hierarchical Geometry Consistency Priors for Few-shot NeRFs in Indoor Scenes | Code | 1 |
| Learning Deformable Hypothesis Sampling for Accurate PatchMatch Multi-View Stereo | Code | 1 |
| Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monocular Depth Estimation in Dynamic Scenes | Code | 1 |
| Harnessing Diffusion Models for Visual Perception with Meta Prompts | Code | 1 |
| Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization | Code | 1 |
| Atlantis: Enabling Underwater Depth Estimation with Stable Diffusion | Code | 1 |
| CT-MVSNet: Efficient Multi-View Stereo with Cross-scale Transformer | Code | 1 |
| EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text Alignment | Code | 1 |

Benchmark Results

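| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | OmniDepth | RMSE | 0.62 | | Unverified |
| 2 | SphereDepth | RMSE | 0.45 | | Unverified |
| 3 | Jin et al. | RMSE | 0.42 | | Unverified |
| 4 | BiFuse with fusion | RMSE | 0.41 | | Unverified |
| 5 | HoHoNet (ResNet-101) | RMSE | 0.38 | | Unverified |
| 6 | PanoDepth | RMSE | 0.37 | | Unverified |
| 7 | BiFuse++ | RMSE | 0.37 | | Unverified |
| 8 | UniFuse with fusion | RMSE | 0.37 | | Unverified |
| 9 | DisConv | RMSE | 0.37 | | Unverified |
| 10 | SliceNet | RMSE | 0.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | A2J | mAP | 8.61 | | Unverified |
| 2 | PAD-Net | RMS | 0.79 | | Unverified |
| 3 | MS-CRF | RMS | 0.59 | | Unverified |
| 4 | DORN | RMS | 0.51 | | Unverified |
| 5 | Freeform | RMS | 0.43 | | Unverified |
| 6 | Optimized, freeform | RMS | 0.43 | | Unverified |
| 7 | VNL | RMS | 0.42 | | Unverified |
| 8 | BTS | RMS | 0.41 | | Unverified |
| 9 | TransDepth (AGD+ ViT) | RMS | 0.37 | | Unverified |
| 10 | AdaBins | RMS | 0.36 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | T2Net | Abs Rel | 0.35 | | Unverified |
| 2 | MIDAS | Abs Rel | 0.31 | | Unverified |
| 3 | Bhattacharjee et al. | Abs Rel | 0.25 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | T2Net | Abs Rel | 0.49 | | Unverified |
| 2 | MIDAS | Abs Rel | 0.42 | | Unverified |
| 3 | Bhattacharjee et al. | Abs Rel | 0.38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LeReS | absolute relative error | 0.1 | | Unverified |
| 2 | DELTAS | absolute relative error | 0.09 | | Unverified |
| 3 | Distill Any Depth | absolute relative error | 0.04 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | SDC-Depth | RMSE | 6.92 | | Unverified |
| 2 | SwinMTL | RMSE | 6.35 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AIP-Brown | Delta < 1.25 | 0.36 | | Unverified |
| 2 | LeRes | Delta < 1.25 | 0.23 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | H-Net (Ours) | Absolute relative error (AbsRel) | 0.09 | | Unverified |
| 2 | H-Net (Ours) Full Eigen | Absolute relative error (AbsRel) | 0.08 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GLPDepth | Delta < 1.25 | 0.43 | | Unverified |
| 2 | SRDINET (Model A) | Delta < 1.25 | 0.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Atlas (finetuned) | RMSE | 0.17 | | Unverified |
| 2 | Atlas (plain) | RMSE | 0.17 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LFattNet | BadPix(0.01) | 17.23 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LightDepth | Number of parameters (M) | 42.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | UniFuse | Abs Rel | 0.11 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | X-TC (Cross-Task Consistency) | L1 error | 1.63 | | Unverified |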