SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 451500 of 2454 papers

TitleStatusHype
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth EstimationCode1
EvGGS: A Collaborative Learning Framework for Event-based Generalizable Gaussian SplattingCode1
Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual PerceptionCode1
Expediting Large-Scale Vision Transformer for Dense Prediction without Fine-tuningCode1
Beyond Image to Depth: Improving Depth Prediction using EchoesCode1
Deep Two-View Structure-from-Motion RevisitedCode1
Object-aware Monocular Depth Prediction with Instance ConvolutionsCode1
DeepVideoMVS: Multi-View Stereo on Video with Recurrent Spatio-Temporal FusionCode1
All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenCode1
Occlusion-Aware Cost Constructor for Light Field Depth EstimationCode1
Channel-Aware Distillation Transformer for Depth Estimation on Nano DronesCode1
DeFeat-Net: General Monocular Depth via Simultaneous Unsupervised Representation LearningCode1
Estimating Image Depth in the Comics DomainCode1
DEFLOW: Self-supervised 3D Motion Estimation of Debris FlowCode1
EPI-based Oriented Relation Networks for Light Field Depth EstimationCode1
BiFuse: Monocular 360 Depth Estimation via Bi-Projection FusionCode1
ES-Net: An Efficient Stereo Matching NetworkCode1
BiFuse++: Self-supervised and Efficient Bi-projection Fusion for 360 Depth EstimationCode1
Event-based Stereo Depth Estimation from Ego-motion using Ray Density FusionCode1
Exploring Sparse Visual Prompt for Domain Adaptive Dense PredictionCode1
End-to-End Pseudo-LiDAR for Image-Based 3D Object DetectionCode1
CDGS: Confidence-Aware Depth Regularization for 3D Gaussian SplattingCode1
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View ImagesCode1
On the Importance of Accurate Geometry Data for Dense 3D Vision TasksCode1
Energy-Efficient Adaptive 3D SensingCode1
EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised TrainingCode1
Dense Depth Estimation from Multiple 360-degree Images Using Virtual DepthCode1
Overcoming the Distance Estimation Bottleneck in Estimating Animal Abundance with Camera TrapsCode1
Adaptive confidence thresholding for monocular depth estimationCode1
EndoOmni: Zero-Shot Cross-Dataset Depth Estimation in Endoscopy by Robust Self-Learning from Noisy LabelsCode1
Categorical Depth Distribution Network for Monocular 3D Object DetectionCode1
Efficient Depth Estimation for Unstable Stereo Camera Systems on AR GlassesCode1
Efficient Neural Radiance Fields for Interactive Free-viewpoint VideoCode1
EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth PredictionCode1
ENRICH: Multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetryCode1
Photon-Starved Scene Inference using Single Photon CamerasCode1
Blur aware metric depth estimation with multi-focus plenoptic camerasCode1
Depth and DOF Cues Make A Better Defocus Blur DetectorCode1
A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view StereoCode1
Efficient Attention: Attention with Linear ComplexitiesCode1
Depth Any Canopy: Leveraging Depth Foundation Models for Canopy Height EstimationCode1
BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical ApplicationsCode1
Fully Self-Supervised Depth Estimation from Defocus ClueCode1
Can Language Understand Depth?Code1
EC-Depth: Exploring the consistency of self-supervised monocular depth estimation in challenging scenesCode1
Dusk Till Dawn: Self-supervised Nighttime Stereo Depth Estimation using Visual Foundation ModelsCode1
Predict to Detect: Prediction-guided 3D Object Detection using Sequential ImagesCode1
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth MapsCode1
Depth-aware Test-Time Training for Zero-shot Video Object SegmentationCode1
An intelligent modular real-time vision-based system for environment perceptionCode1
Show:102550
← PrevPage 10 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified