SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 9511000 of 2454 papers

TitleStatusHype
Meta-Optimization for Higher Model Generalizability in Single-Image Depth Prediction0
Learning Monocular Depth in Dynamic Environment via Context-aware Temporal Attention0
Virtual Occlusions Through Implicit DepthCode1
A Multi-modal Approach to Single-modal Visual Place Classification0
FusionDepth: Complement Self-Supervised Monocular Depth Estimation with Cost Volume0
Improving 2D face recognition via fine-level facial depth generation and RGB-D complementary feature learning0
Edge-aware Consistent Stereo Video Depth Estimation0
High-Resolution Synthetic RGB-D Datasets for Monocular Depth Estimation0
AutoColor: Learned Light Power Control for Multi-Color HologramsCode0
ViP-NeRF: Visibility Prior for Sparse Input Neural Radiance FieldsCode1
Exploring the Mutual Influence between Self-Supervised Single-Frame and Multi-Frame Depth EstimationCode0
CompletionFormer: Depth Completion with Convolutions and Vision TransformersCode2
Depth-Relative Self Attention for Monocular Depth Estimation0
RGB-D Indiscernible Object Counting in Underwater ScenesCode1
FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving0
A geometry-aware deep network for depth estimation in monocular endoscopyCode1
DarSwin: Distortion Aware Radial Swin Transformer0
CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection0
Pose Constraints for Consistent Self-supervised Monocular Depth and Ego-motionCode0
Learning to Fuse Monocular and Multi-view Cues for Multi-frame Depth Estimation in Dynamic ScenesCode1
360^ High-Resolution Depth Estimation via Uncertainty-aware Structural Knowledge Transfer0
EGformer: Equirectangular Geometry-biased Transformer for 360 Depth Estimation0
Temporally Consistent Online Depth Estimation Using Point-Based FusionCode1
DINOv2: Learning Robust Visual Features without SupervisionCode6
The Second Monocular Depth Estimation Challenge0
Self-Supervised Learning based Depth Estimation from Monocular ImagesCode0
Event-based tracking of human hands0
iDisc: Internal Discretization for Monocular Depth EstimationCode3
Improving Neural Radiance Fields with Depth-aware Optimization for Novel View SynthesisCode0
BEVStereo++: Accurate Depth Estimation in Multi-view 3D Object Detection via Dynamic Temporal Stereo0
DualRefine: Self-Supervised Depth and Pose Estimation Through Iterative Epipolar Sampling and Refinement Toward EquilibriumCode1
EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation0
DeLiRa: Self-Supervised Depth, Light, and Radiance Fields0
DEFLOW: Self-supervised 3D Motion Estimation of Debris FlowCode1
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth EstimationCode2
altiro3D: Scene representation from single image and novel view synthesisCode1
ENRICH: Multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetryCode1
Single Image Depth Prediction Made Better: A Multivariate Gaussian Take0
Joint Depth Estimation and Mixture of Rain Removal From a Single ImageCode0
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
SemHint-MD: Learning from Noisy Semantic Labels for Self-Supervised Monocular Depth Estimation0
DDP: Diffusion Model for Dense Visual PredictionCode2
TiDy-PSFs: Computational Imaging with Time-Averaged Dynamic Point-Spread-Functions0
DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and TrackingCode1
An intelligent modular real-time vision-based system for environment perceptionCode1
4K-HAZE: A Dehazing Benchmark with 4K Resolution Hazy and Haze-Free ImagesCode1
On the Importance of Accurate Geometry Data for Dense 3D Vision TasksCode1
Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes0
Collaboration Helps Camera Overtake LiDAR in 3D DetectionCode1
SCADE: NeRFs from Space Carving with Ambiguity-Aware Depth Estimates0
Show:102550
← PrevPage 20 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified