SOTAVerified

Depth Estimation

Depth estimation is the task of estimating, for each pixel, the distance from the camera to the scene point it images. Depth is extracted from either monocular (single-view) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to relate the images to one another. Newer methods estimate depth directly by minimizing a regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated with a root-mean-square (RMS) error metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
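
The metrics named on this page (RMS/RMSE, Abs Rel, Delta < 1.25) can be sketched as follows. This is a minimal illustration using their commonly used definitions, not the evaluation code of any listed benchmark. The function name `depth_metrics`, the flat-list input, and the rule that non-positive values mark invalid pixels are assumptions made for the example.

```python
import math

def depth_metrics(pred, gt):
    """Return RMSE, Abs Rel, and the Delta < 1.25 accuracy.

    pred, gt: flat sequences of predicted and ground-truth depths.
    Assumption: pixels with non-positive depth are invalid and skipped.
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if p > 0 and g > 0]
    if not pairs:
        raise ValueError("no valid pixels to evaluate")
    n = len(pairs)
    # Root-mean-square error, in the same units as the depth values.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    # Absolute relative error: mean of |pred - gt| / gt.
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    # Fraction of pixels whose ratio max(pred/gt, gt/pred) is below 1.25.
    delta1 = sum(1 for p, g in pairs if max(p / g, g / p) < 1.25) / n
    return {"rmse": rmse, "abs_rel": abs_rel, "delta_1.25": delta1}

# Toy data: the last pixel has no ground truth (0.0) and is ignored.
pred = [1.0, 2.2, 2.7, 4.0]
gt = [1.0, 2.0, 3.0, 0.0]
metrics = depth_metrics(pred, gt)
print(metrics)
```

Under these definitions, lower is better for RMSE and Abs Rel, while higher is better for the Delta < 1.25 accuracy. Individual benchmarks may cap depth, crop the image, or rescale predictions before computing these values, so numbers from different tables are not necessarily comparable.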

Papers

Showing 851–900 of 2454 papers

Title | Status | Hype
AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains | Code | 0
ObjCAViT: Improving Monocular Depth Estimation Using Natural Language Models And Image-Object Cross-Attention | Code | 0
360SD-Net: 360° Stereo Depth Estimation with Learnable Cost Volume | Code | 0
Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields | Code | 0
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian Splatting | Code | 0
EndoGaussian: Real-time Gaussian Splatting for Dynamic Endoscopic Scene Reconstruction | Code | 0
NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-training | Code | 0
Creative Flow+ Dataset | Code | 0
Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch Attack | Code | 0
Eliminating the Blind Spot: Adapting 3D Object Detection and Monocular Depth Estimation to 360° Panoramic Imagery | Code | 0
NeSLAM: Neural Implicit Mapping and Self-Supervised Feature Tracking With Depth Completion and Denoising | Code | 0
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Code | 0
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting | Code | 0
Attention-Based Depth Distillation with 3D-Aware Positional Encoding for Monocular 3D Object Detection | Code | 0
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Code | 0
MVDepthNet: Real-time Multiview Depth Estimation Neural Network | Code | 0
Correlation of Object Detection Performance with Visual Saliency and Depth Estimation | Code | 0
SDGOCC: Semantic and Depth-Guided Bird's-Eye View Transformation for 3D Multimodal Occupancy Prediction | Code | 0
Attention-based Context Aggregation Network for Monocular Depth Estimation | Code | 0
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation | Code | 0
Copy-Pasting Coherent Depth Regions Improves Contrastive Learning for Urban-Scene Segmentation | Code | 0
Adversarial Attacks on Monocular Pose Estimation | Code | 0
Mutli-View 3D Reconstruction using Knowledge Distillation | Code | 0
Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth Estimation | Code | 0
Convolution kernel adaptation to calibrated fisheye | Code | 0
Multi-View Silhouette and Depth Decomposition for High Resolution 3D Object Representation | Code | 0
Multi-View Stereo by Temporal Nonparametric Fusion | Code | 0
EDEN: Multimodal Synthetic Dataset of Enclosed GarDEN Scenes | Code | 0
Attacking Attention of Foundation Models Disrupts Downstream Tasks | Code | 0
EDADepth: Enhanced Data Augmentation for Monocular Depth Estimation | Code | 0
Multiview Detection with Cardboard Human Modeling | Code | 0
Normal Assisted Stereo Depth Estimation | Code | 0
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation | Code | 0
Multi-task Learning for Monocular Depth and Defocus Estimations with Real Images | Code | 0
Multiple Prior Representation Learning for Self-Supervised Monocular Depth Estimation via Hybrid Transformer | Code | 0
Dynamic Filter Networks | Code | 0
Continual Learning of Unsupervised Monocular Depth from Videos | Code | 0
Multi-Modal Attention Networks for Enhanced Segmentation and Depth Estimation of Subsurface Defects in Pulse Thermography | Code | 0
A Systematic Performance Analysis of Deep Perceptual Loss Networks: Breaking Transfer Learning Conventions | Code | 0
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation | Code | 0
Multi-Task Meta Learning: learn how to adapt to unseen tasks | Code | 0
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion | Code | 0
MultiDepth: Single-Image Depth Estimation via Multi-Task Regression and Classification | Code | 0
Multi-body Depth and Camera Pose Estimation from Multiple Views | Code | 0
Dual CNN Models for Unsupervised Monocular Depth Estimation | Code | 0
Asynchronous Collaborative Graph Representation for Frames and Events | Code | 0
Consistency Regularisation for Unsupervised Domain Adaptation in Monocular Depth Estimation | Code | 0
Consensus-based Optimization for 3D Human Pose Estimation in Camera Coordinates | Code | 0
MonoSIM: Simulating Learning Behaviors of Heterogeneous Point Cloud Object Detectors for Monocular 3D Object Detection | Code | 0
MorphEyes: Variable Baseline Stereo For Quadrotor Navigation | Code | 0
Page 18 of 50

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | OmniDepth | RMSE | 0.62 | - | Unverified
2 | SphereDepth | RMSE | 0.45 | - | Unverified
3 | Jin et al. | RMSE | 0.42 | - | Unverified
4 | BiFuse with fusion | RMSE | 0.41 | - | Unverified
5 | HoHoNet (ResNet-101) | RMSE | 0.38 | - | Unverified
6 | PanoDepth | RMSE | 0.37 | - | Unverified
7 | BiFuse++ | RMSE | 0.37 | - | Unverified
8 | UniFuse with fusion | RMSE | 0.37 | - | Unverified
9 | DisConv | RMSE | 0.37 | - | Unverified
10 | SliceNet | RMSE | 0.37 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | A2J | mAP | 8.61 | - | Unverified
2 | PAD-Net | RMS | 0.79 | - | Unverified
3 | MS-CRF | RMS | 0.59 | - | Unverified
4 | DORN | RMS | 0.51 | - | Unverified
5 | Freeform | RMS | 0.43 | - | Unverified
6 | Optimized, freeform | RMS | 0.43 | - | Unverified
7 | VNL | RMS | 0.42 | - | Unverified
8 | BTS | RMS | 0.41 | - | Unverified
9 | TransDepth (AGD+ ViT) | RMS | 0.37 | - | Unverified
10 | AdaBins | RMS | 0.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T2Net | Abs Rel | 0.35 | - | Unverified
2 | MIDAS | Abs Rel | 0.31 | - | Unverified
3 | Bhattacharjee et al. | Abs Rel | 0.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T2Net | Abs Rel | 0.49 | - | Unverified
2 | MIDAS | Abs Rel | 0.42 | - | Unverified
3 | Bhattacharjee et al. | Abs Rel | 0.38 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeReS | absolute relative error | 0.1 | - | Unverified
2 | DELTAS | absolute relative error | 0.09 | - | Unverified
3 | Distill Any Depth | absolute relative error | 0.04 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SDC-Depth | RMSE | 6.92 | - | Unverified
2 | SwinMTL | RMSE | 6.35 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AIP-Brown | Delta < 1.25 | 0.36 | - | Unverified
2 | LeRes | Delta < 1.25 | 0.23 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | H-Net (Ours) | Absolute relative error (AbsRel) | 0.09 | - | Unverified
2 | H-Net (Ours) Full Eigen | Absolute relative error (AbsRel) | 0.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLPDepth | Delta < 1.25 | 0.43 | - | Unverified
2 | SRDINET (Model A) | Delta < 1.25 | 0.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Atlas (finetuned) | RMSE | 0.17 | - | Unverified
2 | Atlas (plain) | RMSE | 0.17 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LFattNet | BadPix(0.01) | 17.23 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LightDepth | Number of parameters (M) | 42.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniFuse | Abs Rel | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X-TC (Cross-Task Consistency) | L1 error | 1.63 | - | Unverified