SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 22512300 of 2454 papers

TitleStatusHype
MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection0
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos0
MonoPCC: Photometric-invariant Cycle Constraint for Monocular Depth Estimation of Endoscopic Images0
MonoPGC: Monocular 3D Object Detection with Pixel Geometry Contexts0
MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications0
MonoSelfRecon: Purely Self-Supervised Explicit Generalizable 3D Reconstruction of Indoor Scenes from Monocular RGB Views0
MonSter: Awakening the Mono in Stereo0
Motion Guided LIDAR-camera Self-calibration and Accelerated Depth Upsampling for Autonomous Vehicles0
MOTSLAM: MOT-assisted monocular dynamic SLAM using single-view depth estimation0
Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes0
MSDPN: Monocular Depth Prediction with Partial Laser Observation using Multi-stage Neural Networks0
MSFNet:Multi-scale features network for monocular depth estimation0
MSS-DepthNet: Depth Prediction with Multi-Step Spiking Neural Network0
MT3DNet: Multi-Task learning Network for 3D Surgical Scene Reconstruction0
Multi-Camera Calibration Free BEV Representation for 3D Object Detection0
Multi-Camera Collaborative Depth Prediction via Consistent Structure Estimation0
MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes0
Multi-Frame Self-Supervised Depth Estimation with Multi-Scale Feature Fusion in Dynamic Scenes0
Multi-Frame Self-Supervised Depth with Transformers0
MultiGO: Towards Multi-level Geometry Learning for Monocular 3D Textured Human Reconstruction0
BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection0
Multi-Modal Depth Estimation Using Convolutional Neural Networks0
Multimodal End-to-End Autonomous Driving0
Multi-Modality Driven LoRA for Adverse Condition Depth Estimation0
Multi-Object Discovery by Low-Dimensional Object Motion0
MonoNeRF: Learning Generalizable NeRFs from Monocular Videos without Camera Pose0
Multi-Robot Collaborative Perception with Graph Neural Networks0
Gated Cross-Attention Network for Depth Completion0
MultiTask-CenterNet (MCN): Efficient and Diverse Multitask Learning using an Anchor Free Approach0
Multi-task human analysis in still images: 2D/3D pose, depth map, and multi-part segmentation0
Multi-Task Learning-Enabled Automatic Vessel Draft Reading for Intelligent Maritime Surveillance0
Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator0
Multi-task learning from fixed-wing UAV images for 2D/3D city modeling0
Multi-task learning with cross-task consistency for improved depth estimation in colonoscopy0
Multi-Task Learning with Knowledge Distillation for Dense Prediction0
Multi-Task Self-Supervised Learning for Image Segmentation Task0
Multi-task Self-Supervised Visual Learning0
Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings0
Multi-View Reconstruction using Signed Ray Distance Functions (SRDF)0
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation0
Mutual Adaptive Reasoning for Monocular 3D Multi-Person Pose Estimation0
MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation0
MV-ROPE: Multi-view Constraints for Robust Category-level Object Pose and Size Estimation0
MVSAnywhere: Zero-Shot Multi-View Stereo0
MVS-GS: High-Quality 3D Gaussian Splatting Mapping via Online Multi-View Stereo0
MVTrans: Multi-View Perception of Transparent Objects0
Towards Comprehensive Monocular Depth Estimation: Multiple Heads Are Better Than One0
NeRFmentation: NeRF-based Augmentation for Monocular Depth Estimation0
NeRFReN: Neural Radiance Fields with Reflections0
NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields0
Show:102550
← PrevPage 46 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified