SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods estimate depth directly by minimizing a regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated with an RMS (root-mean-square) error metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset
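As a small illustration of the multi-view geometry mentioned in the description: for a rectified stereo pair, depth follows from disparity by triangulation, Z = f * B / d. The sketch below assumes a rectified pinhole stereo rig; the function names, parameter names, and example numbers are hypothetical and do not come from any listed paper.

```python
import math


def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth in metres for one pixel of a rectified stereo pair.

    Assumption: Z = f * B / d, with focal length f in pixels,
    baseline B in metres and disparity d in pixels.
    """
    if disparity_px <= 0:
        # No valid match (or a point at infinity): depth is unbounded.
        return math.inf
    return focal_px * baseline_m / disparity_px


def disparity_map_to_depth(disparity_map, focal_px, baseline_m):
    """Apply the same conversion to a 2D list of disparities."""
    return [
        [disparity_to_depth(d, focal_px, baseline_m) for d in row]
        for row in disparity_map
    ]


# Hypothetical rig: 500 px focal length, 0.1 m baseline.
depth_map = disparity_map_to_depth([[50.0, 25.0], [0.0, 10.0]], 500.0, 0.1)
```

Larger disparities map to smaller depths, which is why stereo depth error grows quickly for distant points.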

Papers

Showing 801–850 of 2454 papers

Title | Status | Hype
Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving | - | 0
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better | - | 0
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust Fusion | Code | 0
Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving | Code | 0
StableGS: A Floater-Free Framework for 3D Gaussian Splatting | - | 0
PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes | - | 0
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Code | 0
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors | - | 0
GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects | - | 0
Radar-Guided Polynomial Fitting for Metric Depth Estimation | - | 0
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction | - | 0
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | - | 0
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras | - | 0
TULIP: Towards Unified Language-Image Pretraining | - | 0
EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining | Code | 0
USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network | - | 0
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation | - | 0
3D Densification for Multi-Map Monocular VSLAM in Endoscopy | - | 0
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers | - | 0
MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models | - | 0
Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios | - | 0
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation | - | 0
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations | - | 0
WonderVerse: Extendable 3D Scene Generation with Video Generative Models | - | 0
GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing | - | 0
Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion | - | 0
Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity | Code | 0
TomatoScanner: phenotyping tomato fruit based on only RGB image | Code | 0
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images | - | 0
Multi-View Depth Consistent Image Generation Using Generative AI Models: Application on Architectural Design of University Buildings | - | 0
RGB-Thermal Infrared Fusion for Robust Depth Estimation in Complex Environments | - | 0
Task-Agnostic Attacks Against Vision Foundation Models | Code | 0
SLAM in the Dark: Self-Supervised Learning of Pose, Depth and Loop-Closure from Thermal Images | - | 0
RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes | - | 0
Self-supervised Monocular Depth Estimation Robust to Reflective Surface Leveraged by Triplet Mining | - | 0
OrchardDepth: Precise Metric Depth Estimation of Orchard Scene from Monocular Camera Images | - | 0
Challenges of Multi-Modal Coreset Selection for Depth Prediction | Code | 0
LXLv2: Enhanced LiDAR Excluded Lean 3D Object Detection with Fusion of 4D Radar and Camera | - | 0
SHADeS: Self-supervised Monocular Depth Estimation Through Non-Lambertian Image Decomposition | Code | 0
Pre-training Auto-regressive Robotic Models with 4D Representations | - | 0
Deep Neural Networks for Accurate Depth Estimation with Latent Space Features | - | 0
Adjust Your Focus: Defocus Deblurring From Dual-Pixel Images Using Explicit Multi-Scale Cross-Correlation | - | 0
RealCam-I2V: Real-World Image-to-Video Generation with Interactive Complex Camera Control | - | 0
CoL3D: Collaborative Learning of Single-view Depth and Camera Intrinsics for Metric 3D Shape Recovery | - | 0
SteROI-D: System Design and Mapping for Stereo Depth Inference on Regions of Interest | - | 0
S^2-Diffusion: Generalizing from Instance-level to Category-level Skills in Robot Manipulation | - | 0
Matrix3D: Large Photogrammetry Model All-in-One | - | 0
Learning Inverse Laplacian Pyramid for Progressive Depth Completion | - | 0
From Image to Video: An Empirical Study of Diffusion Representations | - | 0
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | OmniDepth | RMSE | 0.62 | - | Unverified
2 | SphereDepth | RMSE | 0.45 | - | Unverified
3 | Jin et al. | RMSE | 0.42 | - | Unverified
4 | BiFuse with fusion | RMSE | 0.41 | - | Unverified
5 | HoHoNet (ResNet-101) | RMSE | 0.38 | - | Unverified
6 | PanoDepth | RMSE | 0.37 | - | Unverified
7 | BiFuse++ | RMSE | 0.37 | - | Unverified
8 | UniFuse with fusion | RMSE | 0.37 | - | Unverified
9 | DisConv | RMSE | 0.37 | - | Unverified
10 | SliceNet | RMSE | 0.37 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | A2J | mAP | 8.61 | - | Unverified
2 | PAD-Net | RMS | 0.79 | - | Unverified
3 | MS-CRF | RMS | 0.59 | - | Unverified
4 | DORN | RMS | 0.51 | - | Unverified
5 | Freeform | RMS | 0.43 | - | Unverified
6 | Optimized, freeform | RMS | 0.43 | - | Unverified
7 | VNL | RMS | 0.42 | - | Unverified
8 | BTS | RMS | 0.41 | - | Unverified
9 | TransDepth (AGD+ ViT) | RMS | 0.37 | - | Unverified
10 | AdaBins | RMS | 0.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T2Net | Abs Rel | 0.35 | - | Unverified
2 | MIDAS | Abs Rel | 0.31 | - | Unverified
3 | Bhattacharjee et al. | Abs Rel | 0.25 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | T2Net | Abs Rel | 0.49 | - | Unverified
2 | MIDAS | Abs Rel | 0.42 | - | Unverified
3 | Bhattacharjee et al. | Abs Rel | 0.38 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LeReS | absolute relative error | 0.1 | - | Unverified
2 | DELTAS | absolute relative error | 0.09 | - | Unverified
3 | Distill Any Depth | absolute relative error | 0.04 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SDC-Depth | RMSE | 6.92 | - | Unverified
2 | SwinMTL | RMSE | 6.35 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AIP-Brown | Delta < 1.25 | 0.36 | - | Unverified
2 | LeRes | Delta < 1.25 | 0.23 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | H-Net (Ours) | Absolute relative error (AbsRel) | 0.09 | - | Unverified
2 | H-Net (Ours) Full Eigen | Absolute relative error (AbsRel) | 0.08 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GLPDepth | Delta < 1.25 | 0.43 | - | Unverified
2 | SRDINET (Model A) | Delta < 1.25 | 0.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Atlas (finetuned) | RMSE | 0.17 | - | Unverified
2 | Atlas (plain) | RMSE | 0.17 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LFattNet | BadPix(0.01) | 17.23 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LightDepth | Number of parameters (M) | 42.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | UniFuse | Abs Rel | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X-TC (Cross-Task Consistency) | L1 error | 1.63 | - | Unverified
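
For readers unfamiliar with the metric names in the tables above, the following is a minimal sketch of how RMSE, absolute relative error (Abs Rel), and the Delta < 1.25 accuracy are commonly computed from predicted and ground-truth depths. It is not the evaluation code of any listed benchmark; the function name, the convention that non-positive ground truth marks a missing pixel, and the requirement that predictions be strictly positive are all assumptions made for illustration.

```python
import math


def depth_metrics(pred, gt, threshold=1.25):
    """Compute common depth metrics over flat sequences of depths.

    Assumptions: ground-truth values <= 0 mark missing pixels and are
    skipped; predictions are strictly positive.
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth pixels")
    n = len(pairs)

    # Root-mean-square error, in the same unit as the depths.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    # Mean absolute error relative to the true depth (unitless).
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    # Fraction of pixels whose ratio max(p/g, g/p) is below the threshold.
    delta = sum(1 for p, g in pairs if max(p / g, g / p) < threshold) / n

    return {"rmse": rmse, "abs_rel": abs_rel, "delta": delta}


# Hypothetical values; the last pixel has no ground truth and is ignored.
metrics = depth_metrics([1.0, 2.0, 2.0, 3.0], [1.0, 2.0, 4.0, 0.0])
```

RMSE and Abs Rel are errors, so lower is better, while the Delta accuracy is a fraction of pixels, so higher is better. Individual benchmarks may differ in depth caps, cropping, and scale alignment, none of which this sketch models.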