SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 101150 of 2454 papers

TitleStatusHype
Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video0
FreSca: Unveiling the Scaling Space in Diffusion Models0
GaussianLSS -- Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting0
A novel gesture interaction control method for rehabilitation lower extremity exoskeleton0
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB ImageCode1
Monocular and Generalizable Gaussian Talking Head Animation0
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors0
Detail-aware multi-view stereo network for depth estimationCode0
ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image0
Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation ModelCode1
Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries0
Deep Depth Estimation from Thermal Image: Dataset, Benchmark, and ChallengesCode1
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images0
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces0
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian SplattingCode0
MVSAnywhere: Zero-Shot Multi-View Stereo0
ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo0
A Unified Image-Dense Annotation Generation Model for Underwater ScenesCode2
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single VideoCode3
Omnidirectional Depth-Aided Occupancy Prediction based on Cylindrical Voxel for Autonomous Driving0
Synthetic-to-Real Self-supervised Robust Depth Estimation via Learning with Motion and Structure PriorsCode1
Tracktention: Leveraging Point Tracking to Attend Videos Faster and Better0
FUSE: Label-Free Image-Event Joint Monocular Depth Estimation via Frequency-Decoupled Alignment and Degradation-Robust FusionCode0
Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous DrivingCode0
StableGS: A Floater-Free Framework for 3D Gaussian Splatting0
PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes0
Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial ImagesCode0
Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors0
Radar-Guided Polynomial Fitting for Metric Depth Estimation0
GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects0
QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the EdgeCode1
Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation0
Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction0
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras0
TULIP: Towards Unified Language-Image Pretraining0
USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation using Features from a Pre-trained Image Segmentation network0
EgoDTM: Towards 3D-Aware Egocentric Video-Language PretrainingCode0
3D Densification for Multi-Map Monocular VSLAM in Endoscopy0
DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers0
Multi-view Reconstruction via SfM-guided Monocular Depth Estimation0
Improving Geometric Consistency for 360-Degree Neural Radiance Fields in Indoor Scenarios0
MonoCT: Overcoming Monocular 3D Detection Domain Shift with Consistent Teacher Models0
Seeing and Seeing Through the Glass: Real and Synthetic Data for Multi-Layer Depth Estimation0
VGGT: Visual Geometry Grounded TransformerCode11
Simulating Dual-Pixel Images From Ray Tracing For Depth EstimationCode1
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations0
WonderVerse: Extendable 3D Scene Generation with Video Generative Models0
GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing0
Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion0
LBM: Latent Bridge Matching for Fast Image-to-Image TranslationCode4
Show:102550
← PrevPage 3 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified