SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 301350 of 2454 papers

TitleStatusHype
SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation0
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object ManipulationCode5
Helvipad: A Real-World Dataset for Omnidirectional Stereo Depth EstimationCode1
Low-rank Adaptation-based All-Weather Removal for Autonomous Navigation0
DepthCues: Evaluating Monocular Depth Perception in Large Vision Models0
Boost 3D Reconstruction using Diffusion-based Monocular Camera CalibrationCode2
Spatially Visual Perception for End-to-End Robotic Learning0
DROID-Splat: Combining end-to-end SLAM with 3D Gaussian SplattingCode3
Generative Omnimatte: Learning to Decompose Video into Layers0
One Diffusion to Generate Them AllCode4
PriorDiffusion: Leverage Language Prior in Diffusion Models for Monocular Depth Estimation0
Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors0
OceanLens: An Adaptive Backscatter and Edge Correction using Deep Learning Model for Enhanced Underwater ImagingCode1
DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild0
Scalable Autoregressive Monocular Depth Estimation0
GPS-Gaussian+: Generalizable Pixel-wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering from Sparse Views0
The ADUULM-360 Dataset -- A Multi-Modal Dataset for Depth Estimation in Adverse WeatherCode0
MGNiceNet: Unified Monocular Geometric Scene UnderstandingCode0
MetricGold: Leveraging Text-To-Image Latent Diffusion Models for Metric Depth EstimationCode0
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D ReconstructionCode2
Efficient Depth Estimation for Unstable Stereo Camera Systems on AR GlassesCode1
Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching0
RenderBender: A Survey on Adversarial Attacks Using Differentiable Rendering0
Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting0
OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Fused Geometric and Semantic GuidanceCode2
Scaling Properties of Diffusion Models for Perceptual Tasks0
SE(3) Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation0
SimpleBEV: Improved LiDAR-Camera Fusion Architecture for 3D Object Detection0
Enhancing Bronchoscopy Depth Estimation through Synthetic-to-Real Domain Adaptation0
D^3epth: Self-Supervised Depth Estimation with Dynamic Mask in Dynamic ScenesCode0
Revisiting Disparity from Dual-Pixel Images: Physics-Informed Lightweight Depth Estimation0
Adaptive Stereo Depth Estimation with Multi-Spectral Images Across All Lighting Conditions0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
PMPNet: Pixel Movement Prediction Network for Monocular Depth Estimation in Dynamic Scenes0
FewViewGS: Gaussian Splatting with Few View Matching and Multi-stage Training0
Improving Domain Generalization in Self-supervised Monocular Depth Estimation via Stabilized Adversarial Training0
MultiDepth: Multi-Sample Priors for Refining Monocular Metric Depth Estimations in Indoor Scenes0
On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDARCode2
Optical Lens Attack on Monocular Depth Estimation for Autonomous Driving0
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesCode2
Nested ResNet: A Vision-Based Method for Detecting the Sensing Area of a Drop-in Gamma Probe0
PF3plat: Pose-Free Feed-Forward 3D Gaussian SplattingCode3
Active Event Alignment for Monocular Distance Estimation0
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane SweepsCode1
Depth Attention for Robust RGB TrackingCode1
Unlocking Comics: The AI4VA Dataset for Visual UnderstandingCode1
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error PriorsCode2
Segmentation-aware Prior Assisted Joint Global Information Aggregated 3D Building Reconstruction0
Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared imagesCode1
Retrieving snow depth distribution by downscaling ERA5 Reanalysis with ICESat-2 laser altimetry0
Show:102550
← PrevPage 7 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified