SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 1120 of 876 papers

TitleStatusHype
ZoeDepth: Zero-shot Transfer by Combining Relative and Metric DepthCode5
UniK3D: Universal Camera Monocular 3D EstimationCode4
Distill Any Depth: Distillation Creates a Stronger Monocular Depth EstimatorCode4
MonSter: Marry Monodepth to Stereo Unleashes PowerCode4
Fine-Tuning Image-Conditional Diffusion Models is Easier than You ThinkCode4
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual OdometryCode4
DepthFM: Fast Monocular Depth Estimation with Flow MatchingCode4
Repurposing Diffusion-Based Image Generators for Monocular Depth EstimationCode4
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single ImageCode4
LiftFeat: 3D Geometry-Aware Local Feature MatchingCode3
Show:102550
← PrevPage 2 of 88Next →

No leaderboard results yet.