SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 3140 of 876 papers

TitleStatusHype
DepthMaster: Taming Diffusion Models for Monocular Depth EstimationCode2
OSMLoc: Single Image-Based Visual Localization in OpenStreetMap with Fused Geometric and Semantic GuidanceCode2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesCode2
Refinement of Monocular Depth Maps via Multi-View Differentiable RenderingCode2
PrimeDepth: Efficient Monocular Depth Estimation with a Stable Diffusion PreimageCode2
Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth EstimationCode2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image PriorsCode2
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging ConditionsCode2
Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth EstimationCode2
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth GenerationCode2
Show:102550
← PrevPage 4 of 88Next →

No leaderboard results yet.