SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 2130 of 876 papers

TitleStatusHype
Relative Pose Estimation through Affine Corrections of Monocular Depth PriorsCode3
PF3plat: Pose-Free Feed-Forward 3D Gaussian SplattingCode3
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single ImageCode3
GenWarp: Single Image to Novel Views with Semantic-Preserving Generative WarpingCode3
RoadBEV: Road Surface Reconstruction in Bird's Eye ViewCode3
ECoDepth: Effective Conditioning of Diffusion Models for Monocular Depth EstimationCode3
What Matters When Repurposing Diffusion Models for General Dense Perception Tasks?Code3
iDisc: Internal Discretization for Monocular Depth EstimationCode3
Vision Transformers for Dense PredictionCode3
Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset TransferCode3
Show:102550
← PrevPage 3 of 88Next →

No leaderboard results yet.