SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 1120 of 876 papers

TitleStatusHype
EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised TrainingCode1
TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented ContrastCode1
Flow-Anything: Learning Real-World Optical Flow Estimation from Large-Scale Single-view Images0
EgoM2P: Egocentric Multimodal Multitask Pretraining0
Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence0
Structure-Aware Radar-Camera Depth Estimation0
Toward Better SSIM Loss for Unsupervised Monocular Depth Estimation0
Bridging Geometric and Semantic Foundation Models for Generalized Monocular Depth Estimation0
Spatial RoboGrasp: Generalized Robotic Grasping Control Policy0
From Single Images to Motion Policies via Video-Generation Environment Representations0
Show:102550
← PrevPage 2 of 88Next →

No leaderboard results yet.