SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 5175 of 876 papers

TitleStatusHype
NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth EstimationCode2
Plane2Depth: Hierarchical Adaptive Plane Guidance for Monocular Depth EstimationCode2
SelfOcc: Self-Supervised Vision-Based 3D Occupancy PredictionCode2
Towards Zero-Shot Scale-Aware Monocular Depth EstimationCode2
Learning to Recover 3D Scene Shape from a Single ImageCode2
DDP: Diffusion Model for Dense Visual PredictionCode2
Lite-Mono: A Lightweight CNN and Transformer Architecture for Self-Supervised Monocular Depth EstimationCode2
Unleashing Text-to-Image Diffusion Models for Visual PerceptionCode2
Joint 2D-3D Multi-Task Learning on Cityscapes-3D: 3D Detection, Segmentation, and Depth EstimationCode2
Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution MergingCode2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTVCode2
A Survey on RGB-D DatasetsCode2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image PriorsCode2
Enforcing geometric constraints of virtual normal for depth predictionCode2
IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth GenerationCode2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving ApplicationsCode2
DepthMaster: Taming Diffusion Models for Monocular Depth EstimationCode2
BinsFormer: Revisiting Adaptive Bins for Monocular Depth EstimationCode2
ImOV3D: Learning Open-Vocabulary Point Clouds 3D Object Detection from Only 2D ImagesCode2
Deeper into Self-Supervised Monocular Indoor Depth EstimationCode1
Detecting Invisible PeopleCode1
A Study on the Generality of Neural Network Structures for Monocular Depth EstimationCode1
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal EndoscopyCode1
Absolute distance prediction based on deep learning object detection and monocular depth estimation modelsCode1
Deconstructing Self-Supervised Monocular Reconstruction: The Design Decisions that MatterCode1
Show:102550
← PrevPage 3 of 36Next →

No leaderboard results yet.