SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for scene understanding in applications such as 3D scene reconstruction, autonomous driving, and augmented reality (AR). State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using root mean squared error (RMSE) or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data
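
As a rough illustration of the two evaluation metrics named in the description, the sketch below computes RMSE and absolute relative error over flattened depth maps. The function names (`rmse`, `abs_rel`) and the convention of skipping pixels whose ground-truth depth is not positive are assumptions made for this example; individual benchmarks define their own valid-pixel masks and depth caps.

```python
import math
from typing import List, Sequence, Tuple


def _valid_pairs(pred: Sequence[float], gt: Sequence[float]) -> List[Tuple[float, float]]:
    """Pair predictions with ground truth, keeping only pixels with gt > 0.

    Treating non-positive ground truth as "no measurement" is an assumption
    of this sketch, not a rule stated by the source page.
    """
    if len(pred) != len(gt):
        raise ValueError("pred and gt must have the same number of pixels")
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth pixels")
    return pairs


def rmse(pred: Sequence[float], gt: Sequence[float]) -> float:
    """Root mean squared error: sqrt(mean((pred - gt)^2)) over valid pixels."""
    pairs = _valid_pairs(pred, gt)
    return math.sqrt(sum((p - g) ** 2 for p, g in pairs) / len(pairs))


def abs_rel(pred: Sequence[float], gt: Sequence[float]) -> float:
    """Absolute relative error: mean(|pred - gt| / gt) over valid pixels."""
    pairs = _valid_pairs(pred, gt)
    return sum(abs(p - g) / g for p, g in pairs) / len(pairs)


if __name__ == "__main__":
    # Toy depths in metres; the last pixel has no ground truth and is skipped.
    predicted = [2.0, 4.0, 5.0]
    ground_truth = [2.0, 5.0, 0.0]
    print("RMSE:", rmse(predicted, ground_truth))
    print("AbsRel:", abs_rel(predicted, ground_truth))
```

Lower is better for both metrics. RMSE is expressed in the same unit as the depth values and weights large errors more heavily, while absolute relative error normalizes each pixel's error by its true depth.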

Papers

Showing 226–250 of 876 papers

| Title | Status | Hype |
| --- | --- | --- |
| SSAP: A Shape-Sensitive Adversarial Patch for Comprehensive Disruption of Monocular Depth Estimation in Autonomous Navigation Applications | | 0 |
| SwinMTL: A Shared Architecture for Simultaneous Depth Estimation and Semantic Segmentation from Monocular Camera Images | Code | 1 |
| Touch-GS: Visual-Tactile Supervised 3D Gaussian Splatting | | 0 |
| METER: a mobile vision transformer architecture for monocular depth estimation | Code | 0 |
| WaveShot: A Compact Portable Unmanned Surface Vessel for Dynamic Water Surface Videography and Media Production | | 0 |
| Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving | Code | 2 |
| D4D: An RGBD diffusion model to boost monocular depth estimation | Code | 0 |
| What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? | Code | 3 |
| Stealing Stable Diffusion Prior for Robust Monocular Depth Estimation | Code | 1 |
| Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous Driving | Code | 1 |
| DD-VNB: A Depth-based Dual-Loop Framework for Real-time Visually Navigated Bronchoscopy | | 0 |
| Pyramid Feature Attention Network for Monocular Depth Prediction | | 0 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Code | 2 |
| PCDepth: Pattern-based Complementary Learning for Monocular Depth Estimation by Best of Both Worlds | | 0 |
| TIE-KD: Teacher-Independent and Explainable Knowledge Distillation for Monocular Depth Estimation | Code | 0 |
| Zero-BEV: Zero-shot Projection of Any First-Person Modality to BEV Maps | | 0 |
| An Endoscopic Chisel: Intraoperative Imaging Carves 3D Anatomical Models | | 0 |
| Unveiling the Depths: A Multi-Modal Fusion Framework for Challenging Scenarios | | 0 |
| MAL: Motion-Aware Loss with Temporal and Distillation Hints for Self-Supervised Depth Estimation | | 0 |
| Efficient Multi-task Uncertainties for Joint Semantic Segmentation and Monocular Depth Estimation | | 0 |
| MoD-SLAM: Monocular Dense Mapping for Unbounded 3D Scene Reconstruction | | 0 |
| CLIP Can Understand Depth | | 0 |
| Diffusion-based Light Field Synthesis | | 0 |
| Depth Anything in Medical Images: A Comparative Study | | 0 |
| Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting | Code | 7 |

No leaderboard results yet.