SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 141150 of 876 papers

TitleStatusHype
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world VideosCode5
Large Language Models Can Understanding Depth from Monocular Images0
DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation ModelCode1
EvLight++: Low-Light Video Enhancement with an Event Camera: A Large-Scale Real-World Dataset, Novel Method, and More0
Adversarial Manhole: Challenging Monocular Depth Estimation and Semantic Segmentation Models with Patch AttackCode0
NimbleD: Enhancing Self-supervised Monocular Depth Estimation with Pseudo-labels and Large-scale Video Pre-trainingCode0
TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers0
InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular DepthCode1
Structure-preserving Image Translation for Depth Estimation in Colonoscopy VideoCode1
Enhanced Scale-aware Depth Estimation for Monocular Endoscopic Scenes with Geometric Modeling0
Show:102550
← PrevPage 15 of 88Next →

No leaderboard results yet.