SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for determining scene understanding for applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using RMSE or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data

Papers

Showing 101150 of 876 papers

TitleStatusHype
Instance-wise Depth and Motion Learning from Monocular VideosCode1
Improving Semi-Supervised and Domain-Adaptive Semantic Segmentation with Self-Supervised Depth EstimationCode1
Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth EstimationCode1
EndoDepth: A Benchmark for Assessing Robustness in Endoscopic Depth PredictionCode1
Automated Distance Estimation for Wildlife Camera TrappingCode1
EndoMUST: Monocular Depth Estimation for Robotic Endoscopy via End-to-end Multi-step Self-supervised TrainingCode1
Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular DepthCode1
Can Language Understand Depth?Code1
ENRICH: Multi-purposE dataset for beNchmaRking In Computer vision and pHotogrammetryCode1
EVP: Enhanced Visual Perception using Inverse Multi-Attentive Feature Refinement and Regularized Image-Text AlignmentCode1
Image Masking for Robust Self-Supervised Monocular Depth EstimationCode1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth EstimationCode1
Chitransformer: Towards Reliable Stereo From CuesCode1
A Practical Stereo Depth System for Smart GlassesCode1
Boosting Light-Weight Depth Estimation Via Knowledge DistillationCode1
Latent Discriminant deterministic UncertaintyCode1
Fast Neural Architecture Search of Compact Semantic Segmentation Models via Auxiliary CellsCode1
CoDEPS: Online Continual Learning for Depth Estimation and Panoptic SegmentationCode1
A Study on Self-Supervised Pretraining for Vision Problems in Gastrointestinal EndoscopyCode1
Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth PredictionCode1
A Study on the Generality of Neural Network Structures for Monocular Depth EstimationCode1
Feature-metric Loss for Self-supervised Learning of Depth and EgomotionCode1
3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-LabelingCode1
Digging Into Uncertainty-based Pseudo-label for Robust Stereo MatchingCode1
BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical ApplicationsCode1
Adaptive confidence thresholding for monocular depth estimationCode1
DiPE: Deeper into Photometric Errors for Unsupervised Learning of Depth and Ego-motion from Monocular VideosCode1
IEBins: Iterative Elastic Bins for Monocular Depth EstimationCode1
Implicit Integration of Superpixel Segmentation into Fully Convolutional NetworksCode1
Always Clear Depth: Robust Monocular Depth Estimation under Adverse WeatherCode1
BiFuse++: Self-supervised and Efficient Bi-projection Fusion for 360 Depth EstimationCode1
Bidirectional Attention Network for Monocular Depth EstimationCode1
Detaching and Boosting: Dual Engine for Scale-Invariant Self-Supervised Monocular Depth EstimationCode1
altiro3D: Scene representation from single image and novel view synthesisCode1
Detecting Invisible PeopleCode1
All in Tokens: Unifying Output Space of Visual Tasks via Soft TokenCode1
BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth EstimationCode1
AdaBins: Depth Estimation using Adaptive BinsCode1
Digging Into Self-Supervised Monocular Depth EstimationCode1
Distilled Semantics for Comprehensive Scene Understanding from VideosCode1
Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-supervised LearningCode1
DEPTHOR: Depth Enhancement from a Practical Light-Weight dToF Sensor and RGB ImageCode1
HSPFormer: Hierarchical Spatial Perception Transformer for Semantic SegmentationCode1
BadPart: Unified Black-box Adversarial Patch Attacks against Pixel-wise Regression TasksCode1
Depth Map Decomposition for Monocular Depth EstimationCode1
Depth Map Prediction from a Single Image using a Multi-Scale Deep NetworkCode1
Deeper Depth Prediction with Fully Convolutional Residual NetworksCode1
High Quality Monocular Depth Estimation via Transfer LearningCode1
Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information FusionCode1
Harnessing Diffusion Models for Visual Perception with Meta PromptsCode1
Show:102550
← PrevPage 3 of 18Next →

No leaderboard results yet.