SOTAVerified

Monocular Depth Estimation

Monocular Depth Estimation is the task of estimating the depth value (distance relative to the camera) of each pixel given a single (monocular) RGB image. This challenging task is a key prerequisite for scene understanding in applications such as 3D scene reconstruction, autonomous driving, and AR. State-of-the-art methods usually fall into one of two categories: designing a complex network that is powerful enough to directly regress the depth map, or splitting the input into bins or windows to reduce computational complexity. The most popular benchmarks are the KITTI and NYUv2 datasets. Models are typically evaluated using root mean squared error (RMSE) or absolute relative error.

Source: Defocus Deblurring Using Dual-Pixel Data
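
As a rough illustration of the two evaluation metrics named in the description above, the sketch below computes RMSE and absolute relative error for a predicted depth map against ground truth. It is a minimal example, not any benchmark's official evaluation code: the function name `depth_metrics` is hypothetical, depth maps are represented as flat lists of per-pixel values, and masking out pixels with non-positive ground truth is an assumed convention (sparse ground truth is common, but each benchmark defines its own validity mask and depth caps).

```python
import math

def depth_metrics(pred, gt):
    """Return (rmse, abs_rel) over pixels with valid ground truth.

    pred, gt: flat sequences of per-pixel depth values in the same unit.
    Assumption: gt <= 0 marks a pixel without ground truth and is skipped.
    """
    pairs = [(p, g) for p, g in zip(pred, gt) if g > 0]
    if not pairs:
        raise ValueError("no valid ground-truth pixels")
    n = len(pairs)
    # RMSE: square root of the mean squared depth error.
    rmse = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    # Absolute relative error: mean of |pred - gt| / gt.
    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    return rmse, abs_rel

# Toy data: the last pixel has no ground truth and is ignored.
pred = [2.0, 4.0, 10.0, 7.0]
gt = [2.0, 5.0, 8.0, 0.0]
rmse, abs_rel = depth_metrics(pred, gt)
print(f"RMSE={rmse:.4f}  AbsRel={abs_rel:.4f}")
```

RMSE is expressed in the depth unit and is dominated by large errors at far range, while absolute relative error normalizes each error by the true depth, which is one reason the two are often reported together.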

Papers

Showing 151–200 of 876 papers

Title | Status | Hype
RM-Depth: Unsupervised Learning of Recurrent Monocular Depth in Dynamic Scenes | Code | 1
URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation | Code | 1
VA-DepthNet: A Variational Approach to Single Image Depth Prediction | Code | 1
Adversarial Training of Self-supervised Monocular Depth Estimation against Physical-World Attacks | Code | 1
Improving Deep Regression with Ordinal Entropy | Code | 1
SwinDepth: Unsupervised Depth Estimation using Monocular Sequences via Swin Transformer and Densely Cascaded Network | Code | 1
A Study on the Generality of Neural Network Structures for Monocular Depth Estimation | Code | 1
All in Tokens: Unifying Output Space of Visual Tasks via Soft Token | Code | 1
Trap Attention: Monocular Depth Estimation With Manual Traps | Code | 1
LightedDepth: Video Depth Estimation in Light of Limited Inference View Angles | Code | 1
MaskingDepth: Masked Consistency Regularization for Semi-supervised Monocular Depth Estimation | Code | 1
Mind The Edge: Refining Depth Edges in Sparsely-Supervised Monocular Depth Estimation | Code | 1
Multi-resolution Monocular Depth Map Fusion by Self-supervised Gradient-based Composition | Code | 1
Self-Supervised Surround-View Depth Estimation with Volumetric Feature Fusion | Code | 1
3DPPE: 3D Point Positional Encoding for Multi-Camera 3D Object Detection Transformers | Code | 1
The Monocular Depth Estimation Challenge | Code | 1
A Practical Stereo Depth System for Smart Glasses | Code | 1
LightDepth: A Resource Efficient Depth Estimation Approach for Dealing with Ground Truth Sparsity via Curriculum Learning | Code | 1
RCDPT: Radar-Camera fusion Dense Prediction Transformer | Code | 1
Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention | Code | 1
Frequency-Aware Self-Supervised Monocular Depth Estimation | Code | 1
Detaching and Boosting: Dual Engine for Scale-Invariant Self-Supervised Monocular Depth Estimation | Code | 1
IronDepth: Iterative Refinement of Single-View Depth using Surface Normal and its Uncertainty | Code | 1
Image Masking for Robust Self-Supervised Monocular Depth Estimation | Code | 1
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions | Code | 1
PlaneDepth: Self-supervised Depth Estimation via Orthogonal Planes | Code | 1
Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem | Code | 1
UDepth: Fast Monocular Depth Estimation for Visually-guided Underwater Robots | Code | 1
3D-PL: Domain Adaptive Depth Estimation with 3D-aware Pseudo-Labeling | Code | 1
Self-distilled Feature Aggregation for Self-supervised Monocular Depth Estimation | Code | 1
BiFuse++: Self-supervised and Efficient Bi-projection Fusion for 360 Depth Estimation | Code | 1
LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices | Code | 1
Depth Map Decomposition for Monocular Depth Estimation | Code | 1
MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer | Code | 1
TransDSSL: Transformer based Depth Estimation via Self-Supervised Learning | Code | 1
Gradient-based Uncertainty for Monocular Depth Estimation | Code | 1
Deconstructing Self-Supervised Monocular Reconstruction: The Design Decisions that Matter | Code | 1
RA-Depth: Resolution Adaptive Self-Supervised Monocular Depth Estimation | Code | 1
Latent Discriminant deterministic Uncertainty | Code | 1
Physical Attack on Monocular Depth Estimation with Optimal Adversarial Patches | Code | 1
Depthformer : Multiscale Vision Transformer For Monocular Depth Estimation With Local Global Information Fusion | Code | 1
Can Language Understand Depth? | Code | 1
LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation | Code | 1
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving | Code | 1
Dyna-DM: Dynamic Object-aware Self-supervised Monocular Depth Maps | Code | 1
Revealing the Dark Secrets of Masked Image Modeling | Code | 1
Deep Digging into the Generalization of Self-Supervised Monocular Depth Estimation | Code | 1
Visual Attention-based Self-supervised Absolute Depth Estimation using Geometric Priors in Autonomous Driving | Code | 1
Overcoming the Distance Estimation Bottleneck in Estimating Animal Abundance with Camera Traps | Code | 1
P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior | Code | 1

No leaderboard results yet.