SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 901950 of 2454 papers

TitleStatusHype
Self-supervised Monocular Depth Estimation: Let's Talk About The WeatherCode1
NVDS+: Towards Efficient and Versatile Neural Stabilizer for Video Depth EstimationCode1
RayMVSNet++: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo0
Multi-Object Discovery by Low-Dimensional Object Motion0
Learning Subjective Time-Series Data via Utopia Label Distribution Approximation0
DFR: Depth from Rotation by Uncalibrated Image Rectification with Latitudinal Motion AssumptionCode0
TransPose: A Transformer-based 6D Object Pose Estimation Network with Depth Refinement0
Depth Estimation Analysis of Orthogonally Divergent Fisheye Cameras with Distortion Removal0
SVDM: Single-View Diffusion Model for Pseudo-Stereo 3D Object Detection0
LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion0
FlipNeRF: Flipped Reflection Rays for Few-shot Novel View SynthesisCode1
GMM: Delving into Gradient Aware and Model Perceive Depth Mining for Monocular 3D Detection0
Towards Zero-Shot Scale-Aware Monocular Depth EstimationCode2
MIMIC: Masked Image Modeling with Image CorrespondencesCode1
Cross-modal transformers for infrared and visible image fusionCode1
Learnable Differencing Center for Nighttime Depth PerceptionCode1
Neural 360^ Structured Light with Learned Metasurfaces0
Continuous Online Extrinsic Calibration of Fisheye Camera and LiDAR0
One at a Time: Progressive Multi-step Volumetric Probability Learning for Reliable 3D Scene Perception0
Self-supervised Multi-task Learning Framework for Safety and Health-Oriented Connected Driving Environment Perception using Onboard Camera0
Depth and DOF Cues Make A Better Defocus Blur DetectorCode1
BEVScope: Enhancing Self-Supervised Depth Estimation Leveraging Bird's-Eye-View in Dynamic ScenariosCode0
Tame a Wild Camera: In-the-Wild Monocular Camera CalibrationCode1
Understanding Depth Map Progressively: Adaptive Distance Interval Separation for Monocular 3d Object Detection0
C2F2NeUS: Cascade Cost Frustum Fusion for High Fidelity and Generalizable Neural Surface Reconstruction0
SimpleMapping: Real-Time Visual-Inertial Dense Mapping with Deep Multi-View Stereo0
Predict to Detect: Prediction-guided 3D Object Detection using Sequential ImagesCode1
Lightweight Monocular Depth Estimation via Token-Sharing Transformer0
SparseTrack: Multi-Object Tracking by Performing Scene Decomposition based on Pseudo-DepthCode1
A Dynamic Feature Interaction Framework for Multi-task Visual Perception0
Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data0
The Surprising Effectiveness of Diffusion Models for Optical Flow and Monocular Depth Estimation0
Towards In-context Scene UnderstandingCode1
PanoGRF: Generalizable Spherical Radiance Fields for Wide-baseline Panoramas0
A technique to jointly estimate depth and depth uncertainty for unmanned aerial vehiclesCode1
HQDec: Self-Supervised Monocular Depth Estimation Based on a High-Quality DecoderCode0
DaRF: Boosting Radiance Fields from Sparse Inputs with Monocular Depth AdaptationCode1
Independent Component Alignment for Multi-Task Learning0
OccCasNet: Occlusion-aware Cascade Cost Volume for Light Field Depth EstimationCode1
USIM-DAL: Uncertainty-aware Statistical Image Modeling-based Dense Active Learning for Super-resolution0
SimHaze: game engine simulated data for real-world dehazing0
SING: A Plug-and-Play DNN Learning TechniqueCode0
Polarimetric Imaging for Perception0
AutoDepthNet: High Frame Rate Depth Map Reconstruction using Commodity Depth and RGB Cameras0
FEDORA: Flying Event Dataset fOr Reactive behAvior0
Gated Stereo: Joint Depth Estimation from Gated and Wide-Baseline Active Stereo Cues0
Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance FieldsCode1
Brain Captioning: Decoding human brain activity into images and textCode1
PanelNet: Understanding 360 Indoor Environment via Panel Representation0
MetaMorphosis: Task-oriented Privacy Cognizant Feature Generation for Multi-task Learning0
Show:102550
← PrevPage 19 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified