SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 751800 of 2454 papers

TitleStatusHype
RadarRGBD A Multi-Sensor Fusion Dataset for Perception with RGB-D and mmWave RadarCode0
M3Depth: Wavelet-Enhanced Depth Estimation on Mars via Mutual Boosting of Dual-Modal Data0
IA-MVS: Instance-Focused Adaptive Depth Sampling for Multi-View StereoCode0
DB3D-L: Depth-aware BEV Feature Transformation for Accurate 3D Lane Detection0
MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos0
SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision0
Depth Anything with Any Prior0
JointDistill: Adaptive Multi-Task Distillation for Joint Depth Estimation and Scene Segmentation0
Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World0
Reinforcement Learning-Based Monocular Vision Approach for Autonomous UAV Landing0
ElectricSight: 3D Hazard Monitoring for Power Lines Using Low-Cost Sensors0
Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles0
MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection0
Self-Supervised Learning for Robotic Leaf Manipulation: A Hybrid Geometric-Neural Approach0
VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale RecoveryCode0
DELTA: Dense Depth from Events and LiDAR using Transformer's Attention0
In-situ and Non-contact Etch Depth Prediction in Plasma Etching via Machine Learning (ANN & BNN) and Digital Image Colorimetry0
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth0
LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment0
JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers0
Dense Geometry Supervision for Underwater Depth Estimation0
Occlusion-Aware Self-Supervised Monocular Depth Estimation for Weak-Texture Endoscopic Images0
DERD-Net: Learning Depth from Event-based Ray DensitiesCode0
UniVG: A Generalist Diffusion Model for Unified Image Generation and Editing0
VistaDepth: Frequency Modulation With Bias Reweighting For Enhanced Long-Range Depth Estimation0
Enhancing Pothole Detection and Characterization: Integrated Segmentation and Depth Estimation in Road Anomaly Systems0
Occlusion-Ordered Semantic Instance Segmentation0
TSGS: Improving Gaussian Splatting for Transparent Surface Reconstruction via Normal and De-lighting Priors0
Privacy-Preserving Operating Room Workflow Analysis using Digital Twins0
Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image0
TacoDepth: Towards Efficient Radar-Camera Depth Estimation with One-stage Fusion0
An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open WorldCode0
DeepWheel: Generating a 3D Synthetic Wheel Dataset for Design and Performance Evaluation0
Cut-and-Splat: Leveraging Gaussian Splatting for Synthetic Data GenerationCode0
Stereo-LiDAR Fusion by Semi-Global Matching With Discrete Disparity-Matching Cost and SemidensificationCode0
Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video0
RingMoE: Mixture-of-Modality-Experts Multi-Modal Foundation Models for Universal Remote Sensing Image Interpretation0
GaussianLSS -- Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting0
FreSca: Unveiling the Scaling Space in Diffusion Models0
A novel gesture interaction control method for rehabilitation lower extremity exoskeleton0
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors0
Monocular and Generalizable Gaussian Talking Head Animation0
ExScene: Free-View 3D Scene Reconstruction with Gaussian Splatting from a Single Image0
Detail-aware multi-view stereo network for depth estimationCode0
Blurry-Edges: Photon-Limited Depth Estimation from Defocused Boundaries0
EndoLRMGS: Complete Endoscopic Scene Reconstruction combining Large Reconstruction Modelling and Gaussian SplattingCode0
Intrinsic Image Decomposition for Robust Self-supervised Monocular Depth Estimation on Reflective Surfaces0
One Look is Enough: A Novel Seamless Patchwise Refinement for Zero-Shot Monocular Depth Estimation Models on High-Resolution Images0
MVSAnywhere: Zero-Shot Multi-View Stereo0
ICG-MVSNet: Learning Intra-view and Cross-view Relationships for Guidance in Multi-View Stereo0
Show:102550
← PrevPage 16 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified