SOTAVerified

Depth Estimation

Depth Estimation is the task of measuring the distance of each pixel relative to the camera. Depth is extracted from either monocular (single) or stereo (multiple views of a scene) images. Traditional methods use multi-view geometry to find the relationship between the images. Newer methods can directly estimate depth by minimizing the regression loss, or by learning to generate a novel view from a sequence. The most popular benchmarks are KITTI and NYUv2. Models are typically evaluated according to a RMS metric.

Source: DIODE: A Dense Indoor and Outdoor DEpth Dataset

Papers

Showing 19011950 of 2454 papers

TitleStatusHype
Flowing from Words to Pixels: A Framework for Cross-Modality Evolution0
Flowing from Words to Pixels: A Noise-Free Framework for Cross-Modality Evolution0
Flow-NeRF: Joint Learning of Geometry, Poses, and Dense Flow within Unified Neural Representations0
Focal Depth Estimation: A Calibration-Free, Subject- and Daytime Invariant Approach0
FocDepthFormer: Transformer with latent LSTM for Depth Estimation from Focal Stack0
Forest Inspection Dataset for Aerial Semantic Segmentation and Depth Estimation0
Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation0
FoVA-Depth: Field-of-View Agnostic Depth Estimation for Cross-Dataset Generalization0
FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI0
FP-Stereo: Hardware-Efficient Stereo Vision for Embedded Applications0
Fractal Pyramid Networks0
Free Supervision From Video Games0
FreSca: Unveiling the Scaling Space in Diffusion Models0
From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction0
From-Ground-To-Objects: Coarse-to-Fine Self-supervised Monocular Depth Estimation of Dynamic Objects with Ground Contact Prior0
From Image to Video: An Empirical Study of Diffusion Representations0
From Pixels to Damage Severity: Estimating Earthquake Impacts Using Semantic Segmentation of Social Media Images0
From Real to Synthetic and Back: Synthesizing Training Data for Multi-Person Scene Understanding0
From Single Images to Motion Policies via Video-Generation Environment Representations0
FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models0
FS-Depth: Focal-and-Scale Depth Estimation from a Single Image in Unseen Indoor Scene0
FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving0
Fully Convolutional Networks for Monocular Retinal Depth Estimation and Optic Disc-Cup Segmentation0
Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing0
Fully Hyperbolic Convolutional Neural Networks0
Towards Long-Range 3D Object Detection for Autonomous Vehicles0
FusionDepth: Complement Self-Supervised Monocular Depth Estimation with Cost Volume0
FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Object Detection0
FusionMapping: Learning Depth Prediction with Monocular Images and 2D Laser Scans0
Fusion of Real Time Thermal Image and 1D/2D/3D Depth Laser Readings for Remote Thermal Sensing in Industrial Plants by Means of UAVs and/or Robots0
Fusion of stereo and still monocular depth estimates in a self-supervised learning context0
FutureDepth: Learning to Predict the Future Improves Video Depth Estimation0
GAA-TSO: Geometry-Aware Assisted Depth Completion for Transparent and Specular Objects0
GANVO: Unsupervised Deep Monocular Visual Odometry and Depth Estimation with Generative Adversarial Networks0
GARF:Geometry-Aware Generalized Neural Radiance Field0
GarmentCrafter: Progressive Novel View Synthesis for Single-View 3D Garment Reconstruction and Editing0
Gated Stereo: Joint Depth Estimation from Gated and Wide-Baseline Active Stereo Cues0
GaussianLSS -- Toward Real-world BEV Perception: Depth Uncertainty Estimation via Gaussian Splatting0
Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors0
Gaussian Splashing: Direct Volumetric Rendering Underwater0
Gaze-Vergence-Controlled See-Through Vision in Augmented Reality0
GenDepth: Generalizing Monocular Depth Estimation for Arbitrary Camera Parameters via Ground Plane Embedding0
Generalized Face Anti-Spoofing via Multi-Task Learning and One-Side Meta Triplet Loss0
Generalizing Interactive Backpropagating Refinement for Dense Prediction Networks0
Generalizing monocular colonoscopy image depth estimation by uncertainty-based global and local fusion network0
Generalizing to the Open World: Deep Visual Odometry with Online Adaptation0
Generative Omnimatte: Learning to Decompose Video into Layers0
Generic Perceptual Loss for Modeling Structured Output Dependencies0
GenMM: Geometrically and Temporally Consistent Multimodal Data Generation for Video and LiDAR0
GEN-SLAM: Generative Modeling for Monocular Simultaneous Localization and Mapping0
Show:102550
← PrevPage 39 of 50Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OmniDepthRMSE0.62Unverified
2SphereDepthRMSE0.45Unverified
3Jin et al.RMSE0.42Unverified
4BiFuse with fusionRMSE0.41Unverified
5HoHoNet (ResNet-101)RMSE0.38Unverified
6PanoDepthRMSE0.37Unverified
7BiFuse++RMSE0.37Unverified
8UniFuse with fusionRMSE0.37Unverified
9DisConvRMSE0.37Unverified
10SliceNetRMSE0.37Unverified
#ModelMetricClaimedVerifiedStatus
1A2JmAP8.61Unverified
2PAD-NetRMS0.79Unverified
3MS-CRFRMS0.59Unverified
4DORNRMS0.51Unverified
5FreeformRMS0.43Unverified
6Optimized, freeformRMS0.43Unverified
7VNLRMS0.42Unverified
8BTSRMS0.41Unverified
9TransDepth (AGD+ ViT)RMS0.37Unverified
10AdaBinsRMS0.36Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.35Unverified
2MIDASAbs Rel0.31Unverified
3Bhattacharjee et al.Abs Rel0.25Unverified
#ModelMetricClaimedVerifiedStatus
1T2NetAbs Rel0.49Unverified
2MIDASAbs Rel0.42Unverified
3Bhattacharjee et al.Abs Rel0.38Unverified
#ModelMetricClaimedVerifiedStatus
1LeReSabsolute relative error0.1Unverified
2DELTASabsolute relative error0.09Unverified
3Distill Any Depthabsolute relative error0.04Unverified
#ModelMetricClaimedVerifiedStatus
1SDC-DepthRMSE6.92Unverified
2SwinMTLRMSE6.35Unverified
#ModelMetricClaimedVerifiedStatus
1AIP-BrownDelta < 1.250.36Unverified
2LeResDelta < 1.250.23Unverified
#ModelMetricClaimedVerifiedStatus
1H-Net (Ours)Absolute relative error (AbsRel)0.09Unverified
2H-Net (Ours) Full EigenAbsolute relative error (AbsRel)0.08Unverified
#ModelMetricClaimedVerifiedStatus
1GLPDepthDelta < 1.250.43Unverified
2SRDINET (Model A)Delta < 1.250.4Unverified
#ModelMetricClaimedVerifiedStatus
1Atlas (finetuned)RMSE0.17Unverified
2Atlas (plain)RMSE0.17Unverified
#ModelMetricClaimedVerifiedStatus
1LFattNetBadPix(0.01)17.23Unverified
#ModelMetricClaimedVerifiedStatus
1LightDepthNumber of parameters (M)42.6Unverified
#ModelMetricClaimedVerifiedStatus
1UniFuseAbs Rel0.11Unverified
#ModelMetricClaimedVerifiedStatus
1X-TC (Cross-Task Consistency)L1 error1.63Unverified