SOTAVerified

Camera Pose Estimation

Camera pose estimation is a crucial task in computer vision and robotics that involves determining the position and orientation (pose) of a camera relative to a given reference frame. This task is essential for various applications, such as augmented reality, 3D reconstruction, SLAM, and autonomous navigation.

Papers

Showing 150 of 304 papers

TitleStatusHype
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward PassCode5
MonST3R: A Simple Approach for Estimating Geometry in the Presence of MotionCode5
SpatialTrackerV2: 3D Point Tracking Made EasyCode4
Easi3R: Estimating Disentangled Motion from DUSt3R Without TrainingCode4
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 SecondsCode4
One Diffusion to Generate Them AllCode4
Cameras as Rays: Pose Estimation via Ray DiffusionCode4
GIM: Learning Generalizable Image Matcher From Internet VideosCode4
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAMCode4
LightGlue: Local Feature Matching at Light SpeedCode4
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single VideoCode3
WHAC: World-grounded Humans and CamerasCode3
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAMCode3
RoMa: Robust Dense Feature MatchingCode3
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated VideosCode2
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsCode2
HumanMM: Global Human Motion Recovery from Multi-shot VideosCode2
Reconstructing People, Places, and CamerasCode2
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting SynthesisCode2
Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose InitializationCode2
Enhancing Soccer Camera Calibration Through Keypoint ExploitationCode2
DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car ReconstructionCode2
GLACE: Global Local Accelerated Coordinate EncodingCode2
Learning to Produce Semi-dense Correspondences for Visual LocalizationCode2
COLMAP-Free 3D Gaussian SplattingCode2
PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle AdjustmentCode2
SiLK -- Simple Learned KeypointsCode2
CroCo: Self-Supervised Pre-training for 3D Vision Tasks by Cross-View CompletionCode2
MeshLoc: Mesh-Based Visual LocalizationCode2
ParticleSfM: Exploiting Dense Point Trajectories for Localizing Moving Cameras in the WildCode2
PVO: Panoptic Visual OdometryCode2
Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmarkCode2
ViewFormer: NeRF-free Neural Rendering from Few Images Using TransformersCode2
DKM: Dense Kernelized Feature Matching for Geometry EstimationCode2
Princeton365: A Diverse Dataset with Accurate Camera PoseCode1
EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth RegistrationCode1
RePoseD: Efficient Relative Pose Estimation With Known Depth InformationCode1
Learning to Filter Outlier Edges in Global SfMCode1
Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential InformationCode1
Activating Self-Attention for Multi-Scene Absolute Pose RegressionCode1
PuzzleBoard: A New Camera Calibration Pattern with Position EncodingCode1
GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian SplattingCode1
SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint LearningCode1
DepthMOT: Depth Cues Lead to a Strong Multi-Object TrackerCode1
Human Mesh Recovery from Arbitrary Multi-view ImagesCode1
Extreme Two-View Geometry From Object Poses with Diffusion ModelsCode1
iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse ViewsCode1
Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monocular Depth Estimation in Dynamic ScenesCode1
Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo PairsCode1
USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance FieldsCode1
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Monodepth2Average Translational Error et[%]43.21Unverified
2SfMLearnerAverage Translational Error et[%]29.78Unverified
3GeoNetAverage Translational Error et[%]26.31Unverified
4SC-DepthAverage Translational Error et[%]12.2Unverified
5DeepMatchVOAverage Translational Error et[%]11.05Unverified
6SCIPaDAverage Translational Error et[%]8.63Unverified
7Manydepth2Average Translational Error et[%]7.15Unverified