SOTAVerified

Camera Pose Estimation

Camera pose estimation is a crucial task in computer vision and robotics that involves determining the position and orientation (pose) of a camera relative to a given reference frame. This task is essential for various applications, such as augmented reality, 3D reconstruction, SLAM, and autonomous navigation.

Papers

Showing 150 of 304 papers

TitleStatusHype
π^3: Scalable Permutation-Equivariant Visual Geometry Learning0
BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images0
SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting RepresentationCode0
SpatialTrackerV2: 3D Point Tracking Made EasyCode4
Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures0
RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories0
Princeton365: A Diverse Dataset with Accurate Camera PoseCode1
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation0
4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians0
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated VideosCode2
PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth0
T-Graph: Enhancing Sparse-view Camera Pose Estimation by Pairwise Translation Graph0
Large-scale visual SLAM for in-the-wild videos0
SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos0
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMsCode2
Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction0
Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction0
Regist3R: Incremental Registration with Stereo Foundation Model0
A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds0
Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video0
Easi3R: Estimating Disentangled Motion from DUSt3R Without TrainingCode4
LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds0
GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion0
STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM0
Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single VideoCode3
Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image0
VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames0
Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion0
HumanMM: Global Human Motion Recovery from Multi-shot VideosCode2
GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting0
EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth RegistrationCode1
RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges0
Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison0
RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes0
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views0
Light3R-SfM: Towards Feed-forward Structure-from-Motion0
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward PassCode5
RePoseD: Efficient Relative Pose Estimation With Known Depth InformationCode1
JOG3R: Towards 3D-Consistent Video Generators0
EquiPose: Exploiting Permutation Equivariance for Relative Camera Pose Estimation0
Learning to Filter Outlier Edges in Global SfMCode1
KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences0
Reconstructing People, Places, and CamerasCode2
Scaling 4D Representations0
Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos0
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 SecondsCode4
Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential InformationCode1
AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers0
SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting SynthesisCode2
One Diffusion to Generate Them AllCode4
Show:102550
← PrevPage 1 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Monodepth2Average Translational Error et[%]43.21Unverified
2SfMLearnerAverage Translational Error et[%]29.78Unverified
3GeoNetAverage Translational Error et[%]26.31Unverified
4SC-DepthAverage Translational Error et[%]12.2Unverified
5DeepMatchVOAverage Translational Error et[%]11.05Unverified
6SCIPaDAverage Translational Error et[%]8.63Unverified
7Manydepth2Average Translational Error et[%]7.15Unverified