Sapiens: Foundation for Human Vision Models Aug 22, 2024 2D Human Pose Estimation 2D Pose Estimation
Code Code Available 9OmniGen: Unified Image Generation Sep 17, 2024 Edge Detection Image Generation
Code Code Available 7Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Jan 23, 2025 3D Reconstruction Camera Pose Estimation
Code Code Available 5DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding Nov 21, 2024 Long-tailed Object Detection Object
Code Code Available 5Neural Fields in Robotics: A Survey Oct 26, 2024 3D Reconstruction Autonomous Driving
Code Code Available 5MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Oct 4, 2024 4D reconstruction Camera Pose Estimation
Code Code Available 5VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks Jun 12, 2024 Image Generation Language Modeling
Code Code Available 5XFeat: Accelerated Features for Lightweight Image Matching Apr 30, 2024 CPU Keypoint detection and image matching
Code Code Available 5DUSt3R: Geometric 3D Vision Made Easy Dec 21, 2023 3D Reconstruction Camera Calibration
Code Code Available 5AlphaPose: Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time Nov 7, 2022 Knowledge Distillation Multi-Person Pose Estimation
Code Code Available 5SuperAnimal pretrained pose estimation models for behavioral analysis Mar 14, 2022 2D Pose Estimation Animal Pose Estimation
Code Code Available 5SpatialTrackerV2: 3D Point Tracking Made Easy Jul 16, 2025 3D Reconstruction Camera Pose Estimation
Code Code Available 4Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Mar 31, 2025 4D reconstruction Camera Pose Estimation
Code Code Available 4MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Dec 9, 2024 Camera Calibration Camera Pose Estimation
Code Code Available 4One Diffusion to Generate Them All Nov 25, 2024 All Camera Pose Estimation
Code Code Available 4No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images Oct 31, 2024 3D Reconstruction Generalizable Novel View Synthesis
Code Code Available 4RUMI: Rummaging Using Mutual Information Aug 19, 2024 Model Predictive Control Object
Code Code Available 4Cameras as Rays: Pose Estimation via Ray Diffusion Feb 22, 2024 3D Reconstruction Camera Pose Estimation
Code Code Available 4GIM: Learning Generalizable Image Matcher From Internet Videos Feb 16, 2024 3D Reconstruction Camera Pose Estimation
Code Code Available 4PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency Jan 17, 2024 GPU Incremental Learning
Code Code Available 4FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects Dec 13, 2023 3D Object Detection 3D Object Tracking
Code Code Available 4SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM Dec 4, 2023 Camera Pose Estimation Novel View Synthesis
Code Code Available 4SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models Nov 13, 2023 Described Object Detection Language Modeling
Code Code Available 4Effective Whole-body Pose Estimation with Two-stages Distillation Jul 29, 2023 2D Human Pose Estimation Knowledge Distillation
Code Code Available 4LightGlue: Local Feature Matching at Light Speed Jun 23, 2023 3D Reconstruction Camera Pose Estimation
Code Code Available 4Visual Attention Network Feb 20, 2022 image-classification Image Classification
Code Code Available 4BlazePose: On-device Real-time Body Pose tracking Jun 17, 2020 2D Human Pose Estimation 3D Human Pose Estimation
Code Code Available 4SupeRANSAC: One RANSAC to Rule Them All Jun 5, 2025 All Pose Estimation
Code Code Available 3CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments May 10, 2025 Pose Estimation
Code Code Available 3LiftFeat: 3D Geometry-Aware Local Feature Matching May 6, 2025 3D geometry Depth Estimation
Code Code Available 3CoMotion: Concurrent Multi-person 3D Motion Apr 16, 2025 3D Pose Estimation Pose Estimation
Code Code Available 3Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video Mar 27, 2025 Camera Pose Estimation Depth Estimation
Code Code Available 3Relative Pose Estimation through Affine Corrections of Monocular Depth Priors Jan 9, 2025 Depth Estimation Monocular Depth Estimation
Code Code Available 3ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle Jan 4, 2025 Pose Estimation
Code Code Available 3Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization Dec 11, 2024 Pose Estimation Visual Localization
Code Code Available 3Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle Dec 2, 2024 Human Instance Segmentation Pose-Based Human Instance Segmentation
Code Code Available 3emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation Dec 2, 2024 Anatomy Hand Pose Estimation
Code Code Available 3WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild Sep 18, 2024 3D Hand Pose Estimation Hand Detection
Code Code Available 3TCFormer: Visual Recognition via Token Clustering Transformer Jul 16, 2024 Clustering image-classification
Code Code Available 3MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds May 27, 2024 4D reconstruction Pose Estimation
Code Code Available 3Deep Learning-Based Object Pose Estimation: A Comprehensive Survey May 13, 2024 Deep Learning Object
Code Code Available 3DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector Apr 13, 2024 Data Augmentation Key Point Matching
Code Code Available 3Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects Mar 25, 2024 Action Recognition Motion Generation
Code Code Available 3WHAC: World-grounded Humans and Cameras Mar 19, 2024 Camera Pose Estimation Pose Estimation
Code Code Available 3What Matters When Repurposing Diffusion Models for General Dense Perception Tasks? Mar 10, 2024 Depth Estimation Image Matting
Code Code Available 3SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM Feb 5, 2024 3D Semantic Segmentation Camera Pose Estimation
Code Code Available 3Beyond Appearance: a Semantic Controllable Self-Supervised Learning Framework for Human-Centric Visual Tasks Mar 30, 2023 Human Parsing Pedestrian Attribute Recognition
Code Code Available 3EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation Mar 22, 2023 3D Object Detection 6D Pose Estimation using RGB
Code Code Available 3ViTPose++: Vision Transformer for Generic Body Pose Estimation Dec 7, 2022 2D Human Pose Estimation Animal Pose Estimation
Code Code Available 3MotionBERT: A Unified Perspective on Learning Human Motion Representations Oct 12, 2022 3D Human Pose Estimation 3D Pose Estimation
Code Code Available 3