SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 351400 of 617 papers

TitleStatusHype
Who Make Drivers Stop? Towards Driver-centric Risk Assessment: Risk Object Identification via Causal Inference0
Parity Models: A General Framework for Coding-Based Resilience in ML Inference0
Particle filter re-detection for visual tracking via correlation filters0
Parts-Based Articulated Object Localization in Clutter Using Belief Propagation0
PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy0
Photorealistic Image Synthesis for Object Instance Detection0
Plenoptic Monte Carlo Object Localization for Robot Grasping under Layered Translucency0
PMR-Net: Parallel Multi-Resolution Encoder-Decoder Network Framework for Medical Image Segmentation0
POEM: Precise Object-level Editing via MLLM control0
PointArena: Probing Multimodal Grounding Through Language-Guided Pointing0
Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese Object Tracking0
Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection0
Video Instance Segmentation by Instance Flow Assembly0
3D Spatial Understanding in MLLMs: Disambiguation and Evaluation0
Polyp-artifact relationship analysis using graph inductive learned representations0
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization0
Prediction Accuracy & Reliability: Classification and Object Localization under Distribution Shift0
3D Object Localization Using 2D Estimates for Computer Vision Applications0
Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization0
Probing the Mid-level Vision Capabilities of Self-Supervised Learning0
Probing the Role of Positional Information in Vision-Language Models0
Probing the Role of Positional Information in Vision-Language Models0
Video OWL-ViT: Temporally-consistent open-world localization in video0
Generalized Coarse-to-Fine Visual Recognition with Progressive Training0
Vision-based Real-Time Aerial Object Localization and Tracking for UAV Sensing System0
Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch0
RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base0
R-AGNO-RPN: A LIDAR-Camera Region Deep Network for Resolution-Agnostic Detection0
Rapid Training Data Creation by Synthesizing Medical Images for Classification and Localization0
Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside Cameras0
Real-world Instance-specific Image Goal Navigation: Bridging Domain Gaps via Contrastive Learning0
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks0
Visual-Linguistic Agent: Towards Collaborative Contextual Object Reasoning0
Visually Guided Spatial Relation Extraction from Text0
Deep Learning Methods and Applications for Region of Interest Detection in Dermoscopic Images0
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations0
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds0
Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations0
Why is plausibility surprisingly problematic as an XAI criterion?0
Rethinking the Localization in Weakly Supervised Object Localization0
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding0
Foveated Retinotopy Improves Classification and Localization in CNNs0
WoMAP: World Models For Embodied Open-Vocabulary Object Localization0
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi-Supervised Semantic Segmentation0
Revisiting Dilated Convolution: A Simple Approach for Weakly- and Semi- Supervised Semantic Segmentation0
1-HKUST: Object Detection in ILSVRC 20140
3D-Aware Object Localization using Gaussian Implicit Occupancy Function0
Weakly Supervised Foreground Learning for Weakly Supervised Localization and Detection0
Robust Scale Estimation in Real-Time Monocular SFM for Autonomous Driving0
ROML: A Robust Feature Correspondence Approach for Matching Objects in A Set of Images0
Show:102550
← PrevPage 8 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified