SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 501550 of 617 papers

TitleStatusHype
Spiking Neural Networks for Frame-based and Event-based Single Object Localization0
Square Localization for Efficient and Accurate Object Detection0
Embodied Amodal Recognition: Learning to Move to Perceive Objects0
Embodied Visual Recognition0
Enabling Computer Vision Driven Assistive Devices for the Visually Impaired via Micro-architecture Design Exploration0
Ensemble of Part Detectors for Simultaneous Classification and Localization0
Entropy Guided Adversarial Model for Weakly Supervised Object Localization0
Equivariant Spatio-Temporal Self-Supervision for LiDAR Object Detection0
Erasing Integrated Learning: A Simple Yet Effective Approach for Weakly Supervised Object Localization0
SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without Cost0
Stereo Vision-based Semantic 3D Object and Ego-motion Tracking for Autonomous Driving0
Exploring Modality Guidance to Enhance VFM-based Feature Fusion for UDA in 3D Semantic Segmentation0
Exploring to learn visual saliency: The RL-IAC approach0
Extending Class Activation Mapping Using Gaussian Receptive Field0
STNet: Selective Tuning of Convolutional Networks for Object Localization0
Fast Object Localization Using a CNN Feature Map Based Multi-Scale Search0
FAST OBJECT LOCALIZATION VIA SENSITIVITY ANALYSIS0
Fast On-Line Kernel Density Estimation for Active Object Localization0
FAST-Splat: Fast, Ambiguity-Free Semantics Transfer in Gaussian Splatting0
"Sliced" Subwindow Search: a Sublinear-complexity Solution to the Maximum Rectangle Problem0
Few-shot Geometry-Aware Keypoint Localization0
Weakly-supervised Object Localization for Few-shot Learning and Fine-grained Few-shot Learning0
SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians0
Tactile Mapping and Localization from High-Resolution Tactile Imprints0
Few-shot Weakly-Supervised Object Detection via Directional Statistics0
Finding Fallen Objects Via Asynchronous Audio-Visual Integration0
Fine-Grained Attention for Weakly Supervised Object Localization0
LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation0
FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback0
Flash Photography for Data-Driven Hidden Scene Recovery0
Foreground Activation Maps for Weakly Supervised Object Localization0
Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization0
FPAN: Fine-grained and Progressive Attention Localization Network for Data Retrieval0
FRED: Towards a Full Rotation-Equivariance in Aerial Image Object Detection0
Weakly Supervised Object Localization Using Things and Stuff Transfer0
Fusing Saliency Maps with Region Proposals for Unsupervised Object Localization0
Gaussian Processes with Context-Supported Priors for Active Object Localization0
Generative Adversarial Networks for Unsupervised Object Co-localization0
Geometry Aligned Variational Transformer for Image-conditioned Layout Generation0
Boundary-aware Camouflaged Object Detection via Deformable Point Sampling0
Text-guided Zero-Shot Object Localization0
GloFinder: AI-empowered QuPath Plugin for WSI-level Glomerular Detection, Visualization, and Curation0
GLRD: Global-Local Collaborative Reason and Debate with PSL for 3D Open-Vocabulary Detection0
GP-select: Accelerating EM using adaptive subspace preselection0
GridMix: Strong regularization through local context mapping0
Stimulating Imagination: Towards General-purpose Object Rearrangement0
The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection0
Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing0
Ground-V: Teaching VLMs to Ground Complex Instructions in Pixels0
GTA: Guided Transfer of Spatial Attention from Object-Centric Representations0
Show:102550
← PrevPage 11 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified