SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 351400 of 617 papers

TitleStatusHype
Improving Few-shot Learning with Weakly-supervised Object Localization0
Improving Weakly-Supervised Object Localization By Micro-Annotation0
Improving Weakly-Supervised Object Localization Using Adversarial Erasing and Pseudo Label0
Information Entropy Based Feature Pooling for Convolutional Neural Networks0
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization0
Top-GAP: Integrating Size Priors in CNNs for more Interpretability, Robustness, and Bias Mitigation0
I see what you hear: a vision-inspired method to localize words0
Is Object Localization for Free? - Weakly-Supervised Learning With Convolutional Neural Networks0
Iterative Spectral Clustering for Unsupervised Object Localization0
Joint ANN-SNN Co-training for Object Localization and Image Segmentation0
Joint SFM and Detection Cues for Monocular 3D Localization in Road Scenes0
Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration0
Towards Accurate Localization by Instance Search0
Adaptively Denoising Proposal Collection for Weakly Supervised Object Localization0
Categorical Knowledge Fused Recognition: Fusing Hierarchical Knowledge with Image Classification through Aligning and Deep Metric Learning0
Adaptively Denoising Proposal Collection forWeakly Supervised Object Localization0
Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking0
Latent Constrained Correlation Filters for Object Localization0
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization0
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions0
Learning Consistency from High-quality Pseudo-labels for Weakly Supervised Object Localization0
Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models0
Learning from Counting: Leveraging Temporal Classification for Weakly Supervised Object Localization and Detection0
Learning from Web Data: the Benefit of Unsupervised Object Localization0
Learning Instance Activation Maps for Weakly Supervised Instance Segmentation0
Learning Multi-Modal Class-Specific Tokens for Weakly Supervised Dense Object Localization0
Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images0
Weakly Supervised Object Localization with Multi-fold Multiple Instance Learning0
Learning task-agnostic representation via toddler-inspired learning0
Learning to Detect Instance-level Salient Objects Using Complementary Image Labels0
Learning to Grasp Without Seeing0
Learning to search for and detect objects in foveal images using deep learning0
Leveraging Activations for Superpixel Explanations0
Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos0
LID 2020: The Learning from Imperfect Data Challenge Results0
LIT: Light-field Inference of Transparency for Refractive Object Localization0
Localization: A Missing Link in the Pipeline of Object Matching and Registration0
Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method0
Adaptive Label Smoothing0
Locating 3D Object Proposals: A Depth-Based Online Approach0
Location-free Human Pose Estimation0
Towards Two-Stream Foveation-based Active Vision Learning0
Language-guided Scale-aware MedSegmentor for Lesion Segmentation in Medical Imaging0
LUDVIG: Learning-free Uplifting of 2D Visual features to Gaussian Splatting scenes0
Adapting Mask-RCNN for Automatic Nucleus Segmentation0
Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval0
Improved Semantic Segmentation of Tuberculosis-consistent findings in Chest X-rays Using Augmented Training of Modality-specific U-Net Models with Weak Localizations0
Maximum Cohesive Grid of Superpixels for Fast Object Localization0
Max-Margin Structured Output Regression for Spatio-Temporal Action Localization0
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts0
Show:102550
← PrevPage 8 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified