SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization
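
The "sufficiently overlaps" criterion in the definition above is commonly measured with intersection-over-union (IoU). The following is a minimal sketch of that check, not the source paper's method. The `(x_min, y_min, x_max, y_max)` box layout, the function names, and the 0.5 default threshold are assumptions made for illustration; 0.5 is a widely used convention, but the definition above does not state a threshold.

```python
from typing import Tuple

# Assumed box layout: (x_min, y_min, x_max, y_max) in pixel coordinates.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two axis-aligned boxes."""
    # Width and height of the overlap region, clamped at zero when disjoint.
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h

    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter

    # Degenerate (zero-area) boxes have no meaningful overlap.
    return inter / union if union > 0 else 0.0


def is_correct_localization(proposal: Box, ground_truth: Box,
                            threshold: float = 0.5) -> bool:
    """True if the proposal overlaps the ground truth by at least `threshold`.

    The 0.5 default is an assumed convention, not a value from the source.
    """
    return iou(proposal, ground_truth) >= threshold


if __name__ == "__main__":
    ground_truth = (0, 0, 10, 10)
    print(is_correct_localization((0, 0, 10, 5), ground_truth))    # half the box
    print(is_correct_localization((20, 20, 30, 30), ground_truth))  # disjoint
```

Under this sketch, a localization system would be scored on whether its single proposal for a category passes the check, whereas a detection system would be scored over all instances of the category in the image.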

Papers

Showing 151-200 of 617 papers

Title | Status | Hype
Distilling Knowledge from Refinement in Multiple Instance Detection Networks | Code | 1
A Generic Visualization Approach for Convolutional Neural Networks | Code | 1
Improving Weakly-supervised Object Localization via Causal Intervention | Code | 1
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Code | 1
On Label Granularity and Object Localization | Code | 1
Online Refinement of Low-level Feature Based Activation Map for Weakly Supervised Object Localization | Code | 1
CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point Cloud | Code | 1
Open-World Weakly-Supervised Object Localization | Code | 1
LayerCAM: Exploring Hierarchical Class Activation Maps for Localization | Code | 1
Dual-attention Guided Dropblock Module for Weakly Supervised Object Localization | Code | 1
Dual Attention U-Net with Feature Infusion: Pushing the Boundaries of Multiclass Defect Segmentation | Code | 1
Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language Navigation | Code | 1
Dual Progressive Transformations for Weakly Supervised Semantic Segmentation | Code | 1
MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation | Code | 1
Unveiling the Potential of Structure Preserving for Weakly Supervised Object Localization | Code | 1
EDN: Salient Object Detection via Extremely-Downsampled Network | Code | 1
Efficient Object Localization Using Convolutional Networks | Code | 1
Egocentric Audio-Visual Object Localization | Code | 1
Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey | | 0
Bootstrapping Labelled Dataset Construction for Cow Tracking and Behavior Analysis | | 0
Deep learning architectures for automated image segmentation | | 0
Deep Joint Task Learning for Generic Object Extraction | | 0
How spatial frequencies and color drive object search in real-world scenes: A new eye-movement corpus | | 0
Deep Contextual Attention for Human-Object Interaction Detection | | 0
DeepAdaIn-Net: Deep Adaptive Device-Edge Collaborative Inference for Augmented Reality | | 0
Adaptively Denoising Proposal Collection for Weakly Supervised Object Localization | | 0
I3DOD: Towards Incremental 3D Object Detection via Prompting | | 0
Modelling Lips-State Detection Using CNN for Non-Verbal Communications | | 0
An Application of Deep Learning for Sweet Cherry Phenotyping using YOLO Object Detection | | 0
D2DF2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain Adaptation | | 0
BIV-Priv-Seg: Locating Private Content in Images Taken by People With Visual Impairments | | 0
Adaptively Denoising Proposal Collection for Weakly Supervised Object Localization | | 0
Cyclic Learning for Binaural Audio Generation and Localization | | 0
BirdSLAM: Monocular Multibody SLAM in Bird's-Eye View | | 0
A Model Generalization Study in Localizing Indoor Cows with COw LOcalization (COLO) dataset | | 0
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | | 0
Progressive Domain Adaptation with Contrastive Learning for Object Detection in the Satellite Imagery | | 0
Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion | | 0
Beyond Object Categories: Multi-Attribute Reference Understanding for Visual Grounding | | 0
A Memory-Augmented Multi-Task Collaborative Framework for Unsupervised Traffic Accident Detection in Driving Videos | | 0
Adaptive Label Smoothing | | 0
HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving | | 0
Could We Generate Cytology Images from Histopathology Images? An Empirical Study | | 0
A Markerless Deep Learning-based 6 Degrees of Freedom PoseEstimation for with Mobile Robots using RGB Data | | 0
How hard can it be? Estimating the difficulty of visual search in an image | | 0
Cooperative Multi-Monostatic Sensing for Object Localization in 6G Networks | | 0
Adapting Mask-RCNN for Automatic Nucleus Segmentation | | 0
Few-shot Geometry-Aware Keypoint Localization | | 0
3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-scale 3D Point Clouds | | 0
Heterogeneous Grid Convolution for Adaptive, Efficient, and Controllable Computation | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | OSMaN | RGSPL | 32.99 | | Unverified
2 | SUSA | RGSPL | 27.31 | | Unverified
3 | Shanks | RGSPL | 22.85 | | Unverified
4 | CVPR22 | RGSPL | 22.06 | | Unverified
5 | damm1 | RGSPL | 15.96 | | Unverified
6 | 1637 | RGSPL | 14.03 | | Unverified
7 | init. PREVALENT | RGSPL | 13.51 | | Unverified
8 | Airbert | RGSPL | 13.28 | | Unverified
9 | init. OSCAR | RGSPL | 10 | | Unverified
10 | SIA | RGSPL | 9.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VoxelNet | AP | 89.35 | | Unverified
2 | VoxelNet | AP | 89.35 | | Unverified
3 | Frustum PointNets | AP | 88.7 | | Unverified
4 | Frustum PointNets | AP | 81.2 | | Unverified
5 | VoxelNet | AP | 77.47 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 48.3 | | Unverified
2 | Frustum PointNets | AP | 47.2 | | Unverified
3 | Frustum PointNets | AP | 40.23 | | Unverified
4 | VoxelNet | AP | 38.11 | | Unverified
5 | VoxelNet | AP | 31.51 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 52.23 | | Unverified
2 | Frustum PointNets | AP | 50.22 | | Unverified
3 | Frustum PointNets | AP | 42.15 | | Unverified
4 | VoxelNet | AP | 40.74 | | Unverified
5 | VoxelNet | AP | 33.69 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VoxelNet | AP | 77.39 | | Unverified
2 | Frustum PointNets | AP | 75.33 | | Unverified
3 | Frustum PointNets | AP | 62.19 | | Unverified
4 | VoxelNet | AP | 57.73 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 75.38 | | Unverified
2 | Frustum PointNets | AP | 71.96 | | Unverified
3 | VoxelNet | AP | 66.7 | | Unverified
4 | VoxelNet | AP | 61.22 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 61.96 | | Unverified
2 | Frustum PointNets | AP | 56.77 | | Unverified
3 | VoxelNet | AP | 54.76 | | Unverified
4 | VoxelNet | AP | 48.36 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 58.09 | | Unverified
2 | Frustum PointNets | AP | 51.21 | | Unverified
3 | VoxelNet | AP | 46.13 | | Unverified
4 | VoxelNet | AP | 39.48 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Unified-IOXL | Localization (ablation) | 67 | | Unverified
2 | GPV-2 | Localization (ablation) | 53.6 | | Unverified
3 | Mask R-CNN | Localization (ablation) | 44.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 54.68 | | Unverified
2 | VoxelNet | AP | 50.55 | | Unverified
3 | Frustum PointNets | AP | 50.39 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT4-Vision 4-shot+CoT | Accuracy | 49.7 | | Unverified
2 | Gemini-Pro 4-shot+CoT | Accuracy | 33.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 84 | | Unverified
2 | VoxelNet | AP | 79.26 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 60.98 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | Precision | 88.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ours | CorLoc | 41.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ours | CorLoc | 47.45 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | F-Score | 88.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | Recall | 89.2 | | Unverified