SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 151200 of 617 papers

TitleStatusHype
Distilling Knowledge from Refinement in Multiple Instance Detection NetworksCode1
A Generic Visualization Approach for Convolutional Neural NetworksCode1
IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language ModelsCode1
SoccerNet-v3D: Leveraging Sports Broadcast Replays for 3D Scene UnderstandingCode1
Re-Attention Transformer for Weakly Supervised Object LocalizationCode1
InstanceRefer: Cooperative Holistic Understanding for Visual Grounding on Point Clouds through Instance Multi-level Contextual ReferringCode1
CIA-SSD: Confident IoU-Aware Single-Stage Object Detector From Point CloudCode1
TDAM: Top-Down Attention Module for Contextually Guided Feature Selection in CNNsCode1
RfD-Net: Point Scene Understanding by Semantic Instance ReconstructionCode1
Dual-attention Guided Dropblock Module for Weakly Supervised Object LocalizationCode1
Training Interpretable Convolutional Neural Networks by Differentiating Class-specific FiltersCode1
Agent Journey Beyond RGB: Unveiling Hybrid Semantic-Spatial Environmental Representations for Vision-and-Language NavigationCode1
Dual Progressive Transformations for Weakly Supervised Semantic SegmentationCode1
Multi-class Token Transformer for Weakly Supervised Semantic SegmentationCode1
LayerCAM: Exploring Hierarchical Class Activation Maps for LocalizationCode1
EDN: Salient Object Detection via Extremely-Downsampled NetworkCode1
Efficient Object Localization Using Convolutional NetworksCode1
Egocentric Audio-Visual Object LocalizationCode1
Deep Learning for Identifying Iran's Cultural Heritage Buildings in Need of Conservation Using Image Classification and Grad-CAMCode0
Personal Fixations-Based Object Segmentation with Object Localization and Boundary PreservationCode0
PixelCAM: Pixel Class Activation Mapping for Histology Image Classification and ROI LocalizationCode0
PEEKABOO: Hiding parts of an image for unsupervised object localizationCode0
DAP: Detection-Aware Pre-training with Weak SupervisionCode0
DANet: Divergent Activation for Weakly Supervised Object LocalizationCode0
3-Dimensional Sonic Phase-invariant Echo LocalizationCode0
Attributional Robustness Training using Input-Gradient Spatial AlignmentCode0
Trade-offs in Fine-tuned Diffusion Models Between Accuracy and InterpretabilityCode0
One-Shot General Object LocalizationCode0
CPR++: Object Localization via Single Coarse Point SupervisionCode0
Count-ception: Counting by Fully Convolutional Redundant CountingCode0
Co-Segmentation without any Pixel-level Supervision with Application to Large-Scale Sketch ClassificationCode0
Object Detectors Emerge in Deep Scene CNNsCode0
Object Localization under Single Coarse Point SupervisionCode0
Background-aware Classification Activation Map for Weakly Supervised Object LocalizationCode0
ALWOD: Active Learning for Weakly-Supervised Object DetectionCode0
Convolutional STN for Weakly Supervised Object LocalizationCode0
Object Detection via a Multi-Region and Semantic Segmentation-Aware CNN ModelCode0
Object detection via a multi-region & semantic segmentation-aware CNN modelCode0
Contrastive Corpus Attribution for Explaining RepresentationsCode0
Fast YOLO: A Fast You Only Look Once System for Real-time Embedded Object Detection in VideoCode0
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised LocalizationCode0
Multispectral Detection Transformer with Infrared-Centric Sensor FusionCode0
Progressive Representation Adaptation for Weakly Supervised Object LocalizationCode0
Min-Entropy Latent Model for Weakly Supervised Object DetectionCode0
MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote SensingCode0
Explaining Multi-modal Large Language Models by Analyzing their Vision PerceptionCode0
Concept Visualization: Explaining the CLIP Multi-modal Embedding Using WordNetCode0
Expeditious Saliency-guided Mix-up through Random Gradient ThresholdingCode0
Evaluation of Audio-Visual Alignments in Visually Grounded Speech ModelsCode0
All-pairs Consistency Learning for Weakly Supervised Semantic SegmentationCode0
Show:102550
← PrevPage 4 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified