SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 501550 of 617 papers

TitleStatusHype
EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023Code0
CoLo-CAM: Class Activation Mapping for Object Co-Localization in Weakly-Labeled Unconstrained VideosCode0
Object Detectors Emerge in Deep Scene CNNsCode0
Combinational Class Activation Maps for Weakly Supervised Object LocalizationCode0
Self-produced Guidance for Weakly-supervised Object LocalizationCode0
End-to-end detection-segmentation network with ROI convolutionCode0
Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial PoolingCode0
Background-aware Classification Activation Map for Weakly Supervised Object LocalizationCode0
Detecting Lesion Bounding Ellipses With Gaussian Proposal NetworksCode0
Concept Visualization: Explaining the CLIP Multi-modal Embedding Using WordNetCode0
Deep Learning for Identifying Iran's Cultural Heritage Buildings in Need of Conservation Using Image Classification and Grad-CAMCode0
Source-Free Domain Adaptation of Weakly-Supervised Object Localization Models for HistologyCode0
Evaluation and Comparison of Visual Language Models for Transportation Engineering ProblemsCode0
StarNet: towards Weakly Supervised Few-Shot Object DetectionCode0
Evaluation of Audio-Visual Alignments in Visually Grounded Speech ModelsCode0
Expeditious Saliency-guided Mix-up through Random Gradient ThresholdingCode0
Explaining Multi-modal Large Language Models by Analyzing their Vision PerceptionCode0
Towards Learning Monocular 3D Object Localization From 2D Labels using the Physical Laws of MotionCode0
Explaining image classifiers by removing input features using generative modelsCode0
Soft Proposal Networks for Weakly Supervised Object LocalizationCode0
SIXray : A Large-scale Security Inspection X-ray Benchmark for Prohibited Item Discovery in Overlapping ImagesCode0
Learning to Terminate in Object NavigationCode0
Strengthen Learning Tolerance for Weakly Supervised Object LocalizationCode0
Learning Transferable Reward for Query Object Localization with Policy AdaptationCode0
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and SegmentationCode0
WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural LanguageCode0
Leveraging Transformers for Weakly Supervised Object Localization in Unconstrained VideosCode0
Fast YOLO: A Fast You Only Look Once System for Real-time Embedded Object Detection in VideoCode0
Sub-frame Appearance and 6D Pose Estimation of Fast Moving ObjectsCode0
Rethinking Localization Map: Towards Accurate Object Perception with Self-Enhancement MapsCode0
Rethinking Object Detection in Retail StoresCode0
One-Shot General Object LocalizationCode0
Localizing Infinity-shaped fishes: Sketch-guided object localization in the wildCode0
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised LocalizationCode0
Contrastive Corpus Attribution for Explaining RepresentationsCode0
ALINA: Advanced Line Identification and Notation AlgorithmCode0
Unsupervised Object Localization with Representer Point SelectionCode0
Convolutional STN for Weakly Supervised Object LocalizationCode0
Attributional Robustness Training using Input-Gradient Spatial AlignmentCode0
Do Pre-trained Vision-Language Models Encode Object States?Code0
FlexLoc: Conditional Neural Networks for Zero-Shot Sensor Perspective Invariance in Object Localization with Distributed Multimodal SensorsCode0
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained VideosCode0
Focal and Efficient IOU Loss for Accurate Bounding Box RegressionCode0
Tracking using Numerous Anchor pointsCode0
SIXray: A Large-Scale Security Inspection X-Ray Benchmark for Prohibited Item Discovery in Overlapping ImagesCode0
Active Object Localization with Deep Reinforcement LearningCode0
Union-over-Intersections: Object Detection beyond Winner-Takes-AllCode0
Masked Multi-Query Slot Attention for Unsupervised Object DiscoveryCode0
YCB-LUMA: YCB Object Dataset with Luminance Keying for Object LocalizationCode0
Knowledge-guided Causal Intervention for Weakly-supervised Object LocalizationCode0
Show:102550
← PrevPage 11 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified