SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 201250 of 617 papers

TitleStatusHype
Open-World Weakly-Supervised Object LocalizationCode1
Learning to search for and detect objects in foveal images using deep learning0
WildRefer: 3D Object Localization in Large-scale Dynamic Scenes with Multi-modal Visual Data and Natural LanguageCode0
MOST: Multiple Object localization with Self-supervised Transformers for object discovery0
Sketch-based Video Object LocalizationCode0
Trade-offs in Fine-tuned Diffusion Models Between Accuracy and InterpretabilityCode0
Why is plausibility surprisingly problematic as an XAI criterion?0
Few-shot Geometry-Aware Keypoint Localization0
Audio-Visual Grouping Network for Sound Localization from MixturesCode1
CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-MatchingCode1
Egocentric Audio-Visual Object LocalizationCode1
Spatial-Aware Token for Weakly Supervised Object LocalizationCode1
DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery0
CoLo-CAM: Class Activation Mapping for Object Co-Localization in Weakly-Labeled Unconstrained VideosCode0
Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch0
FingerSLAM: Closed-loop Unknown Object Localization and Reconstruction from Visuo-tactile Feedback0
Multiparticle Kalman filter for object localization in symmetric environments0
Joint ANN-SNN Co-training for Object Localization and Image Segmentation0
Confidence-driven Bounding Box Localization for Small Object Detection0
3D-Aware Object Localization using Gaussian Implicit Occupancy Function0
Deep Learning for Identifying Iran's Cultural Heritage Buildings in Need of Conservation Using Image Classification and Grad-CAMCode0
NU-AIR -- A Neuromorphic Urban Aerial Dataset for Detection and Localization of Pedestrians and Vehicles0
An Application of Deep Learning for Sweet Cherry Phenotyping using YOLO Object Detection0
Few-Shot Object Detection via Variational Feature AggregationCode1
RREx-BoT: Remote Referring Expressions with a Bag of Tricks0
Object Preserving Siamese Network for Single Object Tracking on Point Clouds0
CLIP the Gap: A Single Domain Generalization Approach for Object DetectionCode1
Knowledge-guided Causal Intervention for Weakly-supervised Object LocalizationCode0
Category-aware Allocation Transformer for Weakly Supervised Object Localization0
Discriminating Known From Unknown Objects via Structure-Enhanced Recurrent Variational AutoEncoderCode0
Learning Multi-Modal Class-Specific Tokens for Weakly Supervised Dense Object Localization0
Adversarial Normalization: I Can Visualize Everything (ICE)Code0
Unsupervised Object Localization: Observing the Background to Discover ObjectsCode1
EgoLoc: Revisiting 3D Object Localization from Egocentric Videos with Visual QueriesCode1
DeepCut: Unsupervised Segmentation using Graph Neural Networks ClusteringCode1
Expeditious Saliency-guided Mix-up through Random Gradient ThresholdingCode0
Semi-Supervised Object Detection with Object-wise Contrastive Learning and Regression Uncertainty0
D2DF2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain Adaptation0
Multimodal Query-guided Object Localization0
LocPoseNet: Robust Location Prior for Unseen Object Pose Estimation0
MUSTER: A Multi-scale Transformer-based Decoder for Semantic SegmentationCode1
Roboflow 100: A Rich, Multi-Domain Object Detection BenchmarkCode2
One-Shot General Object LocalizationCode0
Autonomous Marker-less Rapid Aerial Grasping0
Boundary-aware Camouflaged Object Detection via Deformable Point Sampling0
Unifying Vision-Language Representation Space with Single-tower Transformer0
Revisiting Color-Event based Tracking: A Unified Network, Dataset, and MetricCode1
A Low-Shot Object Counting Network With Iterative Prototype AdaptationCode1
Scene-Text Oriented Reffering Expression ComprehensionCode0
Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing0
Show:102550
← PrevPage 5 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified