SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization

Papers

Showing 351400 of 617 papers

TitleStatusHype
Point Cloud Registration-Driven Robust Feature Matching for 3D Siamese Object Tracking0
Discriminative Sampling of Proposals in Self-Supervised Transformers for Weakly Supervised Object Localization0
Constrained Sampling for Class-Agnostic Weakly Supervised Object Localization0
Progressive Domain Adaptation with Contrastive Learning for Object Detection in the Satellite Imagery0
Geometry Aligned Variational Transformer for Image-conditioned Layout Generation0
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained VideosCode0
Detect and Approach: Close-Range Navigation Support for People with Blindness and Low Vision0
TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection0
Finding Fallen Objects Via Asynchronous Audio-Visual Integration0
Real-time Full-stack Traffic Scene Perception for Autonomous Driving with Roadside Cameras0
Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks0
Spiking Neural Networks for Frame-based and Event-based Single Object Localization0
Location-free Human Pose Estimation0
Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations0
Learning 6-DoF Object Poses to Grasp Category-level Objects by Language Instructions0
Diverse Instance Discovery: Vision-Transformer for Instance-Aware Multi-Label Image Recognition0
Anti-Adversarially Manipulated Attributions for Weakly Supervised Semantic Segmentation and Object Localization0
Bridging the Gap between Classification and Localization for Weakly Supervised Object Localization0
Learning Consistency from High-quality Pseudo-labels for Weakly Supervised Object Localization0
Object Localization under Single Coarse Point Supervision0
Learning Transferable Reward for Query Object Localization with Policy AdaptationCode0
Webly Supervised Concept Expansion for General Purpose Vision Models0
Probing the Role of Positional Information in Vision-Language Models0
CaFT: Clustering and Filter on Tokens of Transformer for Weakly Supervised Object Localization0
P2P-Loc: Point to Point Tiny Person Localization0
Background-aware Classification Activation Map for Weakly Supervised Object LocalizationCode0
The Devil is in the Task: Exploiting Reciprocal Appearance-Localization Features for Monocular 3D Object Detection0
A Simple Single-Scale Vision Transformer for Object Localization and Instance SegmentationCode0
LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization0
Modelling Lips-State Detection Using CNN for Non-Verbal Communications0
Learning to Detect Instance-level Salient Objects Using Complementary Image Labels0
Space-Time Memory Network for Sounding Object Localization in Videos0
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization0
SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without Cost0
Asynchronous Collaborative Localization by Integrating Spatiotemporal Graph Learning with Model-Based Estimation0
Boundary Distribution Estimation for Precise Object Detection0
Skeleton-Based Mutually Assisted Interacted Object Localization and Human Action Recognition0
Video Instance Segmentation by Instance Flow Assembly0
Oriented Feature Alignment for Fine-grained Object Recognition in High-Resolution Satellite Imagery0
Localizing Infinity-shaped fishes: Sketch-guided object localization in the wildCode0
Self-Taught Cross-Domain Few-Shot Learning with Weakly Supervised Object Localization and Task-Decomposition0
Weakly Supervised Foreground Learning for Weakly Supervised Localization and Detection0
Towards Accurate Localization by Instance Search0
Evaluation of Audio-Visual Alignments in Visually Grounded Speech ModelsCode0
Exploring Depth Contribution for Camouflaged Object Detection0
Strengthen Learning Tolerance for Weakly Supervised Object LocalizationCode0
Dual Normalization Multitasking for Audio-Visual Sounding Object Localization0
Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey0
Improving Few-shot Learning with Weakly-supervised Object Localization0
Deep Spiking Convolutional Neural Network for Single Object Localization Based On Deep Continuous Local Learning0
Show:102550
← PrevPage 8 of 13Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1OSMaNRGSPL32.99Unverified
2SUSARGSPL27.31Unverified
3ShanksRGSPL22.85Unverified
4CVPR22RGSPL22.06Unverified
5damm1RGSPL15.96Unverified
61637RGSPL14.03Unverified
7init. PREVALENTRGSPL13.51Unverified
8AirbertRGSPL13.28Unverified
9init. OSCARRGSPL10Unverified
10SIARGSPL9.2Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP89.35Unverified
2VoxelNetAP89.35Unverified
3Frustum PointNetsAP88.7Unverified
4Frustum PointNetsAP81.2Unverified
5VoxelNetAP77.47Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP48.3Unverified
2Frustum PointNetsAP47.2Unverified
3Frustum PointNetsAP40.23Unverified
4VoxelNetAP38.11Unverified
5VoxelNetAP31.51Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP52.23Unverified
2Frustum PointNetsAP50.22Unverified
3Frustum PointNetsAP42.15Unverified
4VoxelNetAP40.74Unverified
5VoxelNetAP33.69Unverified
#ModelMetricClaimedVerifiedStatus
1VoxelNetAP77.39Unverified
2Frustum PointNetsAP75.33Unverified
3Frustum PointNetsAP62.19Unverified
4VoxelNetAP57.73Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP75.38Unverified
2Frustum PointNetsAP71.96Unverified
3VoxelNetAP66.7Unverified
4VoxelNetAP61.22Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP61.96Unverified
2Frustum PointNetsAP56.77Unverified
3VoxelNetAP54.76Unverified
4VoxelNetAP48.36Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP58.09Unverified
2Frustum PointNetsAP51.21Unverified
3VoxelNetAP46.13Unverified
4VoxelNetAP39.48Unverified
#ModelMetricClaimedVerifiedStatus
1Unified-IOXLLocalization (ablation)67Unverified
2GPV-2Localization (ablation)53.6Unverified
3Mask R-CNNLocalization (ablation)44.7Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP54.68Unverified
2VoxelNeAP50.55Unverified
3Frustum PointNetsAP50.39Unverified
#ModelMetricClaimedVerifiedStatus
1GPT4-Vision 4-shot+CoTAccuracy49.7Unverified
2Gemini-Pro 4-shot+CoTAccuracy33.9Unverified
#ModelMetricClaimedVerifiedStatus
1Frustum PointNetsAP84Unverified
2VoxelNetAP79.26Unverified
#ModelMetricClaimedVerifiedStatus
1Frustrum-PointPillarsAP60.98Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossPrecision88.1Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc41.2Unverified
#ModelMetricClaimedVerifiedStatus
1oursCorLoc47.45Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossF-Score88.6Unverified
#ModelMetricClaimedVerifiedStatus
1Hausdorff LossRecall89.2Unverified