SOTAVerified

Object Localization

Object Localization is the task of locating an instance of a particular object category in an image, typically by specifying a tightly cropped bounding box centered on the instance. An object proposal specifies a candidate bounding box, and an object proposal is said to be a correct localization if it sufficiently overlaps a human-labeled “ground-truth” bounding box for the given object. In the literature, the “Object Localization” task is to locate one instance of an object category, whereas “object detection” focuses on locating all instances of a category in a given image.

Source: Fast On-Line Kernel Density Estimation for Active Object Localization
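
The overlap criterion above can be illustrated with a short sketch. The description only says a proposal must "sufficiently overlap" the ground-truth box; the sketch assumes overlap is measured by intersection-over-union (IoU) and uses a threshold of 0.5, a common convention rather than something this page specifies. The names `Box`, `iou`, and `is_correct_localization` are hypothetical, chosen for illustration.

```python
from typing import NamedTuple


class Box(NamedTuple):
    """Axis-aligned bounding box in pixel coordinates (hypothetical helper type)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def area(box: Box) -> float:
    # Degenerate or inverted boxes are treated as having zero area.
    return max(0.0, box.x_max - box.x_min) * max(0.0, box.y_max - box.y_min)


def iou(a: Box, b: Box) -> float:
    """Intersection-over-union of two boxes, in [0, 1]."""
    intersection = Box(
        max(a.x_min, b.x_min),
        max(a.y_min, b.y_min),
        min(a.x_max, b.x_max),
        min(a.y_max, b.y_max),
    )
    inter_area = area(intersection)
    union_area = area(a) + area(b) - inter_area
    return inter_area / union_area if union_area > 0 else 0.0


def is_correct_localization(proposal: Box, ground_truth: Box,
                            threshold: float = 0.5) -> bool:
    # Assumption: "sufficient overlap" means IoU >= threshold (0.5 by default).
    return iou(proposal, ground_truth) >= threshold


if __name__ == "__main__":
    ground_truth = Box(0, 0, 10, 10)
    proposal = Box(5, 0, 15, 10)  # shifted right by half the box width
    print(iou(proposal, ground_truth))
    print(is_correct_localization(proposal, ground_truth))
```

Under this assumed criterion, localization needs only one proposal to pass the check for some ground-truth instance of the category, whereas detection would require matching every instance in the image.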

Papers

Showing 151–200 of 617 papers

Title | Status | Hype
Object Pose Estimation Annotation Pipeline for Multi-view Monocular Camera Systems in Industrial Settings | - | 0
CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement | - | 0
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey | Code | 1
DiPS: Discriminative Pseudo-Label Sampling with Self-Supervised Transformers for Weakly Supervised Object Localization | Code | 0
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection | Code | 2
Memory-efficient particle filter recurrent neural network for object localization | - | 0
Learning to Terminate in Object Navigation | Code | 0
Context-Aware Entity Grounding with Open-Vocabulary 3D Scene Graphs | Code | 1
CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Code | 1
DeepAdaIn-Net: Deep Adaptive Device-Edge Collaborative Inference for Augmented Reality | - | 0
Background Activation Suppression for Weakly Supervised Object Localization and Semantic Segmentation | Code | 1
SEMPART: Self-supervised Multi-resolution Partitioning of Image Semantics | - | 0
Unsupervised Open-Vocabulary Object Localization in Videos | Code | 1
FDCNet: Feature Drift Compensation Network for Class-Incremental Weakly Supervised Object Localization | Code | 1
ALWOD: Active Learning for Weakly-Supervised Object Detection | Code | 0
Keep It SimPool: Who Said Supervised Transformers Suffer from Attention Deficit? | Code | 1
HiLM-D: Towards High-Resolution Understanding in Multimodal Large Language Models for Autonomous Driving | - | 0
Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding | - | 0
Unsupervised Object Localization with Representer Point Selection | Code | 0
BroadCAM: Outcome-agnostic Class Activation Mapping for Small-scale Weakly Supervised Applications | Code | 0
Context-Aware 3D Object Localization from Single Calibrated Images: A Study of Basketballs | Code | 1
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning | Code | 1
Semantic-Constraint Matching Transformer for Weakly Supervised Object Localization | - | 0
Object-Centric Multiple Object Tracking | Code | 1
Referring Image Segmentation Using Text Supervision | Code | 1
I3DOD: Towards Incremental 3D Object Detection via Prompting | - | 0
Video OWL-ViT: Temporally-consistent open-world localization in video | - | 0
Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models | - | 0
Leveraging Next-Active Objects for Context-Aware Anticipation in Egocentric Videos | - | 0
Rethinking the Localization in Weakly Supervised Object Localization | - | 0
Rapid Training Data Creation by Synthesizing Medical Images for Classification and Localization | - | 0
All-pairs Consistency Learning for Weakly Supervised Semantic Segmentation | Code | 0
MCTformer+: Multi-Class Token Transformer for Weakly Supervised Semantic Segmentation | Code | 1
A Memory-Augmented Multi-Task Collaborative Framework for Unsupervised Traffic Accident Detection in Driving Videos | - | 0
Optical Flow boosts Unsupervised Localization and Segmentation | Code | 1
Cascade-DETR: Delving into High-Quality Universal Object Detection | Code | 1
Generative Prompt Model for Weakly Supervised Object Localization | Code | 1
MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression | - | 0
YOLIC: An Efficient Method for Object Localization and Classification on Edge Devices | Code | 0
Open-Vocabulary Object Detection via Scene Graph Discovery | - | 0
EgoCOL: Egocentric Camera pose estimation for Open-world 3D object Localization @Ego4D challenge 2023 | Code | 0
PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation | Code | 1
3-Dimensional Sonic Phase-invariant Echo Localization | Code | 0
A Novel Confidence Induced Class Activation Mapping for MRI Brain Tumor Segmentation | Code | 0
NeurOCS: Neural NOCS Supervision for Monocular 3D Object Localization | - | 0
Counterfactual Co-occurring Learning for Bias Mitigation in Weakly-supervised Object Localization | - | 0
Learning high-level visual representations from a child's perspective without strong inductive biases | Code | 1
Probing the Role of Positional Information in Vision-Language Models | - | 0
AV-SAM: Segment Anything Model Meets Audio-Visual Localization and Segmentation | - | 0
A Systematic Study on Object Recognition Using Millimeter-wave Radar | - | 0

Benchmark Results

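# | Model | Metric | Claimed | Verified | Status
1 | OSMaN | RGSPL | 32.99 | - | Unverified
2 | SUSA | RGSPL | 27.31 | - | Unverified
3 | Shanks | RGSPL | 22.85 | - | Unverified
4 | CVPR22 | RGSPL | 22.06 | - | Unverified
5 | damm1 | RGSPL | 15.96 | - | Unverified
6 | 1637 | RGSPL | 14.03 | - | Unverified
7 | init. PREVALENT | RGSPL | 13.51 | - | Unverified
8 | Airbert | RGSPL | 13.28 | - | Unverified
9 | init. OSCAR | RGSPL | 10 | - | Unverified
10 | SIA | RGSPL | 9.2 | - | Unverified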

# | Model | Metric | Claimed | Verified | Status
1 | VoxelNet | AP | 89.35 | - | Unverified
2 | VoxelNet | AP | 89.35 | - | Unverified
3 | Frustum PointNets | AP | 88.7 | - | Unverified
4 | Frustum PointNets | AP | 81.2 | - | Unverified
5 | VoxelNet | AP | 77.47 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 48.3 | - | Unverified
2 | Frustum PointNets | AP | 47.2 | - | Unverified
3 | Frustum PointNets | AP | 40.23 | - | Unverified
4 | VoxelNet | AP | 38.11 | - | Unverified
5 | VoxelNet | AP | 31.51 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 52.23 | - | Unverified
2 | Frustum PointNets | AP | 50.22 | - | Unverified
3 | Frustum PointNets | AP | 42.15 | - | Unverified
4 | VoxelNet | AP | 40.74 | - | Unverified
5 | VoxelNet | AP | 33.69 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VoxelNet | AP | 77.39 | - | Unverified
2 | Frustum PointNets | AP | 75.33 | - | Unverified
3 | Frustum PointNets | AP | 62.19 | - | Unverified
4 | VoxelNet | AP | 57.73 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 75.38 | - | Unverified
2 | Frustum PointNets | AP | 71.96 | - | Unverified
3 | VoxelNet | AP | 66.7 | - | Unverified
4 | VoxelNet | AP | 61.22 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 61.96 | - | Unverified
2 | Frustum PointNets | AP | 56.77 | - | Unverified
3 | VoxelNet | AP | 54.76 | - | Unverified
4 | VoxelNet | AP | 48.36 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 58.09 | - | Unverified
2 | Frustum PointNets | AP | 51.21 | - | Unverified
3 | VoxelNet | AP | 46.13 | - | Unverified
4 | VoxelNet | AP | 39.48 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Unified-IOXL | Localization (ablation) | 67 | - | Unverified
2 | GPV-2 | Localization (ablation) | 53.6 | - | Unverified
3 | Mask R-CNN | Localization (ablation) | 44.7 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 54.68 | - | Unverified
2 | VoxelNet | AP | 50.55 | - | Unverified
3 | Frustum PointNets | AP | 50.39 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | GPT4-Vision 4-shot+CoT | Accuracy | 49.7 | - | Unverified
2 | Gemini-Pro 4-shot+CoT | Accuracy | 33.9 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum PointNets | AP | 84 | - | Unverified
2 | VoxelNet | AP | 79.26 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Frustum-PointPillars | AP | 60.98 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | Precision | 88.1 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ours | CorLoc | 41.2 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ours | CorLoc | 47.45 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | F-Score | 88.6 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Hausdorff Loss | Recall | 89.2 | - | Unverified