SOTAVerified

Object Detection

Papers

Showing 3951–4000 of 10957 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Simple and Generic Framework for Feature Distillation via Channel-wise Transformation | | 0 |
| Open-Vocabulary Object Detection using Pseudo Caption Labels | | 0 |
| Explore the Power of Synthetic Data on Few-shot Object Detection | | 0 |
| Box-Level Active Detection | Code | 1 |
| DetOFA: Efficient Training of Once-for-All Networks for Object Detection Using Path Filter | | 0 |
| The effectiveness of MAE pre-pretraining for billion-scale pretraining | Code | 1 |
| CORA: Adapting CLIP for Open-Vocabulary Detection with Region Prompting and Anchor Pre-Matching | Code | 1 |
| MonoATT: Online Monocular 3D Object Detection with Adaptive Token Transformer | | 0 |
| MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training | Code | 1 |
| EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation | Code | 3 |
| Uncertainty Aware Active Learning for Reconfiguration of Pre-trained Deep Object-Detection Networks for New Target Domains | | 0 |
| OcTr: Octree-based Transformer for 3D Object Detection | | 0 |
| Rigidity-Aware Detection for 6D Object Pose Estimation | Code | 1 |
| Dense Distinct Query for End-to-End Object Detection | Code | 2 |
| Spherical Transformer for LiDAR-based 3D Recognition | Code | 2 |
| Efficient Feature Distillation for Zero-shot Annotation Object Detection | Code | 0 |
| Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Code | 2 |
| Penalty-Based Imitation Learning With Cross Semantics Generation Sensor Fusion for Autonomous Driving | | 0 |
| Anchor Free remote sensing detector based on solving discrete polar coordinate equation | | 0 |
| Detecting the open-world objects with the help of the Brain | Code | 1 |
| Detecting Everything in the Open World: Towards Universal Object Detection | Code | 2 |
| STDLens: Model Hijacking-Resilient Federated Learning for Object Detection | Code | 0 |
| DR.CPO: Diversified and Realistic 3D Augmentation via Iterative Construction, Random Placement, and HPR Occlusion | Code | 0 |
| Understanding the Role of the Projector in Knowledge Distillation | Code | 1 |
| Accurate Detection of Mediastinal Lesions with nnDetection | | 0 |
| VIMI: Vehicle-Infrastructure Multi-view Intermediate Fusion for Camera-based 3D Object Detection | Code | 1 |
| Constructing Metric-Semantic Maps using Floor Plan Priors for Long-Term Indoor Localization | Code | 1 |
| Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth | | 0 |
| GeoMIM: Towards Better 3D Knowledge Transfer via Masked Image Modeling for Multi-view 3D Understanding | Code | 1 |
| Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous Driving | Code | 0 |
| VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Code | 2 |
| Augment and Criticize: Exploring Informative Samples for Semi-Supervised Monocular 3D Object Detection | | 0 |
| Rethinking the backbone architecture for tiny object detection | | 0 |
| LiDAR Spoofing Meets the New-Gen: Capability Improvements, Broken Assumptions, and New Attack Strategies | | 0 |
| Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction | Code | 1 |
| CCTV-Gun: Benchmarking Handgun Detection in CCTV Images | Code | 1 |
| Supervision Interpolation via LossMix: Generalizing Mixup for Object Detection and Beyond | | 0 |
| Multi-Semantic Interactive Learning for Object Detection | | 0 |
| Identification of Novel Classes for Improving Few-Shot Object Detection | Code | 1 |
| GOOD: General Optimization-based Fusion for 3D Object Detection via LiDAR-Camera Object Candidates | | 0 |
| CAPE: Camera View Position Embedding for Multi-View 3D Object Detection | Code | 1 |
| Dual Memory Aggregation Network for Event-Based Object Detection with Learnable Representation | Code | 1 |
| Adaptive Graph Convolution Module for Salient Object Detection | | 0 |
| Scribble-Supervised RGB-T Salient Object Detection | Code | 1 |
| A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2 |
| Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection | | 0 |
| Action knowledge for video captioning with graph neural networks | Code | 1 |
| VEIL: Vetting Extracted Image Labels from In-the-Wild Captions for Weakly-Supervised Object Detection | Code | 0 |
| Cross-Modal Causal Intervention for Medical Report Generation | Code | 3 |
| Towards Commonsense Knowledge based Fuzzy Systems for Supporting Size-Related Fine-Grained Object Detection | Code | 0 |

Benchmark Results

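| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |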