SOTAVerified

Object Detection

Papers

Showing 201-250 of 10957 papers

Title | Status | Hype
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception | Code | 2
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data Generation | Code | 2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Code | 2
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise Optimization | Code | 2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 2
EMOv2: Pushing 5M Vision Model Frontier | Code | 2
3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection | Code | 2
ESOD: Efficient Small Object Detection on High-Resolution Images | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Code | 2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Code | 2
F2DNet: Fast Focal Detection Network for Pedestrian Detection | Code | 2
FasterViT: Fast Vision Transformers with Hierarchical Attention | Code | 2
A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2
Accelerating DETR Convergence via Semantic-Aligned Matching | Code | 2
Agent Attention: On the Integration of Softmax and Linear Attention | Code | 2
Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Code | 2
Fine-Grained Stochastic Architecture Search | Code | 2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Code | 2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2
Focal Loss for Dense Object Detection | Code | 2
Focal Modulation Networks | Code | 2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Code | 2
Frustratingly Simple Few-Shot Object Detection | Code | 2
Fully Test-Time Adaptation for Monocular 3D Object Detection | Code | 2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anything | Code | 2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Code | 2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Code | 2
MogaNet: Multi-order Gated Aggregation Network | Code | 2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object Detection | Code | 2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Code | 2
A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space | Code | 2
A Strong and Reproducible Object Detector with Only Public Datasets | Code | 2
DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Code | 2
Global Context Networks | Code | 2
Global Context Vision Transformers | Code | 2
GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Code | 2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Code | 2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Code | 2
DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Code | 2
GroupViT: Semantic Segmentation Emerges from Text Supervision | Code | 2
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Code | 2
A Simple Aerial Detection Baseline of Multimodal Language Models | Code | 2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Code | 2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Code | 2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Code | 2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Code | 2
Efficient Multi-Scale Attention Module with Cross-Spatial Learning | Code | 2
Equalized Focal Loss for Dense Long-Tailed Object Detection | Code | 2
Improving CLIP Fine-tuning Performance | Code | 2
Page 5 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified