SOTAVerified

Object Detection

Papers

Showing 301350 of 10957 papers

TitleStatusHype
Equalized Focal Loss for Dense Long-Tailed Object DetectionCode2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation AdaptationCode2
FasterViT: Fast Vision Transformers with Hierarchical AttentionCode2
Focal Loss for Dense Object DetectionCode2
Going Denser with Open-Vocabulary Part SegmentationCode2
Image-to-Lidar Self-Supervised Distillation for Autonomous Driving DataCode2
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation ModelsCode2
Efficient Teacher: Semi-Supervised Object Detection for YOLOv5Code2
EGTR: Extracting Graph from Transformer for Scene Graph GenerationCode2
MogaNet: Multi-order Gated Aggregation NetworkCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Efficient Multi-Scale Attention Module with Cross-Spatial LearningCode2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
MobileOne: An Improved One millisecond Mobile BackboneCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise OptimizationCode2
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
ESOD: Efficient Small Object Detection on High-Resolution ImagesCode2
DQ-DETR: DETR with Dynamic Query for Tiny Object DetectionCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object DetectionCode2
Exploring Plain Vision Transformer Backbones for Object DetectionCode2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
On the Arbitrary-Oriented Object Detection: Classification based Approaches RevisitedCode2
Fast Vision Transformers with HiLo AttentionCode2
Feature Pyramid Networks for Object DetectionCode2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural NetworksCode2
Fine-Grained Prototypes Distillation for Few-Shot Object DetectionCode2
FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height PluginCode2
AdaMixer: A Fast-Converging Query-Based Object DetectorCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
An Empirical Study of Remote Sensing PretrainingCode2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine PerceptionCode2
FreeSOLO: Learning to Segment Objects without AnnotationsCode2
Frustratingly Simple Few-Shot Object DetectionCode2
Adapter is All You Need for Tuning Visual TasksCode2
Fully Test-Time Adaptation for Monocular 3D Object DetectionCode2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anythingCode2
Distance-IoU Loss: Faster and Better Learning for Bounding Box RegressionCode2
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision TransformerCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
Generalized Semantic Contrastive Learning via Embedding Side Information for Few-Shot Object DetectionCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
DiffBEV: Conditional Diffusion Model for Bird's Eye View PerceptionCode2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask InpaintingCode2
DiffusionTrack: Diffusion Model For Multi-Object TrackingCode2
Show:102550
← PrevPage 7 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified