SOTAVerified

Object Detection

Papers

Showing 201-250 of 10957 papers

| Title | Status | Hype |
| --- | --- | --- |
| Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 | Code | 2 |
| EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network | Code | 2 |
| EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 2 |
| EMOv2: Pushing 5M Vision Model Frontier | Code | 2 |
| MobileOne: An Improved One millisecond Mobile Backbone | Code | 2 |
| ESOD: Efficient Small Object Detection on High-Resolution Images | Code | 2 |
| MogaNet: Multi-order Gated Aggregation Network | Code | 2 |
| Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Code | 2 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2 |
| Exploring Orthogonality in Open World Object Detection | Code | 2 |
| Accelerating DETR Convergence via Semantic-Aligned Matching | Code | 2 |
| FasterViT: Fast Vision Transformers with Hierarchical Attention | Code | 2 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Code | 2 |
| Fast Vision Transformers with HiLo Attention | Code | 2 |
| Efficient Multi-Scale Attention Module with Cross-Spatial Learning | Code | 2 |
| Fine-Grained Stochastic Architecture Search | Code | 2 |
| FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models | Code | 2 |
| FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2 |
| UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery | Code | 2 |
| On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited | Code | 2 |
| Focal Sparse Convolutional Networks for 3D Object Detection | Code | 2 |
| Focusing on Tracks for Online Multi-Object Tracking | Code | 2 |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Code | 2 |
| ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks | Code | 2 |
| EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Code | 2 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Code | 2 |
| GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer | Code | 2 |
| ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer | Code | 2 |
| DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets | Code | 2 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Code | 2 |
| Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond | Code | 2 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Code | 2 |
| A Simple Framework for 3D Occupancy Estimation in Autonomous Driving | Code | 2 |
| Global Context Networks | Code | 2 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Code | 2 |
| GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs | Code | 2 |
| A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting | Code | 2 |
| GrootVL: Tree Topology is All You Need in State Space Model | Code | 2 |
| EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications | Code | 2 |
| Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details | Code | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Code | 2 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Code | 2 |
| A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future | Code | 2 |
| HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras | Code | 2 |
| Hierarchical Open-vocabulary Universal Image Segmentation | Code | 2 |
| Fully Sparse 3D Object Detection | Code | 2 |
| Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Code | 2 |
| Improving CLIP Fine-tuning Performance | Code | 2 |
| Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression | Code | 2 |
| Dilated Neighborhood Attention Transformer | Code | 2 |

Benchmark Results

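| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |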