SOTAVerified

Object Detection

Papers

Showing 251300 of 10957 papers

TitleStatusHype
Fast R-CNNCode2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the DetailsCode2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation AdaptationCode2
MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D PerceptionCode2
Equalized Focal Loss for Dense Long-Tailed Object DetectionCode2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and FutureCode2
ESOD: Efficient Small Object Detection on High-Resolution ImagesCode2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object DetectionCode2
Fast Vision Transformers with HiLo AttentionCode2
EMOv2: Pushing 5M Vision Model FrontierCode2
A Strong and Reproducible Object Detector with Only Public DatasetsCode2
Enhance Then Search: An Augmentation-Search Strategy with Foundation Models for Cross-Domain Few-Shot Object DetectionCode2
EGTR: Extracting Graph from Transformer for Scene Graph GenerationCode2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object DetectionCode2
Agent Attention: On the Integration of Softmax and Linear AttentionCode2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy NetworkCode2
Efficient Teacher: Semi-Supervised Object Detection for YOLOv5Code2
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation ModelsCode2
EPTQ: Enhanced Post-Training Quantization via Hessian-guided Network-wise OptimizationCode2
Feature Pyramid Networks for Object DetectionCode2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene ImageryCode2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and TransformerCode2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural NetworksCode2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision ApplicationsCode2
MogaNet: Multi-order Gated Aggregation NetworkCode2
AeroGen: Enhancing Remote Sensing Object Detection with Diffusion-Driven Data GenerationCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
DQ-DETR: DETR with Dynamic Query for Tiny Object DetectionCode2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask InpaintingCode2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and BeyondCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Efficient Multi-Scale Attention Module with Cross-Spatial LearningCode2
DiverGen: Improving Instance Segmentation by Learning Wider Data Distribution with More Diverse Generative DataCode2
Distance-IoU Loss: Faster and Better Learning for Bounding Box RegressionCode2
2nd Place Solution for Waymo Open Dataset Challenge -- Real-time 2D Object DetectionCode2
2nd Place Solution for Waymo Open Dataset Challenge - Real-time 2D Object DetectionCode2
Dilated Neighborhood Attention TransformerCode2
A Simple Framework for 3D Occupancy Estimation in Autonomous DrivingCode2
ALBench: A Framework for Evaluating Active Learning in Object DetectionCode2
DiffusionTrack: Diffusion Model For Multi-Object TrackingCode2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelCode2
DPFT: Dual Perspective Fusion Transformer for Camera-Radar-based Object DetectionCode2
FedPylot: Navigating Federated Learning for Real-Time Object Detection in Internet of VehiclesCode2
DEVIANT: Depth EquiVarIAnt NeTwork for Monocular 3D Object DetectionCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
DEYO: DETR with YOLO for End-to-End Object DetectionCode2
DETR Does Not Need Multi-Scale or Locality DesignCode2
DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object DetectionCode2
Show:102550
← PrevPage 6 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified