SOTAVerified

Object Detection

Papers

Showing 51-100 of 10957 papers

Title | Status | Hype
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything | Code | 4
Vision GNN: An Image is Worth Graph of Nodes | Code | 4
TUMTraf V2X Cooperative Perception Dataset | Code | 4
DN-DETR: Accelerate DETR Training by Introducing Query DeNoising | Code | 4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height | Code | 4
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Code | 4
Detectron2 Object Detection & Manipulating Images using Cartoonization | Code | 4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object Detection | Code | 4
RTMDet: An Empirical Study of Designing Real-Time Object Detectors | Code | 4
DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection | Code | 4
Deep Residual Learning for Image Recognition | Code | 4
Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN | Code | 4
FG-CLIP: Fine-Grained Visual and Textual Alignment | Code | 4
Strip R-CNN: Large Strip Convolution for Remote Sensing Object Detection | Code | 4
Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Code | 3
ResNeSt: Split-Attention Networks | Code | 3
Cut and Learn for Unsupervised Object Detection and Instance Segmentation | Code | 3
Cubify Anything: Scaling Indoor 3D Object Detection | Code | 3
Rethinking the Evaluation of Visible and Infrared Image Fusion | Code | 3
Practical Video Object Detection via Feature Selection and Aggregation | Code | 3
Cross Modal Transformer: Towards Fast and Robust 3D Object Detection | Code | 3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection | Code | 3
Playing Non-Embedded Card-Based Games with Reinforcement Learning | Code | 3
PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition | Code | 3
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Code | 3
ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders | Code | 3
OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer | Code | 3
Panacea+: Panoramic and Controllable Video Generation for Autonomous Driving | Code | 3
PETR: Position Embedding Transformation for Multi-View 3D Object Detection | Code | 3
Open-YOLO 3D: Towards Fast and Accurate Open-Vocabulary 3D Instance Segmentation | Code | 3
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection network | Code | 3
OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics | Code | 3
Multiple Object Tracking as ID Prediction | Code | 3
MMLSpark: Unifying Machine Learning Ecosystems at Massive Scales | Code | 3
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object Tracking | Code | 3
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion | Code | 3
PETRv2: A Unified Framework for 3D Perception from Multi-Camera Images | Code | 3
Revisiting Image Pyramid Structure for High Resolution Salient Object Detection | Code | 3
MagicDrive: Street View Generation with Diverse 3D Geometry Control | Code | 3
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community | Code | 3
LION: Linear Group RNN for 3D Object Detection in Point Clouds | Code | 3
Bridging the Gap Between Anchor-based and Anchor-free Detection via Adaptive Training Sample Selection | Code | 3
IS-Fusion: Instance-Scene Collaborative Fusion for Multimodal 3D Object Detection | Code | 3
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object Detection | Code | 3
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models | Code | 3
How to Evaluate the Generalization of Detection? A Benchmark for Comprehensive Open-Vocabulary Detection | Code | 3
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Code | 3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Code | 3
Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation | Code | 3
A Comparative Analysis of Object Detection Metrics with a Companion Open-Source Toolkit | Code | 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified