SOTAVerified

Object Detection

Papers

Showing 1101-1150 of 10957 papers

Title | Status | Hype
End-to-End Trainable Multi-Instance Pose Estimation with Transformers | Code | 1
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection | Code | 1
CoDiff: Conditional Diffusion Model for Collaborative 3D Object Detection | Code | 1
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs | Code | 1
CodeReef: an open platform for portable MLOps, reusable automation actions and reproducible benchmarking | Code | 1
Co-Fix3D: Enhancing 3D Object Detection with Collaborative Refinement | Code | 1
E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Code | 1
An Asymptotically Optimal Multi-Armed Bandit Algorithm and Hyperparameter Optimization | Code | 1
Adaptive Class Suppression Loss for Long-Tail Object Detection | Code | 1
CoCoNets: Continuous Contrastive 3D Scene Representations | Code | 1
Adaptive Calibration: A Unified Conversion Framework of Spiking Neural Network | Code | 1
CoIn: Contrastive Instance Feature Mining for Outdoor 3D Object Detection with Very Limited Annotations | Code | 1
Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone | Code | 1
Adaptive Bounding Box Uncertainties via Two-Step Conformal Prediction | Code | 1
CoBEV: Elevating Roadside 3D Object Detection with Depth and Height Complementarity | Code | 1
CoADNet: Collaborative Aggregation-and-Distribution Networks for Co-Salient Object Detection | Code | 1
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoor Object Detection from Multi-view Images | Code | 1
CNN Model & Tuning for Global Road Damage Detection | Code | 1
Container: Context Aggregation Networks | Code | 1
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view Images | Code | 1
CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Code | 1
EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding | Code | 1
Eliminating Position Bias of Language Models: A Mechanistic Approach | Code | 1
End-to-End Video Object Detection with Spatial-Temporal Transformers | Code | 1
Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers | Code | 1
CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection | Code | 1
Adaptive Attention Span in Computer Vision | Code | 1
Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Code | 1
CLIP the Gap: A Single Domain Generalization Approach for Object Detection | Code | 1
Watch out! Motion is Blurring the Vision of Your Deep Neural Networks | Code | 1
Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction | Code | 1
CLIP-Count: Towards Text-Guided Zero-Shot Object Counting | Code | 1
CLIP-Guided Source-Free Object Detection in Aerial Images | Code | 1
EfficientPose: An efficient, accurate and scalable end-to-end 6D multi object pose estimation approach | Code | 1
CLIM: Contrastive Language-Image Mosaic for Region Representation | Code | 1
Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Code | 1
Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Code | 1
Class-Difficulty Based Methods for Long-Tailed Visual Recognition | Code | 1
Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Code | 1
Class-aware Sounding Objects Localization via Audiovisual Correspondence | Code | 1
Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs | Code | 1
ClusterFormer: Clustering As A Universal Visual Learner | Code | 1
Class-Agnostic Segmentation Loss and Its Application to Salient Object Detection and Segmentation | Code | 1
CLAMP-ViT: Contrastive Data-Free Learning for Adaptive Post-Training Quantization of ViTs | Code | 1
Class-Agnostic Segmentation Loss and Its Application to Salient Object Detection and Segmentation | Code | 1
Efficient Golf Ball Detection and Tracking Based on Convolutional Neural Networks and Kalman Filter | Code | 1
Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Code | 1
Efficient Visual Computing with Camera RAW Snapshots | Code | 1
Efficient Decoder-free Object Detection with Transformers | Code | 1
CircleNet: Anchor-free Detection with Circle Representation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified