SOTAVerified

Object Detection

Papers

Showing 11511200 of 10957 papers

TitleStatusHype
On the Robustness of Object Detection Models on Aerial ImagesCode1
Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object DetectionCode1
SOGDet: Semantic-Occupancy Guided Multi-view 3D Object DetectionCode1
Eventful Transformers: Leveraging Temporal Redundancy in Vision TransformersCode1
Learning Heavily-Degraded Prior for Underwater Object DetectionCode1
AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object DetectionCode1
Delving into Motion-Aware Matching for Monocular 3D Object TrackingCode1
A Survey on Self-Supervised Representation LearningCode1
ViLLA: Fine-Grained Vision-Language Representation Learning from Real-World DataCode1
Spatial Transform Decoupling for Oriented Object DetectionCode1
UniM^2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous DrivingCode1
ASY-VRNet: Waterway Panoptic Driving Perception Model based on Asymmetric Fair Fusion of Vision and 4D mmWave RadarCode1
DatasetEquity: Are All Samples Created Equal? In The Quest For Equity Within DatasetsCode1
Deep Equilibrium Object DetectionCode1
Small Object Detection via Coarse-to-fine Proposal Generation and Imitation LearningCode1
RLIPv2: Fast Scaling of Relational Language-Image Pre-trainingCode1
MonoNeRD: NeRF-like Representations for Monocular 3D Object DetectionCode1
Far3D: Expanding the Horizon for Surround-view 3D Object DetectionCode1
Frequency Perception Network for Camouflaged Object DetectionCode1
ImGeoNet: Image-induced Geometry-aware Voxel Representation for Multi-view 3D Object DetectionCode1
GPA-3D: Geometry-aware Prototype Alignment for Unsupervised Domain Adaptive 3D Object Detection from Point CloudsCode1
Diagnosing Human-object Interaction DetectorsCode1
Real-time Automatic M-mode Echocardiography Measurement with Panel Attention from Local-to-Global PixelsCode1
Improved Region Proposal Network for Enhanced Few-Shot Object DetectionCode1
3DMOTFormer: Graph Transformer for Online 3D Multi-Object TrackingCode1
TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-ShotCode1
Cyclic-Bootstrap Labeling for Weakly Supervised Object DetectionCode1
Taming Self-Training for Open-Vocabulary Object DetectionCode1
Learned Point Cloud Compression for ClassificationCode1
MS3D++: Ensemble of Experts for Multi-Source Unsupervised Domain Adaption in 3D Object DetectionCode1
Recognizing Handwritten Mathematical Expressions of Vertical Addition and SubtractionCode1
Objects do not disappear: Video object detection by single-frame object location anticipationCode1
Density Crop-guided Semi-supervised Object Detection in Aerial ImagesCode1
PARTNER: Level up the Polar Representation for LiDAR 3D Object DetectionCode1
SODFormer: Streaming Object Detection with Transformer Using Events and FramesCode1
YUDO: YOLO for Uniform Directed Object DetectionCode1
V-DETR: DETR with Vertex Relative Position Encoding for 3D Object DetectionCode1
FeatEnHancer: Enhancing Hierarchical Features for Object Detection and Beyond Under Low-Light VisionCode1
Recurrent Multi-scale Transformer for High-Resolution Salient Object DetectionCode1
Strategic Preys Make Acute Predators: Enhancing Camouflaged Object Detectors by Generating Camouflaged ObjectsCode1
FracAtlas: A Dataset for Fracture Classification, Localization and Segmentation of Musculoskeletal RadiographsCode1
Generation of Realistic Synthetic Raw Radar Data for Automated Driving Applications using Generative Adversarial NetworksCode1
Balanced Classification: A Unified Framework for Long-Tailed Object DetectionCode1
UGainS: Uncertainty Guided Anomaly Instance SegmentationCode1
Point Anywhere: Directed Object Estimation from Omnidirectional ImagesCode1
A Satellite Imagery Dataset for Long-Term Sustainable Development in United States CitiesCode1
RCS-YOLO: A Fast and High-Accuracy Object Detector for Brain Tumor DetectionCode1
Spatio-Temporal Domain Awareness for Multi-Agent Collaborative PerceptionCode1
Unmasking Anomalies in Road-Scene SegmentationCode1
RecursiveDet: End-to-End Region-based Recursive Object DetectionCode1
Show:102550
← PrevPage 24 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified