SOTAVerified

Object Detection

Papers

Showing 201225 of 10957 papers

TitleStatusHype
Visual Consensus Prompting for Co-Salient Object DetectionCode1
Balancing Privacy and Action Performance: A Penalty-Driven Approach to Image Anonymization0
SLAM-Based Navigation and Fault Resilience in a Surveillance Quadcopter with Embedded Vision Systems0
Context-Awareness and Interpretability of Rare Occurrences for Discovery and Formalization of Critical Failure Modes0
DenSe-AdViT: A novel Vision Transformer for Dense SAR Object Detection0
HMPE:HeatMap Embedding for Efficient Transformer-Based Small Object Detection0
Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models0
Towards a Multi-Agent Vision-Language System for Zero-Shot Novel Hazardous Object Detection for Autonomous Driving SafetyCode0
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory PredictionCode1
Weak Cube R-CNN: Weakly Supervised 3D Detection using only 2D Bounding Boxes0
RoPETR: Improving Temporal Camera-Only 3D Detection by Integrating Enhanced Rotary Position Embedding0
SAR Object Detection with Self-Supervised Pretraining and Curriculum-Aware Sampling0
Self-Supervised Pre-training with Combined Datasets for 3D Perception in Autonomous Driving0
RF-DETR Object Detection vs YOLOv12 : A Study of Transformer-based and CNN-based Architectures for Single-Class and Multi-Class Greenfruit Detection in Complex Orchard Environments Under Label Ambiguity0
VLLFL: A Vision-Language Model Based Lightweight Federated Learning Framework for Smart Agriculture0
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
Multimodal Spatio-temporal Graph Learning for Alignment-free RGBT Video Object Detection0
RADLER: Radar Object Detection Leveraging Semantic 3D City Models and Self-Supervised Radar-Image Learning0
A Review of YOLOv12: Attention-Based Enhancements vs. Previous Versions0
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild0
Towards a General-Purpose Zero-Shot Synthetic Low-Light Image and Video Pipeline0
Flyweight FLIM Networks for Salient Object Detection in Biomedical Images0
S^2Teacher: Step-by-step Teacher for Sparsely Annotated Oriented Object Detection0
CFIS-YOLO: A Lightweight Multi-Scale Fusion Network for Edge-Deployable Wood Defect Detection0
Safe-Construct: Redefining Construction Safety Violation Recognition as 3D Multi-View Engagement Task0
Show:102550
← PrevPage 9 of 439Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified