SOTAVerified

Object Detection

Papers

Showing 701–750 of 10957 papers

Title | Status | Hype
UN-DETR: Promoting Objectness Learning via Joint Supervision for Unknown Object Detection | Code | 1
Cloud Object Detector Adaptation by Integrating Different Source Knowledge | Code | 1
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Code | 1
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Code | 1
Token Cropr: Faster ViTs for Quite a Few Tasks | Code | 1
Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding | Code | 1
COMPrompter: reconceptualized segment anything model with multiprompt network for camouflaged object detection | Code | 1
Deep Fourier-embedded Network for Bi-modal Salient Object Detection | Code | 1
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Code | 1
Event-based Spiking Neural Networks for Object Detection: A Review of Datasets, Architectures, Learning Rules, and Implementation | Code | 1
Learn from Foundation Model: Fruit Detection Model without Manual Annotation | Code | 1
Machine Learning for the Digital Typhoon Dataset: Extensions to Multiple Basins and New Developments in Representations and Tasks | Code | 1
Towards RAW Object Detection in Diverse Conditions | Code | 1
LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Code | 1
Highly Efficient and Unsupervised Framework for Moving Object Detection in Satellite Videos | Code | 1
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Code | 1
Physics-Guided Detector for SAR Airplanes | Code | 1
Vision Eagle Attention: a new lens for advancing image classification | Code | 1
RETR: Multi-View Radar Detection Transformer for Indoor Perception | Code | 1
Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Code | 1
Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Code | 1
Fast and Efficient Transformer-based Method for Bird's Eye View Instance Prediction | Code | 1
LSSInst: Improving Geometric Modeling in LSS-Based BEV Perception with Instance Representation | Code | 1
An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models | Code | 1
Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection | Code | 1
CRT-Fusion: Camera, Radar, Temporal Fusion Using Motion Information for 3D Object Detection | Code | 1
Efficient Feature Aggregation and Scale-Aware Regression for Monocular 3D Object Detection | Code | 1
Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Code | 1
ROAD-Waymo: Action Awareness at Scale for Autonomous Driving | Code | 1
Lighten CARAFE: Dynamic Lightweight Upsampling with Guided Reassemble Kernels | Code | 1
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Code | 1
IndraEye: Infrared Electro-Optical UAV-based Perception Dataset for Robust Downstream Tasks | Code | 1
MVSDet: Multi-View Indoor 3D Object Detection via Efficient Plane Sweeps | Code | 1
Sebica: Lightweight Spatial and Efficient Bidirectional Channel Attention Super Resolution Network | Code | 1
Thermal Chameleon: Task-Adaptive Tone-mapping for Radiometric Thermal-Infrared images | Code | 1
Optimizing Edge Offloading Decisions for Object Detection | Code | 1
You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Code | 1
DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Code | 1
PlantCamo: Plant Camouflage Detection | Code | 1
OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Code | 1
Fire and Smoke Detection with Burning Intensity Representation | Code | 1
TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool | Code | 1
MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection | Code | 1
Real-time Stereo-based 3D Object Detection for Streaming Perception | Code | 1
TEOcc: Radar-camera Multi-modal Occupancy Prediction via Temporal Enhancement | Code | 1
V2M: Visual 2-Dimensional Mamba for Image Representation Learning | Code | 1
GlobalMamba: Global Image Serialization for Vision Mamba | Code | 1
LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Code | 1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention | Code | 1
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection | Code | 1
Page 15 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified