SOTAVerified

Object Detection

Papers

Showing 2801-2850 of 10957 papers

| Title | Status | Hype |
| --- | --- | --- |
| Learning Discriminative Features for Crowd Counting | | 0 |
| Domain Adaptive Object Detection via Balancing Between Self-Training and Adversarial Learning | | 0 |
| Towards Few-Annotation Learning in Computer Vision: Application to Image Classification and Object Detection tasks | | 0 |
| SODAWideNet -- Salient Object Detection with an Attention augmented Wide Encoder Decoder network without ImageNet pre-training | Code | 0 |
| Free-Space Optical Spiking Neural Network | | 0 |
| Image change detection with only a few samples | | 0 |
| 3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features | | 0 |
| FusionViT: Hierarchical 3D Object Detection via LiDAR-Camera Vision Transformer Fusion | | 0 |
| Instruct Me More! Random Prompting for Visual In-Context Learning | Code | 1 |
| 3DifFusionDet: Diffusion Model for 3D Object Detection with Robust LiDAR-Camera Fusion | | 0 |
| Data exploitation: multi-task learning of object detection and semantic segmentation on partially annotated data | Code | 0 |
| mmFUSION: Multimodal Fusion for 3D Objects Detection | | 0 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Code | 1 |
| Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box | Code | 1 |
| Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet | | 0 |
| TokenMotion: Motion-Guided Vision Transformer for Video Camouflaged Object Detection Via Learnable Token Selection | | 0 |
| NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment | Code | 1 |
| Scenario Diffusion: Controllable Driving Scenario Generation With Diffusion | | 0 |
| ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification | | 0 |
| Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs | Code | 1 |
| Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased Detector | Code | 1 |
| Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Code | 1 |
| Patch-based Selection and Refinement for Early Object Detection | Code | 1 |
| Towards Unsupervised Object Detection From LiDAR Point Clouds | | 0 |
| Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion | Code | 1 |
| Taking a PEEK into YOLOv5 for Satellite Component Recognition via Entropy-based Visual Explanations | | 0 |
| Quantitative Evaluation of a Multi-Modal Camera Setup for Fusing Event Data with RGB Images | | 0 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Code | 1 |
| AiluRus: A Scalable ViT Framework for Dense Prediction | Code | 0 |
| InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images | Code | 1 |
| CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation | | 0 |
| Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO | Code | 1 |
| CenterRadarNet: Joint 3D Object Detection and Tracking Framework using 4D FMCW Radar | | 0 |
| M&M3D: Multi-Dataset Training and Efficient Network for Multi-view 3D Object Detection | Code | 0 |
| Efficient Vision Transformer for Accurate Traffic Sign Detection | | 0 |
| Recognize Any Regions | Code | 1 |
| Scattering Vision Transformer: Spectral Mixing Matters | | 0 |
| Enhancing Traffic Object Detection in Variable Illumination with RGB-Event Fusion | Code | 0 |
| TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain | Code | 1 |
| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Code | 1 |
| HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Code | 1 |
| Spuriosity Rankings for Free: A Simple Framework for Last Layer Retraining Based on Object Detection | | 0 |
| View Classification and Object Detection in Cardiac Ultrasound to Localize Valves via Deep Learning | | 0 |
| YOLOv8-Based Visual Detection of Road Hazards: Potholes, Sewer Covers, and Manholes | | 0 |
| Battle of the Backbones: A Large-Scale Comparison of Pretrained Models across Computer Vision Tasks | Code | 2 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Code | 1 |
| TransXNet: Learning Both Global and Local Dynamics with a Dual Dynamic Token Mixer for Visual Recognition | Code | 2 |
| Radar-Lidar Fusion for Object Detection by Designing Effective Convolution Networks | | 0 |
| Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient ? | Code | 0 |
| Improving Online Source-free Domain Adaptation for Object Detection by Unsupervised Data Acquisition | | 0 |

Benchmark Results

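| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Co-DETR | box mAP | 66 | | Unverified |
| 2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified |
| 3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified |
| 4 | MoCaE | box mAP | 65.1 | | Unverified |
| 5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified |
| 6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified |
| 7 | EVA | box mAP | 64.7 | | Unverified |
| 8 | Group DETR v2 | box mAP | 64.5 | | Unverified |
| 9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified |
| 10 | InternImage-XL | box mAP | 64.3 | | Unverified |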