SOTAVerified

Object Detection

Papers

Showing 1676–1700 of 10957 papers

Title | Status | Hype
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results | - | 0
YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection | - | 0
OoDIS: Anomaly Instance Segmentation Benchmark | Code | 1
Low-power Ship Detection in Satellite Images Using Neuromorphic Hardware | - | 0
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding | - | 0
Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Code | 0
Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection | Code | 0
SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection | - | 0
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP | - | 0
Object Detection using Oriented Window Learning Vision Transformer: Roadway Assets Recognition | - | 0
SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data | - | 0
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection | Code | 2
MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | Code | 0
What is the Visual Cognition Gap between Humans and Multimodal LLMs? | Code | 0
YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain | - | 0
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Code | 2
Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection | Code | 0
Automated GIS-Based Framework for Detecting Crosswalk Changes from Bi-Temporal High-Resolution Aerial Images | - | 0
Towards Evaluating the Robustness of Visual State Space Models | Code | 1
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Code | 3
Computer vision-based model for detecting turning lane features on Florida's public roadways | - | 0
DenoiseRep: Denoising Model for Representation Learning | Code | 1
STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery | Code | 2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection | Code | 2
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024 | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | - | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | - | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | - | Unverified
4 | MoCaE | box mAP | 65.1 | - | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | - | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | - | Unverified
7 | EVA | box mAP | 64.7 | - | Unverified
8 | Group DETR v2 | box mAP | 64.5 | - | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | - | Unverified
10 | InternImage-XL | box mAP | 64.3 | - | Unverified