SOTAVerified

Object Detection

Papers

Showing 1651-1700 of 10957 papers

Title | Status | Hype
DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Code | 1
LeYOLO, New Scalable and Efficient CNN Architecture for Object Detection | Code | 2
SSAD: Self-supervised Auxiliary Detection Framework for Panoramic X-ray based Dental Disease Diagnosis | Code | 0
Towards the in-situ Trunk Identification and Length Measurement of Sea Cucumbers via Bézier Curve Modelling | Code | 0
Enhanced Bank Check Security: Introducing a Novel Dataset and Transformer-Based Approach for Detection and Verification | Code | 0
Visible-Thermal Tiny Object Detection: A Benchmark Dataset and Baselines | Code | 3
Advancements in Orthopaedic Arm Segmentation: A Comprehensive Review |  | 0
Snowy Scenes, Clear Detections: A Robust Model for Traffic Light Detection in Adverse Weather Conditions | Code | 0
DPO: Dual-Perturbation Optimization for Test-time Adaptation in 3D Object Detection | Code | 0
Strengthening Layer Interaction via Dynamic Layer Attention | Code | 0
Semantic Enhanced Few-shot Object Detection |  | 0
Aligning Models with Their Realization through Model-based Systems Engineering |  | 0
A machine learning pipeline for automated insect monitoring |  | 0
Competitive Learning for Achieving Content-specific Filters in Video Coding for Machines |  | 0
SDNIA-YOLO: A Robust Object Detection Model for Extreme Weather Conditions |  | 0
Privacy Preserving Federated Learning in Medical Imaging with Uncertainty Estimation | Code | 0
The Solution for CVPR2024 Foundational Few-Shot Object Detection Challenge |  | 0
Online Anchor-based Training for Image Classification Tasks |  | 0
DASSF: Dynamic-Attention Scale-Sequence Fusion for Aerial Object Detection |  | 0
Certified ML Object Detection for Surveillance Missions |  | 0
ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection | Code | 0
Overlap Suppression Clustering for Offline Multi-Camera People Tracking |  | 0
Scaling Efficient Masked Image Modeling on Large Remote Sensing Dataset | Code | 2
Multi-Dimensional Pruning: Joint Channel, Layer and Block Pruning with Latency Constraint |  | 0
YOLO9tr: A Lightweight Model for Pavement Damage Detection Utilizing a Generalized Efficient Layer Aggregation Network and Attention Mechanism | Code | 1
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results |  | 0
YOLO-FEDER FusionNet: A Novel Deep Learning Architecture for Drone Detection |  | 0
OoDIS: Anomaly Instance Segmentation Benchmark | Code | 1
Low-power Ship Detection in Satellite Images Using Neuromorphic Hardware |  | 0
Enhancing Generalizability of Representation Learning for Data-Efficient 3D Scene Understanding |  | 0
Semi-Supervised Domain Adaptation Using Target-Oriented Domain Augmentation for 3D Object Detection | Code | 0
Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection | Code | 0
SparseDet: A Simple and Effective Framework for Fully Sparse LiDAR-based 3D Object Detection |  | 0
Open-Vocabulary X-ray Prohibited Item Detection via Fine-tuning CLIP |  | 0
Object Detection using Oriented Window Learning Vision Transformer: Roadway Assets Recognition |  | 0
SparseRadNet: Sparse Perception Neural Network on Subsampled Radar Data |  | 0
Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection | Code | 2
MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception | Code | 0
What is the Visual Cognition Gap between Humans and Multimodal LLMs? | Code | 0
YOLOv1 to YOLOv10: A comprehensive review of YOLO variants and their application in the agricultural domain |  | 0
EFM3D: A Benchmark for Measuring Progress Towards 3D Egocentric Foundation Models | Code | 2
Shelf-Supervised Cross-Modal Pre-Training for 3D Object Detection | Code | 0
Automated GIS-Based Framework for Detecting Crosswalk Changes from Bi-Temporal High-Resolution Aerial Images |  | 0
Towards Evaluating the Robustness of Visual State Space Models | Code | 1
Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models | Code | 3
Computer vision-based model for detecting turning lane features on Florida's public roadways |  | 0
DenoiseRep: Denoising Model for Representation Learning | Code | 1
STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery | Code | 2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection | Code | 2
Enhanced Object Detection: A Study on Vast Vocabulary Object Detection Track for V3Det Challenge 2024 |  | 0
Page 34 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 |  | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 |  | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 |  | Unverified
4 | MoCaE | box mAP | 65.1 |  | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 |  | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 |  | Unverified
7 | EVA | box mAP | 64.7 |  | Unverified
8 | Group DETR v2 | box mAP | 64.5 |  | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 |  | Unverified
10 | InternImage-XL | box mAP | 64.3 |  | Unverified