SOTAVerified

Object Detection

Papers

Showing 601-650 of 10957 papers

Title | Status | Hype
Feature Pyramid Networks for Object Detection | Code | 2
SSD: Single Shot MultiBox Detector | Code | 2
Fast R-CNN | Code | 2
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations | Code | 1
Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed Augmentation | Code | 1
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection | Code | 1
Multiple Object Stitching for Unsupervised Representation Learning | Code | 1
Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector | Code | 1
GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal | Code | 1
OD3: Optimization-free Dataset Distillation for Object Detection | Code | 1
Adaptive Semantic Token Communication for Transformer-based Edge Inference | Code | 1
AdvReal: Adversarial Patch Generation Framework with Application to Adversarial Safety Evaluation of Object Detection Systems | Code | 1
Decoupling Classifier for Boosting Few-shot Object Detection and Instance Segmentation | Code | 1
AGI-Elo: How Far Are We From Mastering A Task? | Code | 1
M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Fusion Object Detection | Code | 1
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Code | 1
M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark | Code | 1
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer | Code | 1
A Simple Detector with Frame Dynamics is a Strong Tracker | Code | 1
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | Code | 1
DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion | Code | 1
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Code | 1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Code | 1
E-InMeMo: Enhanced Prompting for Visual In-Context Learning | Code | 1
A Multimodal Hybrid Late-Cascade Fusion Network for Enhanced 3D Object Detection | Code | 1
SAGA: Semantic-Aware Gray color Augmentation for Visible-to-Thermal Domain Adaptation across Multi-View Drone and Ground-Based Vision Systems | Code | 1
Visual Consensus Prompting for Co-Salient Object Detection | Code | 1
Lightweight LiDAR-Camera 3D Dynamic Object Detection and Multi-Class Trajectory Prediction | Code | 1
DRIFT open dataset: A drone-derived intelligence for traffic analysis in urban environmen | Code | 1
LEMUR Neural Network Dataset: Towards Seamless AutoML | Code | 1
Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Code | 1
RT-DATR: Real-time Unsupervised Domain Adaptive Detection Transformer with Adversarial Feature Learning | Code | 1
Pychop: Emulating Low-Precision Arithmetic in Numerical Methods and Neural Networks | Code | 1
Hyperspectral Remote Sensing Images Salient Object Detection: The First Benchmark Dataset and Baseline | Code | 1
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision | Code | 1
Rip Current Segmentation: A Novel Benchmark and YOLOv8 Baseline Results | Code | 1
CaLiV: LiDAR-to-Vehicle Calibration of Arbitrary Sensor Setups via Object Reconstruction | Code | 1
Spectral-Adaptive Modulation Networks for Visual Perception | Code | 1
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Code | 1
Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Code | 1
Mind the Gap: Benchmarking Spatial Reasoning in Vision-Language Models | Code | 1
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery | Code | 1
Superpowering Open-Vocabulary Object Detectors for X-ray Vision | Code | 1
UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection Framework | Code | 1
Robust Object Detection of Underwater Robot based on Domain Generalization | Code | 1
Is Discretization Fusion All You Need for Collaborative Perception? | Code | 1
State Space Model Meets Transformer: A New Paradigm for 3D Object Detection | Code | 1
RoCo-Sim: Enhancing Roadside Collaborative Perception through Foreground Simulation | Code | 1
A Hierarchical Semantic Distillation Framework for Open-Vocabulary Object Detection | Code | 1
Accelerate 3D Object Detection Models via Zero-Shot Attention Key Pruning | Code | 1
Page 13 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 |  | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 |  | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 |  | Unverified
4 | MoCaE | box mAP | 65.1 |  | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 |  | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 |  | Unverified
7 | EVA | box mAP | 64.7 |  | Unverified
8 | Group DETR v2 | box mAP | 64.5 |  | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 |  | Unverified
10 | InternImage-XL | box mAP | 64.3 |  | Unverified