SOTAVerified

Object Detection

Papers

Showing 28512900 of 10957 papers

TitleStatusHype
Radar-Lidar Fusion for Object Detection by Designing Effective Convolution Networks0
A High-Resolution Dataset for Instance Detection with Multi-View Instance CaptureCode1
Seeking Flat Minima with Mean Teacher on Semi- and Weakly-Supervised Domain Generalization for Object Detection0
Dynamic V2X Autonomous Perception from Road-to-Vehicle Vision0
Out-of-distribution Object Detection through Bayesian Uncertainty Estimation0
PrObeD: Proactive Object Detection WrapperCode0
Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models0
ODM3D: Alleviating Foreground Sparsity for Semi-Supervised Monocular 3D Object DetectionCode0
Efficient Object Detection in Optical Remote Sensing Imagery via Attention-based Feature Distillation0
Mobile Application for Oral Disease Detection using Federated Learning0
HSIC-based Moving WeightAveraging for Few-Shot Open-Set Object DetectionCode1
Improving the Performance of Object Detection by Preserving Balanced Class DistributionCode0
Learning Extrinsic Dexterity with Parameterized Manipulation Primitives0
DecoderTracker: Decoder-Only Method for Multiple-Object Tracking0
Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Federated Object DetectionCode1
IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like SettingCode1
LP-OVOD: Open-Vocabulary Object Detection by Linear ProbingCode1
torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP0
CosmosDSR -- a methodology for automated detection and tracking of orbital debris using the Unscented Kalman Filter0
YOLO-BEV: Generating Bird's-Eye View in the Same Way as 2D Object Detection0
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object DetectionCode1
ParisLuco3D: A high-quality target dataset for domain generalization of LiDAR perception0
DiffRef3D: A Diffusion-based Proposal Refinement Framework for 3D Object Detection0
Proposal-Contrastive Pretraining for Object Detection from Fewer Data0
MVFAN: Multi-View Feature Assisted Network for 4D Radar Object Detection0
Leveraging Vision-Centric Multi-Modal Expertise for 3D Object DetectionCode3
Salient Object Detection in RGB-D VideosCode1
Mean Teacher DETR with Masked Feature Alignment: A Robust Domain Adaptive Detection Transformer Framework0
Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection0
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting0
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object DetectionCode1
Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLACode1
Pre-Training LiDAR-Based 3D Object Detectors Through ColorizationCode0
Rethinking Scale Imbalance in Semi-supervised Object Detection for Aerial Images0
MaRU: A Manga Retrieval and Understanding System Connecting Vision and Language0
The Importance of Anti-Aliasing in Tiny Object DetectionCode0
Skipped Feature Pyramid Network with Grid Anchor for Object Detection0
OV-VG: A Benchmark for Open-Vocabulary Visual GroundingCode1
Deep MDP: A Modular Framework for Multi-Object TrackingCode0
Guidance system for Visually Impaired Persons using Deep Learning and Optical flow0
Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing ImagesCode1
Fuzzy-NMS: Improving 3D Object Detection with Fuzzy Classification in NMS0
A review of individual tree crown detection and delineation from optical remote sensing images0
EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye ViewCode1
ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map ConstructionCode1
Zone Evaluation: Revealing Spatial Bias in Object DetectionCode1
Multi‑camera trajectory matching based on hierarchical clustering and constraintsCode1
RTNH+: Enhanced 4D Radar Object Detection Network using Combined CFAR-based Two-level Preprocessing and Vertical Encoding0
DT/MARS-CycleGAN: Improved Object Detection for MARS Phenotyping Robot0
Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond0
Show:102550
← PrevPage 58 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified