SOTAVerified

Object Detection

Papers

Showing 46514700 of 10957 papers

TitleStatusHype
Advanced Knowledge Extraction of Physical Design Drawings, Translation and conversion to CAD formats using Deep Learning0
Self-supervised co-salient object detection via feature correspondence at multiple scalesCode0
V2X-DGW: Domain Generalization for Multi-agent Perception under Adverse Weather Conditions0
Intelligent Railroad Grade Crossing: Leveraging Semantic Segmentation and Object Detection for Enhanced Safety0
GRA: Detecting Oriented Objects through Group-wise Rotating and Attention0
FishNet: Deep Neural Networks for Low-Cost Fish Stock Estimation0
Detection of Fast-Moving Objects with Neuromorphic Hardware0
Cannabis Seed Variant Detection using Faster R-CNN0
SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception0
CSDNet: Detect Salient Object in Depth-Thermal via A Lightweight Cross Shallow and Deep Perception Network0
A Hybrid SNN-ANN Network for Event-based Object Detection with Spatial and Temporal Attention0
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest0
Improving Distant 3D Object Detection Using 2D Box Supervision0
D-YOLO a robust framework for object detection in adverse weather conditions0
Open-Vocabulary Object Detection with Meta Prompt Representation and Instance Contrastive Optimization0
SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph0
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object DetectorsCode0
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring0
FieldNet: Efficient Real-Time Shadow Removal for Enhanced Vision in Field Robotics0
CLIP-BEVFormer: Enhancing Multi-View Image-Based BEV Detector with Ground Truth Flow0
Advancing Security in AI Systems: A Novel Approach to Detecting Backdoors in Deep Neural Networks0
A Multimodal Fusion Network For Student Emotion Recognition Based on Transformer and Tensor Product0
Improved YOLOv5 Based on Attention Mechanism and FasterNet for Foreign Object Detection on Railway and Airway tracks0
FogGuard: guarding YOLO against fog using perceptual lossCode0
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection0
Aedes aegypti Egg Counting with Neural Networks for Object Detection0
PeLK: Parameter-efficient Large Kernel ConvNets with Peripheral Convolution0
Eliminating Cross-modal Conflicts in BEV Space for LiDAR-Camera 3D Object DetectionCode0
JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection0
Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference0
A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions0
SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection0
Inception-YOLO: Computational cost and accuracy improvement of the YOLOv5 model based on employing modified CSP, SPPF, and inception modules0
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation StrategiesCode0
Evaluating the Energy Efficiency of Few-Shot Learning for Object Detection in Industrial Settings0
Cross-domain and Cross-dimension Learning for Image-to-Graph TransformersCode0
Genetic Learning for Designing Sim-to-Real Data AugmentationsCode0
Out-of-distribution Partial Label Learning0
LeOCLR: Leveraging Original Images for Contrastive Learning of Visual Representations0
Transformer based Multitask Learning for Image Captioning and Object Detection0
Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving0
Reframe Anything: LLM Agent for Open World Video Reframing0
Improving the Successful Robotic Grasp Detection Using Convolutional Neural Networks0
EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAVCode0
LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves0
VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model0
ActFormer: Scalable Collaborative Perception via Active Queries0
Not just Birds and Cars: Generic, Scalable and Explainable Models for Professional Visual Recognition0
ACC-ViT : Atrous Convolution's Comeback in Vision Transformers0
Möbius Transform for Mitigating Perspective Distortions in Representation Learning0
Show:102550
← PrevPage 94 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified