SOTAVerified

Object Detection

Papers

Showing 6201–6250 of 10957 papers

Title | Status | Hype
A General Divergence Modeling Strategy for Salient Object Detection | | 0
Focal and Global Knowledge Distillation for Detectors | Code | 1
Metamorphic Adversarial Detection Pipeline for Face Recognition Systems | | 0
Lightweight Transformer Backbone for Medical Object Detection | | 0
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Code | 1
Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images | Code | 1
MetaFormer Is Actually What You Need for Vision | Code | 2
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture | Code | 1
L-Verse: Bidirectional Generation Between Image and Text | Code | 1
Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras | Code | 1
Benchmarking Detection Transfer Learning with Vision Transformers | Code | 1
Dense Uncertainty Estimation via an Ensemble-based Conditional Latent Variable Model | | 0
MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection | Code | 1
Class-agnostic Object Detection with Multi-modal Transformer | Code | 1
Conifer Seedling Detection in UAV-Imagery with RGB-Depth Information | | 0
Florence: A New Foundation Model for Computer Vision | Code | 1
FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection | Code | 1
Efficient Softmax Approximation for Deep Neural Networks with Attention Mechanism | | 0
HoughCL: Finding Better Positive Pairs in Dense Self-supervised Learning | | 0
FBNetV5: Neural Architecture Search for Multiple Tasks in One Run | | 0
Grounded Situation Recognition with Transformers | Code | 1
Swin Transformer V2: Scaling Up Capacity and Resolution | Code | 1
Boosting Supervised Learning Performance with Co-training | | 0
Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level Estimation | Code | 1
LiDAR Cluster First and Camera Inference Later: A New Perspective Towards Autonomous Driving | | 0
TransMix: Attend to Mix for Vision Transformers | Code | 1
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Code | 1
Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking Trackers | Code | 1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data | Code | 1
SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining | Code | 1
TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Code | 1
Single-stage uav detection and classification with yolov5: Mosaic data augmentation and panet | Code | 0
On Vision Features in Multimodal Machine Translation | | 0
TextMosaic: A New Data Augmentation Method for Named Entity Recognition Using Document-Level Contexts | | 0
Postdisaster image-based damage detection and repair cost estimation of reinforced concrete buildings using dual convolutional neural networks | | 0
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Code | 1
Single Image Object Counting and Localizing using Active-Learning | | 0
Semantically Grounded Object Matching for Robust Robotic Scene Rearrangement | Code | 0
iBOT: Image BERT Pre-Training with Online Tokenizer | Code | 1
Attention Mechanisms in Computer Vision: A Survey | Code | 2
Robust and Accurate Object Detection via Self-Knowledge Distillation | Code | 0
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Code | 1
Fracture Detection in Wrist X-ray Images Using Deep Learning-Based Object Detection Models | Code | 0
Factorial Convolution Neural Networks | | 0
Visual Understanding of Complex Table Structures from Document Images | | 0
Can neural networks predict dynamics they have never seen? | | 0
Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection | Code | 1
Masked Autoencoders Are Scalable Vision Learners | Code | 1
Indian Licence Plate Dataset in the wild | Code | 1
Towards Live Video Analytics with On-Drone Deeper-yet-Compatible Compression | | 0
Page 125 of 220

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified