SOTAVerified

Object Detection

Papers

Showing 2001–2050 of 10957 papers

Title | Status | Hype
Robust End-to-End Focal Liver Lesion Detection using Unregistered Multiphase Computed Tomography Images | Code | 1
MutualFormer: Multi-Modality Representation Learning via Cross-Diffusion Attention | Code | 1
MViTv2: Improved Multiscale Vision Transformers for Classification and Detection | Code | 1
Pooling by Sliced-Wasserstein Embedding | Code | 1
Container: Context Aggregation Networks | Code | 1
Object-Aware Cropping for Self-Supervised Learning | Code | 1
Confidence Propagation Cluster: Unleash Full Potential of Object Detectors | Code | 1
The Norm Must Go On: Dynamic Unsupervised Domain Adaptation by Normalization | Code | 1
Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning | Code | 1
Focal Attention for Long-Range Interactions in Vision Transformers | Code | 1
Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement | Code | 1
Event-Based Fusion for Motion Deblurring with Cross-modal Attention | Code | 1
TALISMAN: Targeted Active Learning for Object Detection with Rare Classes and Slices using Submodular Mutual Information | Code | 1
A Unified Pruning Framework for Vision Transformers | Code | 1
Attentive Prototypes for Source-free Unsupervised Domain Adaptive 3D Object Detection | Code | 1
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion | Code | 1
Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity | Code | 1
Searching the Search Space of Vision Transformer | Code | 1
NomMer: Nominate Synergistic Context in Vision Transformer for Visual Recognition | Code | 1
CDNet is all you need: Cascade DCN based underwater object detection RCNN | Code | 1
BoxeR: Box-Attention for 2D and 3D Transformers | Code | 1
Detecting and Tracking Small and Dense Moving Objects in Satellite Videos: A Benchmark | Code | 1
Cross-Domain Adaptive Teacher for Object Detection | Code | 1
PeCo: Perceptual Codebook for BERT Pre-training of Vision Transformers | Code | 1
Focal and Global Knowledge Distillation for Detectors | Code | 1
Few-Shot Object Detection via Association and DIscrimination | Code | 1
Florence: A New Foundation Model for Computer Vision | Code | 1
Learning to Aggregate Multi-Scale Context for Instance Segmentation in Remote Sensing Images | Code | 1
Class-agnostic Object Detection with Multi-modal Transformer | Code | 1
Tracking Grow-Finish Pigs Across Large Pens Using Multiple Cameras | Code | 1
Benchmarking Detection Transfer Learning with Vision Transformers | Code | 1
Many Heads but One Brain: Fusion Brain -- a Competition and a Single Multimodal Multitask Architecture | Code | 1
L-Verse: Bidirectional Generation Between Image and Text | Code | 1
MUM : Mix Image Tiles and UnMix Feature Tiles for Semi-Supervised Object Detection | Code | 1
FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks | Code | 1
FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection | Code | 1
Grounded Situation Recognition with Transformers | Code | 1
Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Point Density Level Estimation | Code | 1
Open Vocabulary Object Detection with Pseudo Bounding-Box Labels | Code | 1
TransMix: Attend to Mix for Vision Transformers | Code | 1
Swin Transformer V2: Scaling Up Capacity and Resolution | Code | 1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data | Code | 1
Tracklet-Switch Adversarial Attack against Pedestrian Multi-Object Tracking Trackers | Code | 1
SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining | Code | 1
TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video | Code | 1
Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts | Code | 1
iBOT: Image BERT Pre-Training with Online Tokenizer | Code | 1
Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Code | 1
Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection | Code | 1
Indian Licence Plate Dataset in the wild | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified