SOTAVerified

Object Detection

Papers

Showing 15011550 of 10957 papers

TitleStatusHype
Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection0
Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety AnalysisCode1
RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies0
A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection InferenceCode0
MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images0
EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition0
Bucketed Ranking-based Losses for Efficient Training of Object DetectorsCode1
Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation0
Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks0
SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous DrivingCode0
Learning Visual Grounding from Generative Vision and Language Model0
DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection0
General Geometry-aware Weakly Supervised 3D Object DetectionCode1
Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label DistillationCode0
FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection0
Learning Camouflaged Object Detection from Noisy Pseudo Label0
GroupMamba: Efficient Group-Based Visual State Space ModelCode2
ColorMAE: Exploring data-independent masking strategies in Masked AutoEncodersCode0
AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm QuantizerCode1
Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised SegmentationCode0
Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients0
Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object DetectionCode1
CerberusDet: Unified Multi-Dataset Object DetectionCode1
GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook RetrievalCode2
Enhancing Wrist Fracture Detection with YOLOCode0
Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation0
Embracing Events and Frames with Hierarchical Feature Refinement Network for Object DetectionCode1
Generative AI Driven Task-Oriented Adaptive Semantic Communications0
PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer0
LaMI-DETR: Open-Vocabulary Detection with Language Model InstructionCode2
Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object DetectionCode1
Monocular pose estimation of articulated surgical instruments in open surgery0
Relation DETR: Exploring Explicit Position Relation Prior for Object DetectionCode3
Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded ScenesCode2
TCFormer: Visual Recognition via Token Clustering TransformerCode3
The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities0
MaskVD: Region Masking for Efficient Video Object Detection0
AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs0
Improving Unsupervised Video Object Segmentation via Fake Flow Generation0
OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal ModelsCode1
OVLW-DETR: Open-Vocabulary Light-Weighted Detection TransformerCode3
Anticipating Future Object Compositions without Forgetting0
Interpreting Hand gestures using Object Detection and Digits Classification0
RepVF: A Unified Vector Fields Representation for Multi-task 3D PerceptionCode1
OPEN: Object-wise Position Embedding for Multi-view 3D Object DetectionCode2
Backdoor Attacks against Image-to-Image Networks0
FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object DetectionCode1
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape DataCode0
Augmented Neural Fine-Tuning for Efficient Backdoor PurificationCode1
LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object DetectionCode1
Show:102550
← PrevPage 31 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified