SOTAVerified

Object Detection

Papers

Showing 3001-3050 of 10957 papers

Title | Status | Hype
LightViT: Towards Light-Weight Convolution-Free Vision Transformers | Code | 1
Tied Block Convolution: Leaner and Better CNNs with Shared Thinner Filters | Code | 1
Point2Seq: Detecting 3D Objects as Sequences | Code | 1
Discovering A Variety of Objects in Spatio-Temporal Human-Object Interactions | Code | 1
Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge | Code | 1
Lightweight RGB-D Salient Object Detection from a Speed-Accuracy Tradeoff Perspective | Code | 1
Polarity Loss for Zero-shot Object Detection | Code | 1
Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated Convolution for Oriented Object Detection | Code | 1
LLA: Loss-aware Label Assignment for Dense Pedestrian Detection | Code | 1
Discriminative Co-Saliency and Background Mining Transformer for Co-Salient Object Detection | Code | 1
TransCenter: Transformers with Dense Representations for Multiple-Object Tracking | Code | 1
LineCounter: Learning Handwritten Text Line Segmentation by Counting | Code | 1
Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving | Code | 1
TogetherNet: Bridging Image Restoration and Object Detection Together via Dynamic Enhancement Learning | Code | 1
Region Similarity Representation Learning | Code | 1
Disentangled High Quality Salient Object Detection | Code | 1
Disentangled Non-Local Neural Networks | Code | 1
Disentangled Pre-training for Human-Object Interaction Detection | Code | 1
Scaling Local Self-Attention for Parameter Efficient Visual Backbones | Code | 1
Disentangling 3D Prototypical Networks For Few-Shot Concept Learning | Code | 1
LiteYOLO-ID: A Lightweight Object Detection Network for Insulator Defect Detection | Code | 1
Lite-FPN for Keypoint-based Monocular 3D Object Detection | Code | 1
Towards Physically Realizable Adversarial Attacks in Embodied Vision Navigation | Code | 1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Code | 1
Boosting R-CNN: Reweighting R-CNN Samples by RPN's Error for Underwater Object Detection | Code | 1
Disjoint Masking with Joint Distillation for Efficient Masked Image Modeling | Code | 1
Towards Accurate Ground Plane Normal Estimation from Ego-Motion | Code | 1
Towards Accurate One-Stage Object Detection with AP-Loss | Code | 1
Dispel Darkness for Better Fusion: A Controllable Visual Enhancer based on Cross-modal Conditional Adversarial Learning | Code | 1
Disp R-CNN: Stereo 3D Object Detection via Shape Prior Guided Instance Disparity Estimation | Code | 1
PiCANet: Pixel-wise Contextual Attention Learning for Accurate Saliency Detection | Code | 0
Conditional Set Generation with Transformers | Code | 0
Conditional Negative Sampling for Contrastive Learning of Visual Representations | Code | 0
Conditional and Residual Methods in Scalable Coding for Humans and Machines | Code | 0
AShapeFormer: Semantics-Guided Object-Level Active Shape Encoding for 3D Object Detection via Transformers | Code | 0
Perspective-aware Convolution for Monocular 3D Object Detection | Code | 0
Context-Aware Dynamic Feature Extraction for 3D Object Detection in Point Clouds | Code | 0
Performance Evaluation of Semi-supervised Learning Frameworks for Multi-Class Weed Detection | Code | 0
A Separable Self-attention Inspired by the State Space Model for Computer Vision | Code | 0
Per-frame mAP Prediction for Continuous Performance Monitoring of Object Detection During Deployment | Code | 0
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection | Code | 0
Performance Evaluation of Real-Time Object Detection for Electric Scooters | Code | 0
Percept, Memory, and Imagine: World Feature Simulating for Open-Domain Unknown Object Detection | Code | 0
KAN-RCBEVDepth: A multi-modal fusion algorithm in object detection for autonomous driving | Code | 0
Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Code | 0
PEEKABOO: Hiding parts of an image for unsupervised object localization | Code | 0
A Semantic-Aware and Multi-Guided Network for Infrared-Visible Image Fusion | Code | 0
Pelee: A Real-Time Object Detection System on Mobile Devices | Code | 0
Computer Vision and Normalizing Flow-Based Defect Detection | Code | 0
Computer Vision Aided mmWave Beam Alignment in V2X Communications | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Co-DETR | box mAP | 66 | | Unverified
2 | InternImage-H (M3I Pre-training) | box mAP | 65.5 | | Unverified
3 | M3I Pre-training (InternImage-H) | box mAP | 65.4 | | Unverified
4 | MoCaE | box mAP | 65.1 | | Unverified
5 | Co-DETR (Swin-L) | box mAP | 64.8 | | Unverified
6 | Focal-Stable-DINO (Focal-Huge, no TTA) | box mAP | 64.8 | | Unverified
7 | EVA | box mAP | 64.7 | | Unverified
8 | Group DETR v2 | box mAP | 64.5 | | Unverified
9 | FocalNet-H (DINO) | box mAP | 64.4 | | Unverified
10 | InternImage-XL | box mAP | 64.3 | | Unverified