SOTAVerified

Object Detection

Papers

Showing 22012250 of 10957 papers

TitleStatusHype
Cross-domain and Cross-dimension Learning for Image-to-Graph TransformersCode0
Reframe Anything: LLM Agent for Open World Video Reframing0
Enhancing 3D Object Detection with 2D Detection-Guided Query AnchorsCode1
Poly Kernel Inception Network for Remote Sensing DetectionCode2
Transformer based Multitask Learning for Image Captioning and Object Detection0
Cross-Cluster Shifting for Efficient and Effective 3D Object Detection in Autonomous Driving0
V_kD: Improving Knowledge Distillation using Orthogonal ProjectionsCode2
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionCode2
Frequency Attention for Knowledge DistillationCode1
Not just Birds and Cars: Generic, Scalable and Explainable Models for Professional Visual Recognition0
EVD4UAV: An Altitude-Sensitive Benchmark to Evade Vehicle Detection in UAVCode0
SIRST-5K: Exploring Massive Negatives Synthesis with Self-supervised Learning for Robust Infrared Small Target DetectionCode1
Frequency-Adaptive Dilated Convolution for Semantic SegmentationCode2
VLM-PL: Advanced Pseudo Labeling Approach for Class Incremental Object Detection via Vision-Language Model0
Improving the Successful Robotic Grasp Detection Using Convolutional Neural Networks0
ActFormer: Scalable Collaborative Perception via Active Queries0
Exploring Robust Features for Few-Shot Object Detection in Satellite ImageryCode1
RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR FeaturesCode1
LanePtrNet: Revisiting Lane Detection as Point Voting and Grouping on Curves0
Möbius Transform for Mitigating Perspective Distortions in Representation Learning0
ComFe: Interpretable Image Classifiers With Foundation Models, Transformers and Component FeaturesCode0
CN-RMA: Combined Network with Ray Marching Aggregation for 3D Indoors Object Detection from Multi-view ImagesCode1
ACC-ViT : Atrous Convolution's Comeback in Vision Transformers0
Effectiveness Assessment of Recent Large Vision-Language Models0
FriendNet: Detection-Friendly Dehazing NetworkCode1
Investigation of the Impact of Synthetic Training Data in the Industrial Application of Terminal Strip Object Detection0
Performance Evaluation of Semi-supervised Learning Frameworks for Multi-Class Weed DetectionCode0
Adversarial Infrared Geometry: Using Geometry to Perform Adversarial Attack against Infrared Pedestrian Detectors0
CMDA: Cross-Modal and Domain Adversarial Adaptation for LiDAR-Based 3D Object Detection0
FLAME Diffuser: Wildfire Image Synthesis using Mask Guided DiffusionCode1
Multi-task Learning for Real-time Autonomous Driving Leveraging Task-adaptive Attention Generator0
Loss Design for Single-carrier Joint Communication and Neural Network-based Sensing0
Detecting Concrete Visual Tokens for Multimodal Machine Translation0
Are Dense Labels Always Necessary for 3D Object Detection from Point Cloud?Code0
FastOcc: Accelerating 3D Occupancy Prediction by Fusing the 2D Bird's-Eye View and Perspective View0
BSDP: Brain-inspired Streaming Dual-level Perturbations for Online Open World Object Detection0
Bootstrapping Rare Object Detection in High-Resolution Satellite Imagery0
False Positive Sampling-based Data Augmentation for Enhanced 3D Object Detection Accuracy0
COMMIT: Certifying Robustness of Multi-Sensor Fusion Systems against Semantic Attacks0
PillarGen: Enhancing Radar Point Cloud Density and Quality via Pillar-based Point Generation Network0
Zero-shot Generalizable Incremental Learning for Vision-Language Object DetectionCode1
Explicit Motion Handling and Interactive Prompting for Video Camouflaged Object Detection0
Scalable Vision-Based 3D Object Detection and Monocular Depth Estimation for Autonomous DrivingCode1
NiNformer: A Network in Network Transformer with Token Mixing Generated Gating FunctionCode0
Leveraging Anchor-based LiDAR 3D Object Detection via Point Assisted Sample SelectionCode0
Lightweight Object Detection: A Study Based on YOLOv7 Integrated with ShuffleNetv2 and Vision Transformer0
MCA: Moment Channel Attention NetworksCode0
Self-Supervised Representation Learning with Meta Comprehensive Regularization0
Run-time Introspection of 2D Object Detection in Automated Driving Systems Using Learning Representations0
TUMTraf V2X Cooperative Perception DatasetCode4
Show:102550
← PrevPage 45 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified