SOTAVerified

Object Detection

Papers

Showing 23012350 of 10957 papers

TitleStatusHype
LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception TasksCode1
Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI PoolingCode1
SDGE: Stereo Guided Depth Estimation for 360^ Camera Sets0
Reinforcement Learning as a Parsimonious Alternative to Prediction Cascades: A Case Study on Image SegmentationCode0
UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object TrackingCode1
LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object DetectionCode1
A Multispectral Automated Transfer Technique (MATT) for machine-driven image labeling utilizing the Segment Anything Model (SAM)0
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object DetectionCode2
GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph CreationCode1
ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual ConnectionsCode1
Modular Graph Extraction for Handwritten Circuit Diagram Images0
CodaMal: Contrastive Domain Adaptation for Malaria Detection in Low-Cost MicroscopesCode0
STF: Spatio-Temporal Fusion Module for Improving Video Object DetectionCode0
AutoGPT+P: Affordance-based Task Planning with Large Language Models0
SAWEC: Sensing-Assisted Wireless Edge ComputingCode0
LLMs as Bridges: Reformulating Grounded Multimodal Named Entity RecognitionCode1
A Comprehensive Review on Computer Vision Analysis of Aerial Data0
Efficient One-stage Video Object Detection by Exploiting Temporal ConsistencyCode1
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture DetectionCode2
Switch EMA: A Free Lunch for Better Flatness and SharpnessCode1
Few-Shot Object Detection with Sparse Context Transformers0
TDViT: Temporal Dilated Video Transformer for Dense Video TasksCode1
Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss0
Leveraging Self-Supervised Instance Contrastive Learning for Radar Object Detection0
Object Detection in Thermal Images Using Deep Learning for Unmanned Aerial Vehicles0
AYDIV: Adaptable Yielding 3D Object Detection via Integrated Contextual Vision TransformerCode1
MODIPHY: Multimodal Obscured Detection for IoT using PHantom Convolution-Enabled Faster YOLOCode1
Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute SystemsCode0
A Flow-based Credibility Metric for Safety-critical Pedestrian Detection0
Semantic Object-level Modeling for Robust Visual Camera Relocalization0
Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm SurveillanceCode0
Transfer learning with generative models for object detection on limited datasets0
Event-to-Video Conversion for Overhead Object Detection0
Neural Rendering based Urban Scene Reconstruction for Autonomous Driving0
Scrapping The Web For Early Wildfire Detection: A New Annotated Dataset of Images and Videos of Smoke Plumes In-the-wild0
InstaGen: Enhancing Object Detection by Training on Synthetic Dataset0
Using YOLO v7 to Detect Kidney in Magnetic Resonance Imaging0
Streamlined Hybrid Annotation Framework using Scalable Codestream for Bandwidth-Restricted UAV Object Detection0
G-NAS: Generalizable Neural Architecture Search for Single Domain Generalization Object DetectionCode1
Shape-biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose EstimationCode0
Toward Accurate Camera-based 3D Object Detection via Cascade Depth Estimation and Calibration0
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation ModelsCode2
LLMs Meet VLMs: Boost Open Vocabulary Object Detection with Fine-grained Descriptors0
0-1 laws for pattern occurrences in phylogenetic trees and networks0
Breaking Data Silos: Cross-Domain Learning for Multi-Agent Perception from Independent Private SourcesCode0
YOLOPoint Joint Keypoint and Object DetectionCode2
Ray Denoising: Depth-aware Hard Negative Sampling for Multi-view 3D Object DetectionCode2
Improving Robustness of LiDAR-Camera Fusion Model against Weather Corruption from Fusion Strategy Perspective0
ActiveAnno3D -- An Active Learning Framework for Multi-Modal 3D Object DetectionCode4
Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object DetectorCode2
Show:102550
← PrevPage 47 of 220Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Co-DETRbox mAP66Unverified
2InternImage-H (M3I Pre-training)box mAP65.5Unverified
3M3I Pre-training (InternImage-H)box mAP65.4Unverified
4MoCaEbox mAP65.1Unverified
5Co-DETR (Swin-L)box mAP64.8Unverified
6Focal-Stable-DINO (Focal-Huge, no TTA)box mAP64.8Unverified
7EVAbox mAP64.7Unverified
8Group DETR v2box mAP64.5Unverified
9FocalNet-H (DINO)box mAP64.4Unverified
10InternImage-XLbox mAP64.3Unverified