Object Detection

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 201–250 of 10957 papers

Title	Date	Tasks	Status	Hype
Efficient Teacher: Semi-Supervised Object Detection for YOLOv5	Feb 15, 2023	Objectobject-detection	CodeCode Available	2
EFFOcc: A Minimal Baseline for EFficient Fusion-based 3D Occupancy Network	Jun 11, 2024	3D Object DetectionActive Learning	CodeCode Available	2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection	Feb 23, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
EMOv2: Pushing 5M Vision Model Frontier	Dec 9, 2024	Image Generationmodel	CodeCode Available	2
MobileOne: An Improved One millisecond Mobile Backbone	Jun 8, 2022	Efficient Neural NetworkGaze Estimation	CodeCode Available	2
ESOD: Efficient Small Object Detection on High-Resolution Images	Jul 23, 2024	GPUObject	CodeCode Available	2
MogaNet: Multi-order Gated Aggregation Network	Nov 7, 2022	3D Human Pose EstimationImage Classification	CodeCode Available	2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation	Nov 4, 2024	Earth ObservationObject	CodeCode Available	2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach	Aug 2, 2024	cross-modal alignmentMultiple Object Tracking	CodeCode Available	2
Exploring Orthogonality in Open World Object Detection	Jan 1, 2024	Incremental LearningObject	CodeCode Available	2
Accelerating DETR Convergence via Semantic-Aligned Matching	Mar 14, 2022	Objectobject-detection	CodeCode Available	2
FasterViT: Fast Vision Transformers with Hierarchical Attention	Jun 9, 2023	Image Classificationobject-detection	CodeCode Available	2
A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation	Sep 27, 2024	Exemplar-Free CountingFew-shot Object Counting and Detection	CodeCode Available	2
Fast Vision Transformers with HiLo Attention	May 26, 2022	BenchmarkingEfficient ViTs	CodeCode Available	2
Efficient Multi-Scale Attention Module with Cross-Spatial Learning	May 23, 2023	Dimensionality Reductionimage-classification	CodeCode Available	2
Fine-Grained Stochastic Architecture Search	Jun 17, 2020	Neural Architecture Searchobject-detection	CodeCode Available	2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation Models	Feb 7, 2024	Instance SegmentationObject	CodeCode Available	2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection	Aug 8, 2023	3D Object DetectionAutonomous Driving	CodeCode Available	2
UNetFormer: A UNet-like Transformer for Efficient Semantic Segmentation of Remote Sensing Urban Scene Imagery	Sep 18, 2021	Change DetectionDecoder	CodeCode Available	2
On the Arbitrary-Oriented Object Detection: Classification based Approaches Revisited	Mar 12, 2020	ClassificationGeneral Classification	CodeCode Available	2
Focal Sparse Convolutional Networks for 3D Object Detection	Apr 26, 2022	3D Object DetectionObject	CodeCode Available	2
Focusing on Tracks for Online Multi-Object Tracking	Jun 15, 2025	global-optimizationMulti-Object Tracking	CodeCode Available	2
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection	Mar 14, 2024	Autonomous DrivingObject	CodeCode Available	2
ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks	Oct 8, 2019	Dimensionality Reductionimage-classification	CodeCode Available	2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection	Mar 31, 2023	3D Object DetectionDepth Estimation	CodeCode Available	2
GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving	Nov 19, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer	Jun 3, 2024	3D Object DetectionImage-to-Image Translation	CodeCode Available	2
ParC-Net: Position Aware Circular Convolution with Merits from ConvNets and Transformer	Mar 8, 2022	Image Classificationobject-detection	CodeCode Available	2
DSVT: Dynamic Sparse Voxel Transformer with Rotated Sets	Jan 15, 2023	3D Object Detectionobject-detection	CodeCode Available	2
Generative Region-Language Pretraining for Open-Ended Object Detection	Mar 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Drones Help Drones: A Collaborative Framework for Multi-Drone Object Trajectory Prediction and Beyond	May 23, 2024	3D Object Detectionobject-detection	CodeCode Available	2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection	May 19, 2025	Event-based visionObject	CodeCode Available	2
A Simple Framework for 3D Occupancy Estimation in Autonomous Driving	Mar 17, 2023	3D Object Detection3D Reconstruction	CodeCode Available	2
Global Context Networks	Dec 24, 2020	Instance SegmentationObject Detection	CodeCode Available	2
GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Aug 15, 2024	Objectobject-detection	CodeCode Available	2
GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs	May 10, 2024	graph constructionimage-classification	CodeCode Available	2
A Simple Latent Diffusion Approach for Panoptic Segmentation and Mask Inpainting	Jan 18, 2024	Instance SegmentationInteractive Segmentation	CodeCode Available	2
GrootVL: Tree Topology is All You Need in State Space Model	Jun 4, 2024	Allimage-classification	CodeCode Available	2
EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications	Jun 21, 2022	Image ClassificationObject Detection	CodeCode Available	2
Evaluating Large-Vocabulary Object Detectors: The Devil is in the Details	Feb 1, 2021	Benchmarkingobject-detection	CodeCode Available	2
GroupViT: Semantic Segmentation Emerges from Text Supervision	Feb 22, 2022	Object DetectionScene Understanding	CodeCode Available	2
HASSOD: Hierarchical Adaptive Self-Supervised Object Detection	Feb 5, 2024	Objectobject-detection	CodeCode Available	2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future	Jul 18, 2023	Knowledge Distillationobject-detection	CodeCode Available	2
HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras	Apr 3, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
Hierarchical Open-vocabulary Universal Image Segmentation	Jul 3, 2023	Image ComprehensionImage Segmentation	CodeCode Available	2
Fully Sparse 3D Object Detection	Jul 20, 2022	3D Object DetectionAutonomous Driving	CodeCode Available	2
Hulk: A Universal Knowledge Translator for Human-Centric Tasks	Dec 4, 2023	3D Human Pose EstimationAction Recognition	CodeCode Available	2
Improving CLIP Fine-tuning Performance	Jan 1, 2023	Diagnosticobject-detection	CodeCode Available	2
Distance-IoU Loss: Faster and Better Learning for Bounding Box Regression	Nov 19, 2019	object-detectionObject Detection	CodeCode Available	2
Dilated Neighborhood Attention Transformer	Sep 29, 2022	Image ClassificationInstance Segmentation	CodeCode Available	2

Show:10 25 50

← PrevPage 5 of 220Next →

All datasets COCO test-dev COCO minival COCO-O COCO 2017 val PASCAL VOC 2007 COCO 2017 CrowdHuman (full body)CPPE-5 LVIS v1.0 val Manga109-s 15test PKU-DDD17-Car Waymo 2D detection all_ns f0val

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Co-DETR	box mAP	66	—	Unverified
2	InternImage-H (M3I Pre-training)	box mAP	65.5	—	Unverified
3	M3I Pre-training (InternImage-H)	box mAP	65.4	—	Unverified
4	MoCaE	box mAP	65.1	—	Unverified
5	Co-DETR (Swin-L)	box mAP	64.8	—	Unverified
6	Focal-Stable-DINO (Focal-Huge, no TTA)	box mAP	64.8	—	Unverified
7	EVA	box mAP	64.7	—	Unverified
8	Group DETR v2	box mAP	64.5	—	Unverified
9	FocalNet-H (DINO)	box mAP	64.4	—	Unverified
10	InternImage-XL	box mAP	64.3	—	Unverified