SOTAVerified

Object

Replace the cat with a British Shorthair cat with bulging yellow eyes

Papers

Showing 276-300 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| 3D Object Detection for Autonomous Driving: A Comprehensive Survey | Code | 2 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Code | 2 |
| Deep Snake for Real-Time Instance Segmentation | Code | 2 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Code | 2 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Code | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Code | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Code | 2 |
| AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2 |
| DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Code | 2 |
| Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Code | 2 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Code | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Code | 2 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Code | 2 |
| CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown Degradations | Code | 2 |
| DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Code | 2 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Code | 2 |
| Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Code | 2 |
| Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object Detection | Code | 2 |
| CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangement | Code | 2 |
| Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector | Code | 2 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Code | 2 |
| Contextual Object Detection with Multimodal Large Language Models | Code | 2 |
| E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion Detection | Code | 2 |
| Context-Guided Spatio-Temporal Video Grounding | Code | 2 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Code | 2 |

No leaderboard results yet.