SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 276300 of 10696 papers

TitleStatusHype
DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep AssociationCode2
Decoupling Features in Hierarchical Propagation for Video Object SegmentationCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
Autoregressive Visual TrackingCode2
3D Object Detection for Autonomous Driving: A Comprehensive SurveyCode2
Dense Distinct Query for End-to-End Object DetectionCode2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
DAVE -- A Detect-and-Verify Paradigm for Low-Shot CountingCode2
Cross-View Referring Multi-Object TrackingCode2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point CloudsCode2
DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on SimulationCode2
Cross Language Image Matching for Weakly Supervised Semantic SegmentationCode2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature FusionCode2
DQ-DETR: DETR with Dynamic Query for Tiny Object DetectionCode2
Contextual Object Detection with Multimodal Large Language ModelsCode2
Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic SegmentationCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
Dynamic Graph Induced Contour-aware Heat Conduction Network for Event-based Object DetectionCode2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object DetectionCode2
Complex-YOLO: Real-time 3D Object Detection on Point CloudsCode2
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box PromptsCode2
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local AttentionCode2
Context-Guided Spatio-Temporal Video GroundingCode2
Efficient Video Object Segmentation via Modulated Cross-Attention MemoryCode2
CORE4D: A 4D Human-Object-Human Interaction Dataset for Collaborative Object REarrangementCode2
Show:102550
← PrevPage 12 of 428Next →

No leaderboard results yet.