SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 301–350 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| DetGPT: Detect What You Need via Reasoning | Code | 2 |
| Fine-Grained Prototypes Distillation for Few-Shot Object Detection | Code | 2 |
| Focal Loss for Dense Object Detection | Code | 2 |
| Focal Sparse Convolutional Networks for 3D Object Detection | Code | 2 |
| Fully Sparse 3D Object Detection | Code | 2 |
| Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking | Code | 2 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Code | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Code | 2 |
| Gen6D: Generalizable Model-Free 6-DoF Object Pose Estimation from RGB Images | Code | 2 |
| Detect Everything with Few Examples | Code | 2 |
| DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation | Code | 2 |
| Global Tracking Transformers | Code | 2 |
| GOReloc: Graph-based Object-Level Relocalization for Visual SLAM | Code | 2 |
| Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior | Code | 2 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Code | 2 |
| HASSOD: Hierarchical Adaptive Self-Supervised Object Detection | Code | 2 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Code | 2 |
| Boundary-Aware Segmentation Network for Mobile and Web Applications | Code | 2 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Code | 2 |
| InstMove: Instance Motion for Object-centric Video Segmentation | Code | 2 |
| InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition | Code | 2 |
| InteractVLM: 3D Interaction Reasoning from 2D Foundational Models | Code | 2 |
| Deep Snake for Real-Time Instance Segmentation | Code | 2 |
| Interpreting Object-level Foundation Models via Visual Precision Search | Code | 2 |
| Articulated Object Interaction in Unknown Scenes with Whole-Body Mobile Manipulation | Code | 2 |
| Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Code | 2 |
| DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association | Code | 2 |
| Large Selective Kernel Network for Remote Sensing Object Detection | Code | 2 |
| Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection | Code | 2 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Code | 2 |
| Decoupling Features in Hierarchical Propagation for Video Object Segmentation | Code | 2 |
| DeepInteraction: 3D Object Detection via Modality Interaction | Code | 2 |
| LiDAR Snowfall Simulation for Robust 3D Object Detection | Code | 2 |
| LISO: Lidar-only Self-Supervised 3D Object Detection | Code | 2 |
| Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-Identification | Code | 2 |
| Make It Count: Text-to-Image Generation with an Accurate Number of Objects | Code | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Code | 2 |
| DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion | Code | 2 |
| MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Code | 2 |
| A Novel Unified Architecture for Low-Shot Counting by Detection and Segmentation | Code | 2 |
| AdaMixer: A Fast-Converging Query-Based Object Detector | Code | 2 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Code | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Code | 2 |
| MonoCD: Monocular 3D Object Detection with Complementary Depths | Code | 2 |
| MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection | Code | 2 |
| Cross-View Referring Multi-Object Tracking | Code | 2 |
| DAVE -- A Detect-and-Verify Paradigm for Low-Shot Counting | Code | 2 |
| Dense Distinct Query for End-to-End Object Detection | Code | 2 |
| Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation | Code | 2 |
| Contextual Object Detection with Multimodal Large Language Models | Code | 2 |
Page 7 of 214

No leaderboard results yet.