SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 426450 of 10696 papers

TitleStatusHype
UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection FrameworkCode1
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose EstimationCode1
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning SegmentationCode1
Robust Object Detection of Underwater Robot based on Domain GeneralizationCode1
History-Aware Transformation of ReID Features for Multiple Object TrackingCode1
OmniSTVG: Toward Spatio-Temporal Omni-Object Video GroundingCode1
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual LabelsCode1
SimROD: A Simple Baseline for Raw Object Detection with Global and Local EnhancementsCode1
A Data-Centric Revisit of Pre-Trained Vision Models for Robot LearningCode1
DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian SplattingCode1
Convex Hull-based Algebraic Constraint for Visual Quadric SLAMCode1
Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation LearningCode1
Mitigating Hallucinations in Large Vision-Language Models by Adaptively Constraining Information FlowCode1
Dynamic Markov Blanket Detection for Macroscopic Physics DiscoveryCode1
C-Drag: Chain-of-Thought Driven Motion Controller for Video GenerationCode1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object RepresentationCode1
Vector-Quantized Vision Foundation Models for Object-Centric LearningCode1
Ev-3DOD: Pushing the Temporal Boundaries of 3D Object Detection with Event CamerasCode1
Cross-domain Few-shot Object Detection with Multi-modal Textual EnrichmentCode1
Object-Centric Image to Video Generation with Language GuidanceCode1
DA-Mamba: Domain Adaptive Hybrid Mamba-Transformer Based One-Stage Object DetectionCode1
Knowing Your Target: Target-Aware Transformer Makes Better Spatio-Temporal Video GroundingCode1
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and PlanningCode1
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object CountingCode1
TUMTraffic-VideoQA: A Benchmark for Unified Spatio-Temporal Video Understanding in Traffic ScenesCode1
Show:102550
← PrevPage 18 of 428Next →

No leaderboard results yet.