SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 18011850 of 10696 papers

TitleStatusHype
DetToolChain: A New Prompting Paradigm to Unleash Detection Ability of MLLMCode1
Boosting Zero-Shot Human-Object Interaction Detection with Vision-Language TransferCode0
R3DS: Reality-linked 3D Scenes for Panoramic Scene Understanding0
Prototipo de un Contador Bidireccional Automático de Personas basado en sensores de visión 3D0
FlexCap: Describe Anything in Images in Controllable Detail0
HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data0
Pedestrian Tracking with Monocular Camera using Unconstrained 3D Motion Model0
Prioritized Semantic Learning for Zero-shot Instance NavigationCode1
Video Object Segmentation with Dynamic Query ModulationCode1
Circle Representation for Medical Instance Object SegmentationCode0
Object Segmentation-Assisted Inter Prediction for Versatile Video Coding0
GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects0
THOR: Text to Human-Object Interaction Diffusion via Relation Intervention0
NetTrack: Tracking Highly Dynamic Objects with a NetCode2
CPA-Enhancer: Chain-of-Thought Prompted Adaptive Enhancer for Object Detection under Unknown DegradationsCode2
FORCE: Physics-aware Human-object Interaction0
Creating Seamless 3D Maps Using Radiance Fields0
GRA: Detecting Oriented Objects through Group-wise Rotating and Attention0
Unsupervised Collaborative Metric Learning with Mixed-Scale Groups for General Object RetrievalCode1
View-Centric Multi-Object Tracking with Homographic Matching in Moving UAV0
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation0
IMPRINT: Generative Object Compositing by Learning Identity-Preserving Representation0
GS-Pose: Generalizable Segmentation-based 6D Object Pose Estimation with 3D Gaussian Splatting0
Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning0
Magic Tokens: Select Diverse Tokens for Multi-modal Object Re-IdentificationCode2
Generative Region-Language Pretraining for Open-Ended Object DetectionCode2
Learning Physical Dynamics for Object-centric Visual Prediction0
Grasp Anything: Combining Teacher-Augmented Policy Gradient Learning with Instance Segmentation to Grasp Arbitrary Objects0
Attention-based Class-Conditioned Alignment for Multi-Source Domain Adaptation of Object DetectorsCode0
Right Place, Right Time! Dynamizing Topological Graphs for Embodied Navigation0
SHAN: Object-Level Privacy Detection via Inference on Scene Heterogeneous Graph0
Explorations in Texture LearningCode0
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
Griffon v2: Advancing Multimodal Perception with High-Resolution Scaling and Visual-Language Co-Referring0
Reconstruction and Simulation of Elastic Objects with Spring-Mass 3D Gaussians0
Rethinking Referring Object Removal0
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
OneTracker: Unifying Visual Object Tracking with Foundation Models and Efficient Tuning0
Improving Distant 3D Object Detection Using 2D Box Supervision0
PoIFusion: Multi-Modal 3D Object Detection via Fusion at Points of Interest0
FogGuard: guarding YOLO against fog using perceptual lossCode0
TFCounter:Polishing Gems for Training-Free Object Counting0
TaskCLIP: Extend Large Vision-Language Model for Task Oriented Object Detection0
DragAnything: Motion Control for Anything using Entity RepresentationCode7
Entropy is not Enough for Test-Time Adaptation: From the Perspective of Disentangled FactorsCode1
JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection0
FSC: Few-point Shape CompletionCode1
Learn and Search: An Elegant Technique for Object Lookup using Contrastive Learning0
Adaptive Bounding Box Uncertainties via Two-Step Conformal PredictionCode1
Category-Agnostic Pose Estimation for Point Clouds0
Show:102550
← PrevPage 37 of 214Next →

No leaderboard results yet.