SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 27512800 of 10696 papers

TitleStatusHype
CQ-DINO: Mitigating Gradient Dilution via Category Queries for Vast Vocabulary Object DetectionCode0
Human-Object Interaction with Vision-Language Model Guided Relative Movement Dynamics0
Any6D: Model-free 6D Pose Estimation of Novel Objects0
Online 3D Scene Reconstruction Using Neural Object Priors0
OmnimatteZero: Training-free Real-time Omnimatte with Pre-trained Video Diffusion Models0
Shapley-Scarf Markets with Objective Indifferences0
An Image-like Diffusion Method for Human-Object Interaction Detection0
Decorum: A Language-Based Approach For Style-Conditioned Synthesis of Indoor 3D Scenes0
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability0
Co-op: Correspondence-based Novel Object Pose Estimation0
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object UnderstandingCode0
RefCut: Interactive Segmentation with Reference Guidance0
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail0
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model0
Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection0
GraPLUS: Graph-based Placement Using Semantics for Image Composition0
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance0
Variational Message Passing-based Multiobject Tracking for MIMO-Radars using Raw Sensor Signals0
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark0
Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs0
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motionCode0
Volumetric Reconstruction From Partial Views for Task-Oriented Grasping0
Test-Time Backdoor Detection for Object Detection Models0
HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object DetectionCode0
FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene0
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point CloudsCode0
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data GenerationCode0
Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation0
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation0
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning0
Cognitive Disentanglement for Referring Multi-Object Tracking0
Disentangled Object-Centric Image Representation for Robotic Manipulation0
MTV-Inpaint: Multi-Task Long Video Inpainting0
TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation0
Auto-Associative Memories for Direct Signalling of Visual Angle During Object Approaches0
6D Object Pose Tracking in Internet Videos for Robotic Manipulation0
3D Extended Object Tracking based on Extruded B-Spline Side View Profiles0
ROODI: Reconstructing Occluded Objects with Denoising Inpainters0
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image0
Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection0
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation0
OCPM^2: Extending the Process Mining Methodology for Object-Centric Event Data Extraction0
MoEdit: On Learning Quantity Perception for Multi-object Image EditingCode0
GASPACHO: Gaussian Splatting for Controllable Humans and Objects0
Object-Aware DINO (Oh-A-Dino): Enhancing Self-Supervised Representations for Multi-Object Instance Retrieval0
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images0
TetraGrip: Sensor-Driven Multi-Suction Reactive Object Manipulation in Cluttered Scenes0
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos0
ObjectMover: Generative Object Movement with Video Prior0
Seeing What's Not There: Spurious Correlation in Multimodal LLMs0
Show:102550
← PrevPage 56 of 214Next →

No leaderboard results yet.