SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 301350 of 10696 papers

TitleStatusHype
4D-Bench: Benchmarking Multi-modal Large Language Models for 4D Object UnderstandingCode0
GOAL: Global-local Object Alignment LearningCode1
MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability0
RefCut: Interactive Segmentation with Reference Guidance0
Co-op: Correspondence-based Novel Object Pose Estimation0
Re-HOLD: Video Hand Object Interaction Reenactment via adaptive Layout-instructed Diffusion Model0
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail0
Which2comm: An Efficient Collaborative Perception Framework for 3D Object Detection0
GraPLUS: Graph-based Placement Using Semantics for Image Composition0
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance0
Fine-Grained Open-Vocabulary Object Detection with Fined-Grained Prompts: Task, Dataset and Benchmark0
xMOD: Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D motionCode0
Variational Message Passing-based Multiobject Tracking for MIMO-Radars using Raw Sensor Signals0
UltraFlwr -- An Efficient Federated Medical and Surgical Object Detection FrameworkCode1
Test-Time Backdoor Detection for Object Detection Models0
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose EstimationCode1
Volumetric Reconstruction From Partial Views for Task-Oriented Grasping0
Intelligent Spatial Perception by Building Hierarchical 3D Scene Graphs for Indoor Scenarios with the Help of LLMs0
Robust Object Detection of Underwater Robot based on Domain GeneralizationCode1
HSOD-BIT-V2: A New Challenging Benchmarkfor Hyperspectral Salient Object DetectionCode0
FrustumFusionNets: A Three-Dimensional Object Detection Network Based on Tractor Road Scene0
Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation0
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point CloudsCode0
MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning SegmentationCode1
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian SplattingCode2
LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data GenerationCode0
FLEX: A Framework for Learning Robot-Agnostic Force-based Skills Involving Sustained Contact Object Manipulation0
History-Aware Transformation of ReID Features for Multiple Object TrackingCode1
Leveraging Motion Information for Better Self-Supervised Video Correspondence Learning0
Cognitive Disentanglement for Referring Multi-Object Tracking0
MTV-Inpaint: Multi-Task Long Video Inpainting0
TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation0
Disentangled Object-Centric Image Representation for Robotic Manipulation0
MoEdit: On Learning Quantity Perception for Multi-object Image EditingCode0
3D Extended Object Tracking based on Extruded B-Spline Side View Profiles0
OmniSTVG: Toward Spatio-Temporal Omni-Object Video GroundingCode1
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language ModelsCode2
Semantic-Supervised Spatial-Temporal Fusion for LiDAR-based 3D Object Detection0
OCPM^2: Extending the Process Mining Methodology for Object-Centric Event Data Extraction0
Auto-Associative Memories for Direct Signalling of Visual Angle During Object Approaches0
DreamInsert: Zero-Shot Image-to-Video Object Insertion from A Single Image0
ROODI: Reconstructing Occluded Objects with Denoising Inpainters0
6D Object Pose Tracking in Internet Videos for Robotic Manipulation0
KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation0
Object-Aware DINO (Oh-A-Dino): Enhancing Self-Supervised Representations for Multi-Object Instance Retrieval0
GASPACHO: Gaussian Splatting for Controllable Humans and Objects0
InteractEdit: Zero-Shot Editing of Human-Object Interactions in Images0
2HandedAfforder: Learning Precise Actionable Bimanual Affordances from Human Videos0
TetraGrip: Sensor-Driven Multi-Suction Reactive Object Manipulation in Cluttered Scenes0
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method0
Show:102550
← PrevPage 7 of 214Next →

No leaderboard results yet.