SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 526550 of 10696 papers

TitleStatusHype
Object-Centric 2D Gaussian Splatting: Background Removal and Occlusion-Aware Pruning for Compact Object Models0
BlobGEN-Vid: Compositional Text-to-Video Generation with Blob Video Representations0
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics0
SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video EditingCode0
VDOR: A Video-based Dataset for Object Removal via Sequence Consistency0
VAGeo: View-specific Attention for Cross-View Object Geo-Localization0
Toward Realistic Camouflaged Object Detection: Benchmarks and MethodCode1
UnCommon Objects in 3DCode5
Guided SAM: Label-Efficient Part Segmentation0
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D ShapesCode1
Mamba-MOC: A Multicategory Remote Object Counting via State Space Model0
UniQ: Unified Decoder with Task-specific Queries for Efficient Scene Graph Generation0
From Simple to Complex Skills: The Case of In-Hand Object Reorientation0
Perception-as-Control: Fine-grained Controllable Image Animation with 3D-aware Motion Representation0
Improving Skeleton-based Action Recognition with Interactive Object InformationCode0
UPAQ: A Framework for Real-Time and Energy-Efficient 3D Object Detection in Autonomous Vehicles0
TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction ScenesCode0
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
Learning to Transfer Human Hand Skills for Robot Manipulations0
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features0
Strip R-CNN: Large Strip Convolution for Remote Sensing Object DetectionCode4
Universal Fine-grained Visual Categorization by Concept Guided LearningCode0
HOGSA: Bimanual Hand-Object Interaction Understanding with 3D Gaussian Splatting Based Data Augmentation0
Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation0
Human Gaze Boosts Object-Centered Representation Learning0
Show:102550
← PrevPage 22 of 428Next →

No leaderboard results yet.