SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 451475 of 10696 papers

TitleStatusHype
SMamba: Sparse Mamba for Event-based Object DetectionCode1
Learning Motion and Temporal Cues for Unsupervised Video Object SegmentationCode1
Toward Realistic Camouflaged Object Detection: Benchmarks and MethodCode1
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D ShapesCode1
Generalization-Enhanced Few-Shot Object Detection in Remote SensingCode1
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object DetectorCode1
Prior-free 3D Object TrackingCode1
Interacted Object Grounding in Spatio-Temporal Human-Object InteractionsCode1
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving ScenesCode1
Seamless Detection: Unifying Salient Object Detection and Camouflaged Object DetectionCode1
Affordance-Aware Object Insertion via Mask-Aware Dual DiffusionCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and GenerationCode1
Differential Alignment for Domain Adaptive Object DetectionCode1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal DynamicsCode1
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T VideosCode1
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object DetectionCode1
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based VisionCode1
Multi-Granularity Video Object SegmentationCode1
Referring Video Object Segmentation via Language-aligned Track SelectionCode1
Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion ModelsCode1
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance GroundingCode1
SpotLight: Shadow-Guided Object Relighting via DiffusionCode1
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel ObjectsCode1
InTraGen: Trajectory-controlled Video Generation for Object InteractionsCode1
Show:102550
← PrevPage 19 of 428Next →

No leaderboard results yet.