SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 2701–2725 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| POEM: Precise Object-level Editing via MLLM control | | 0 |
| How Can Objects Help Video-Language Understanding? | | 0 |
| Marmot: Multi-Agent Reasoning for Multi-Object Self-Correcting in Improving Image-Text Alignment | | 0 |
| Compass Control: Multi Object Orientation Control for Text-to-Image Generation | | 0 |
| Glossy Object Reconstruction with Cost-effective Polarized Acquisition | | 0 |
| DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates | | 0 |
| MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking | Code | 0 |
| Better Decisions through the Right Causal World Model | | 0 |
| A Self-Supervised Framework for Space Object Behaviour Characterisation | | 0 |
| D-Feat Occlusions: Diffusion Features for Robustness to Partial Visual Occlusions in Object Recognition | | 0 |
| PRIMEDrive-CoT: A Precognitive Chain-of-Thought Framework for Uncertainty-Aware Object Interaction in Driving Scene Scenario | | 0 |
| Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions | | 0 |
| EMF: Event Meta Formers for Event-based Real-time Traffic Object Detection | | 0 |
| CornerPoint3D: Look at the Nearest Corner Instead of the Center | | 0 |
| RASP: Revisiting 3D Anamorphic Art for Shadow-Guided Packing of Irregular Objects | | 0 |
| Deep Reinforcement Learning via Object-Centric Attention | Code | 0 |
| COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking | | 0 |
| A Diffusion-Based Framework for Occluded Object Movement | | 0 |
| Slot-Level Robotic Placement via Visual Imitation from Single Human Video | | 0 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Code | 0 |
| TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication | | 0 |
| Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | | 0 |
| MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing | Code | 0 |
| Detail-aware multi-view stereo network for depth estimation | Code | 0 |
| Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts | | 0 |
Page 109 of 428

No leaderboard results yet.