SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 28012825 of 10696 papers

TitleStatusHype
Seeing What's Not There: Spurious Correlation in Multimodal LLMs0
Bring Remote Sensing Object Detect Into Nature Language Model: Using SFT Method0
Hierarchical Contact-Rich Trajectory Optimization for Multi-Modal Manipulation using Tight Convex Relaxations0
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting0
Recovering Partially Corrupted Major Objects through Tri-modality Based Image Completion0
Multi-Modal 3D Mesh Reconstruction from Images and Text0
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization0
Erase Diffusion: Empowering Object Removal Through Calibrating Diffusion Pathways0
EAZY: Eliminating Hallucinations in LVLMs by Zeroing out Hallucinatory Image Tokens0
Large model enhanced computational ghost imagingCode0
Hierarchical Cross-Modal Alignment for Open-Vocabulary 3D Object Detection0
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection0
Aligning Instance-Semantic Sparse Representation towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives0
A Light Perspective for 3D Object Detection0
OV-SCAN: Semantically Consistent Alignment for Novel Object Discovery in Open-Vocabulary 3D Object Detection0
AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation0
D3DR: Lighting-Aware Object Insertion in Gaussian Splatting0
OpenRSD: Towards Open-prompts for Object Detection in Remote Sensing Images0
Object-Centric World Model for Language-Guided Manipulation0
Accurate and Efficient Two-Stage Gun Detection in Video0
OSCAR: Object Status and Contextual Awareness for Recipes to Support Non-Visual Cooking0
2D Object Detection: A Survey0
DecoupledGaussian: Object-Scene Decoupling for Physics-Based Interaction0
Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models0
Shaken, Not Stirred: A Novel Dataset for Visual Understanding of Glasses in Human-Robot Bartending Tasks0
Show:102550
← PrevPage 113 of 428Next →

No leaderboard results yet.