SOTAVerified

Object

Replace the cat with a British Shorthair cat with bulging yellow eyes

Papers

Showing 251–275 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| v-CLR: View-Consistent Learning for Open-World Instance Segmentation | Code | 1 |
| Slot-Level Robotic Placement via Visual Imitation from Single Human Video | | 0 |
| COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking | | 0 |
| Deep LG-Track: An Enhanced Localization-Confidence-Guided Multi-Object Tracker | | 0 |
| Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities | Code | 0 |
| Detail-aware multi-view stereo network for depth estimation | Code | 0 |
| MB-ORES: A Multi-Branch Object Reasoner for Visual Grounding in Remote Sensing | Code | 0 |
| EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Code | 1 |
| Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts | | 0 |
| ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025 | Code | 0 |
| DASH: Detection and Assessment of Systematic Hallucinations of VLMs | Code | 1 |
| Object Isolated Attention for Consistent Story Visualization | | 0 |
| Context in object detection: a systematic literature review | | 0 |
| Efficient Explicit Joint-level Interaction Modeling with Mamba for Text-guided HOI Generation | Code | 0 |
| The Marine Debris Forward-Looking Sonar Datasets | | 0 |
| Hyperspectral Adapter for Object Tracking based on Hyperspectral Video | | 0 |
| SIGHT: Single-Image Conditioned Generation of Hand Trajectories for Hand-Object Interaction | | 0 |
| VisTa: Visual-contextual and Text-augmented Zero-shot Object-level OOD Detection | | 0 |
| SemAlign3D: Semantic Correspondence between RGB-Images through Aligning 3D Object-Class Representations | | 0 |
| ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection | | 0 |
| TranSplat: Lighting-Consistent Cross-Scene Object Transfer with 3D Gaussian Splatting | | 0 |
| Segment then Splat: A Unified Approach for 3D Open-Vocabulary Segmentation based on Gaussian Splatting | | 0 |
| RUNA: Object-level Out-of-Distribution Detection via Regional Uncertainty Alignment of Multimodal Representations | | 0 |
| AGILE: A Diffusion-Based Attention-Guided Image and Label Translation for Efficient Cross-Domain Plant Trait Identification | Code | 0 |
| BOOTPLACE: Bootstrapped Object Placement with Detection Transformers | Code | 1 |

No leaderboard results yet.