SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 126-150 of 10696 papers

| Title | Status | Hype |
| --- | --- | --- |
| Gaussian Grouping: Segment and Edit Anything in 3D Scenes | Code | 2 |
| Duoduo CLIP: Efficient 3D Understanding with Multi-View Images | Code | 2 |
| Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language Models | Code | 2 |
| DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries | Code | 2 |
| EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the Wild | Code | 2 |
| DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Code | 2 |
| DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting | Code | 2 |
| DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images | Code | 2 |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Code | 2 |
| EdgeYOLO: An Edge-Real-Time Object Detector | Code | 2 |
| Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation | Code | 2 |
| Generative Region-Language Pretraining for Open-Ended Object Detection | Code | 2 |
| DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object Detection | Code | 2 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Code | 2 |
| Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation | Code | 2 |
| DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Code | 2 |
| ALBench: A Framework for Evaluating Active Learning in Object Detection | Code | 2 |
| DexGraspNet: A Large-Scale Robotic Dexterous Grasp Dataset for General Objects Based on Simulation | Code | 2 |
| Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2 |
| DetGPT: Detect What You Need via Reasoning | Code | 2 |
| Detect Everything with Few Examples | Code | 2 |
| AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention | Code | 2 |
| DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous Convolution | Code | 2 |
| Deep Snake for Real-Time Instance Segmentation | Code | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Code | 2 |

No leaderboard results yet.