SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 401-425 of 10696 papers

Title | Status | Hype
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1
Multiple Object Stitching for Unsupervised Representation Learning | Code | 1
LPOI: Listwise Preference Optimization for Vision Language Models | Code | 1
Locality-Aware Zero-Shot Human-Object Interaction Detection | Code | 1
ReaMOT: A Benchmark and Framework for Reasoning-based Multi-Object Tracking | Code | 1
Object-level Cross-view Geo-localization with Location Enhancement and Multi-Head Cross Attention | Code | 1
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Code | 1
Asynchronous Multi-Object Tracking with an Event Camera | Code | 1
A Simple Detector with Frame Dynamics is a Strong Tracker | Code | 1
CDFormer: Cross-Domain Few-Shot Object Detection Transformer Against Feature Confusion | Code | 1
LLM-Empowered Embodied Agent for Memory-Augmented Task Planning in Household Robotics | Code | 1
GrabS: Generative Embodied Agent for 3D Object Segmentation without Scene Supervision | Code | 1
MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model | Code | 1
Are We Done with Object-Centric Learning? | Code | 1
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation | Code | 1
v-CLR: View-Consistent Learning for Open-World Instance Segmentation | Code | 1
DASH: Detection and Assessment of Systematic Hallucinations of VLMs | Code | 1
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing | Code | 1
BOOTPLACE: Bootstrapped Object Placement with Detection Transformers | Code | 1
Learning Class Prototypes for Unified Sparse Supervised 3D Object Detection | Code | 1
DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Code | 1
CamSAM2: Segment Anything Accurately in Camouflaged Videos | Code | 1
Benchmarking Object Detectors under Real-World Distribution Shifts in Satellite Imagery | Code | 1
Global-Local Tree Search in VLMs for 3D Indoor Scene Generation | Code | 1
GOAL: Global-local Object Alignment Learning | Code | 1

No leaderboard results yet.