SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 226250 of 10696 papers

TitleStatusHype
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAMCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
Context-Guided Spatio-Temporal Video GroundingCode2
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression SegmentationCode2
Exploring Orthogonality in Open World Object DetectionCode2
Point Segment and Count: A Generalized Framework for Object CountingCode2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to DescribeCode2
UniRef++: Segment Every Reference Object in Spatial and Temporal SpacesCode2
Prototype-based Cross-Modal Object TrackingCode2
VCoder: Versatile Vision Encoders for Multimodal Large Language ModelsCode2
UCMCTrack: Multi-Object Tracking with Uniform Camera Motion CompensationCode2
Chat-Scene: Bridging 3D Scene and Large Language Models with Object IdentifiersCode2
SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary ConstraintsCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
Gaussian Grouping: Segment and Edit Anything in 3D ScenesCode2
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion ModelsCode2
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from VideoCode2
A Graph-Based Approach for Category-Agnostic Pose EstimationCode2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive DecodingCode2
SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose EstimationCode2
Open-Vocabulary Camouflaged Object SegmentationCode2
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame InterpolationCode2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image AlignmentCode2
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object DetectionCode2
Show:102550
← PrevPage 10 of 428Next →

No leaderboard results yet.