SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 201250 of 10696 papers

TitleStatusHype
E2E-MFD: Towards End-to-End Synchronous Multimodal Fusion DetectionCode2
LISO: Lidar-only Self-Supervised 3D Object DetectionCode2
Poly Kernel Inception Network for Remote Sensing DetectionCode2
SAFDNet: A Simple and Effective Network for Fully Sparse 3D Object DetectionCode2
Beyond MOT: Semantic Multi-Object TrackingCode2
VastTrack: Vast Category Visual Object TrackingCode2
EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic SegmentationCode2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast DecodingCode2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature FusionCode2
FusionVision: A comprehensive approach of 3D object reconstruction and segmentation from RGB-D cameras using YOLO and fast segment anythingCode2
HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance FieldsCode2
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure PriorCode2
VOOM: Robust Visual Object Odometry and Mapping using Hierarchical LandmarksCode2
Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set RelationshipsCode2
CoLLaVO: Crayon Large Language and Vision mOdelCode2
FM-Fusion: Instance-aware Semantic Mapping Boosted by Vision-Language Foundation ModelsCode2
YOLOPoint Joint Keypoint and Object DetectionCode2
Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object DetectorCode2
HASSOD: Hierarchical Adaptive Self-Supervised Object DetectionCode2
MF-MOS: A Motion-Focused Model for Moving Object SegmentationCode2
Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion ModelsCode2
Removal then Selection: A Coarse-to-Fine Fusion Perspective for RGB-Infrared Object DetectionCode2
Efficient4D: Fast Dynamic 3D Object Generation from a Single-view VideoCode2
OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box PromptsCode2
Fine-Grained Prototypes Distillation for Few-Shot Object DetectionCode2
RoboFusion: Towards Robust Multi-Modal 3D Object Detection via SAMCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
Context-Guided Spatio-Temporal Video GroundingCode2
Unveiling Parts Beyond Objects: Towards Finer-Granularity Referring Expression SegmentationCode2
Exploring Orthogonality in Open World Object DetectionCode2
Point Segment and Count: A Generalized Framework for Object CountingCode2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to DescribeCode2
UniRef++: Segment Every Reference Object in Spatial and Temporal SpacesCode2
Prototype-based Cross-Modal Object TrackingCode2
VCoder: Versatile Vision Encoders for Multimodal Large Language ModelsCode2
UCMCTrack: Multi-Object Tracking with Uniform Camera Motion CompensationCode2
Chat-Scene: Bridging 3D Scene and Large Language Models with Object IdentifiersCode2
SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary ConstraintsCode2
Aligning and Prompting Everything All at Once for Universal Visual PerceptionCode2
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
Gaussian Grouping: Segment and Edit Anything in 3D ScenesCode2
TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion ModelsCode2
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from VideoCode2
A Graph-Based Approach for Category-Agnostic Pose EstimationCode2
Mitigating Object Hallucinations in Large Vision-Language Models through Visual Contrastive DecodingCode2
SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose EstimationCode2
Open-Vocabulary Camouflaged Object SegmentationCode2
Clearer Frames, Anytime: Resolving Velocity Ambiguity in Video Frame InterpolationCode2
GenEval: An Object-Focused Framework for Evaluating Text-to-Image AlignmentCode2
CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object DetectionCode2
Show:102550
← PrevPage 5 of 214Next →

No leaderboard results yet.