SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 251-300 of 10696 papers

Title | Status | Hype
Detect Everything with Few Examples | Code | 2
PointLLM: Empowering Large Language Models to Understand Point Clouds | Code | 2
InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion | Code | 2
DiffusionTrack: Diffusion Model For Multi-Object Tracking | Code | 2
SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos | Code | 2
MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Code | 2
YOLO-MS: Rethinking Multi-Scale Representation Learning for Real-time Object Detection | Code | 2
FocalFormer3D : Focusing on Hard Instance for 3D Object Detection | Code | 2
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control | Code | 2
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Code | 2
Tracking Anything in High Quality | Code | 2
COCO-O: A Benchmark for Object Detectors under Natural Distribution Shifts | Code | 2
CNOS: A Strong Baseline for CAD-based Novel Object Segmentation | Code | 2
DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Code | 2
RVT: Robotic View Transformer for 3D Object Manipulation | Code | 2
OpenMask3D: Open-Vocabulary 3D Instance Segmentation | Code | 2
SoftGPT: Learn Goal-oriented Soft Object Manipulation Skills by Generative Pre-trained Heterogeneous Graph Transformer | Code | 2
Aria Digital Twin: A New Benchmark Dataset for Egocentric 3D Machine Perception | Code | 2
DetZero: Rethinking Offboard 3D Object Detection with Long-term Sequential Point Clouds | Code | 2
SAM3D: Zero-Shot 3D Object Detection via Segment Anything Model | Code | 2
Multi-modal Queried Object Detection in the Wild | Code | 2
Contextual Object Detection with Multimodal Large Language Models | Code | 2
NeRO: Neural Geometry and BRDF Reconstruction of Reflective Objects from Multiview Images | Code | 2
DetGPT: Detect What You Need via Reasoning | Code | 2
Going Denser with Open-Vocabulary Part Segmentation | Code | 2
Evaluating Object Hallucination in Large Vision-Language Models | Code | 2
Video Object Segmentation in Panoptic Wild Scenes | Code | 2
PillarNeXt: Rethinking Network Designs for 3D Object Detection in LiDAR Point Clouds | Code | 2
SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | Code | 2
SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes | Code | 2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection | Code | 2
LayoutDiffusion: Controllable Diffusion Model for Layout-to-image Generation | Code | 2
PAIR-Diffusion: A Comprehensive Multimodal Object-Level Image Editor | Code | 2
NOPE: Novel Object Pose Estimation from a Single Image | Code | 2
Dense Distinct Query for End-to-End Object Detection | Code | 2
Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection | Code | 2
VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking | Code | 2
Large Selective Kernel Network for Remote Sensing Object Detection | Code | 2
InstMove: Instance Motion for Object-centric Video Segmentation | Code | 2
Virtual Sparse Convolution for Multimodal 3D Object Detection | Code | 2
Fusing Visual Appearance and Geometry for Multi-modality 6DoF Object Tracking | Code | 2
Efficient Teacher: Semi-Supervised Object Detection for YOLOv5 | Code | 2
EdgeYOLO: An Edge-Real-Time Object Detector | Code | 2
Standing Between Past and Future: Spatio-Temporal Modeling for Multi-Camera 3D Multi-Object Tracking | Code | 2
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes | Code | 2
vMAP: Vectorised Object Mapping for Neural Field SLAM | Code | 2
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation | Code | 2
PACO: Parts and Attributes of Common Objects | Code | 2
FocalFormer3D: Focusing on Hard Instance for 3D Object Detection | Code | 2
Autoregressive Visual Tracking | Code | 2

No leaderboard results yet.