SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 150 of 10696 papers

TitleStatusHype
YOLOv10: Real-Time End-to-End Object DetectionCode11
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
TripoSR: Fast 3D Object Reconstruction from a Single ImageCode9
DETRs Beat YOLOs on Real-time Object DetectionCode8
YOLOv12: Attention-Centric Real-Time Object DetectorsCode7
Visual-RFT: Visual Reinforcement Fine-TuningCode7
Efficient Track AnythingCode7
T-Rex2: Towards Generic Object Detection via Text-Visual Prompt SynergyCode7
DragAnything: Motion Control for Anything using Entity RepresentationCode7
YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectorsCode7
UnCommon Objects in 3DCode5
Matching Anything by Segmenting AnythingCode5
Slicing Aided Hyper Inference and Fine-tuning for Small Object DetectionCode5
GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian SplattingCode5
Awesome Multi-modal Object TrackingCode5
RealFusion: 360° Reconstruction of Any Object from a Single ImageCode5
DINO-X: A Unified Vision Model for Open-World Object Detection and UnderstandingCode5
Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian SplattingCode5
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and VideosCode5
Underwater Camouflaged Object Tracking Meets Vision-Language SAM2Code5
PAIR Diffusion: A Comprehensive Multimodal Object-Level Image EditorCode4
DiffusionDet: Diffusion Model for Object DetectionCode4
Mamba YOLO: A Simple Baseline for Object Detection with State Space ModelCode4
GCoNet+: A Stronger Group Collaborative Co-Salient Object DetectorCode4
Transformer for Object Re-Identification: A SurveyCode4
Strip R-CNN: Large Strip Convolution for Remote Sensing Object DetectionCode4
Efficient Part-level 3D Object Generation via Dual Volume PackingCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
SiamMask: A Framework for Fast Online Object Tracking and SegmentationCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
AnyDoor: Zero-shot Object-level Image CustomizationCode4
RUMI: Rummaging Using Mutual InformationCode4
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory TreeCode4
RTMDet: An Empirical Study of Designing Real-Time Object DetectorsCode4
SARDet-100K: Towards Open-Source Benchmark and ToolKit for Large-Scale SAR Object DetectionCode4
OmDet: Large-scale vision-language multi-dataset pre-training with multimodal detection networkCode3
Observation-Centric SORT: Rethinking SORT for Robust Multi-Object TrackingCode3
MureObjectStitch: Multi-reference Image CompositionCode3
Multiple Object Tracking as ID PredictionCode3
NeROIC: Neural Rendering of Objects from Online Image CollectionsCode3
Panacea+: Panoramic and Controllable Video Generation for Autonomous DrivingCode3
Motion Representations for Articulated AnimationCode3
MotionCtrl: A Unified and Flexible Motion Controller for Video GenerationCode3
Moving Object Segmentation: All You Need Is SAM (and Flow)Code3
MagicDrive: Street View Generation with Diverse 3D Geometry ControlCode3
BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown ObjectsCode3
InterMimic: Towards Universal Whole-Body Control for Physics-Based Human-Object InteractionsCode3
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing CommunityCode3
MTP: Advancing Remote Sensing Foundation Model via Multi-Task PretrainingCode3
PETR: Position Embedding Transformation for Multi-View 3D Object DetectionCode3
Show:102550
← PrevPage 1 of 214Next →

No leaderboard results yet.