SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 201250 of 10696 papers

TitleStatusHype
4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language ModelsCode2
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast DecodingCode2
High-Precision Dichotomous Image Segmentation via Probing Diffusion CapacityCode2
HIT-UAV: A high-altitude infrared thermal dataset for Unmanned Aerial Vehicle-based object detectionCode2
HOLD: Category-agnostic 3D Reconstruction of Interacting Hands and Objects from VideoCode2
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
InstructLayout: Instruction-Driven 2D and 3D Layout Synthesis with Semantic Graph PriorCode2
InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object RecognitionCode2
Duoduo CLIP: Efficient 3D Understanding with Multi-View ImagesCode2
InterFusion: Text-Driven Generation of 3D Human-Object InteractionCode2
Fully Sparse 3D Object DetectionCode2
Is CLIP the main roadblock for fine-grained open-world perception?Code2
Knowledge Distillation in YOLOX-ViT for Side-Scan Sonar Object DetectionCode2
K-Radar: 4D Radar Object Detection for Autonomous Driving in Various Weather ConditionsCode2
BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-trainingCode2
Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic GraspingCode2
LeviTor: 3D Trajectory Oriented Image-to-Video SynthesisCode2
LeYOLO, New Scalable and Efficient CNN Architecture for Object DetectionCode2
Detect Everything with Few ExamplesCode2
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real WorldCode2
Beyond MOT: Semantic Multi-Object TrackingCode2
DetectoRS: Detecting Objects with Recursive Feature Pyramid and Switchable Atrous ConvolutionCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object TrackingCode2
Meta-DETR: Image-Level Few-Shot Detection with Inter-Class Correlation ExploitationCode2
3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object RearrangementCode2
MG-LLaVA: Towards Multi-Granularity Visual Instruction TuningCode2
Mitigating Object Hallucination via Concentric Causal AttentionCode2
MonoCD: Monocular 3D Object Detection with Complementary DepthsCode2
Deep Snake for Real-Time Instance SegmentationCode2
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
Dense Distinct Query for End-to-End Object DetectionCode2
DeepInteraction: 3D Object Detection via Modality InteractionCode2
MS-DETR: Efficient DETR Training with Mixed SupervisionCode2
DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep AssociationCode2
DAVE -- A Detect-and-Verify Paradigm for Low-Shot CountingCode2
Multi-modal Queried Object Detection in the WildCode2
BOP Challenge 2020 on 6D Object LocalizationCode2
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature FusionCode2
NetTrack: Tracking Highly Dynamic Objects with a NetCode2
Decoupling Features in Hierarchical Propagation for Video Object SegmentationCode2
DetGPT: Detect What You Need via ReasoningCode2
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and ResultsCode2
Segmentation Transformer: Object-Contextual Representations for Semantic SegmentationCode2
AccDiffusion: An Accurate Method for Higher-Resolution Image GenerationCode2
Boundary-Aware Segmentation Network for Mobile and Web ApplicationsCode2
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object ManipulationCode2
Cross-View Referring Multi-Object TrackingCode2
ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to DescribeCode2
Show:102550
← PrevPage 5 of 214Next →

No leaderboard results yet.