SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 101150 of 10696 papers

TitleStatusHype
Analyzing and Boosting the Power of Fine-Grained Visual Recognition for Multi-modal Large Language ModelsCode2
One-shot 3D Object Canonicalization based on Geometric and Semantic ConsistencyCode2
RORem: Training a Robust Object Remover with Human-in-the-LoopCode2
YOLO-UniOW: Efficient Universal Open-World Object DetectionCode2
CGCOD: Class-Guided Camouflaged Object DetectionCode2
Cross-View Referring Multi-Object TrackingCode2
LeviTor: 3D Trajectory Oriented Image-to-Video SynthesisCode2
Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal PropertiesCode2
RelationField: Relate Anything in Radiance FieldsCode2
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic PromptCode2
DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-IdentificationCode2
RemDet: Rethinking Efficient Model Design for UAV Object DetectionCode2
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark DatasetCode2
DEYOLO: Dual-Feature-Enhancement YOLO for Cross-Modality Object DetectionCode2
SADG: Segment Any Dynamic Gaussian Without Object TrackersCode2
Lost & Found: Tracking Changes from Egocentric Observations in 3D Dynamic Scene GraphsCode2
DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image InpaintingCode2
Interpreting Object-level Foundation Models via Visual Precision SearchCode2
Open Vocabulary Monocular 3D Object DetectionCode2
EasyHOI: Unleashing the Power of Large Models for Reconstructing Hand-Object Interactions in the WildCode2
Find Any Part in 3DCode2
Token Merging for Training-Free Semantic Binding in Text-to-Image SynthesisCode2
3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object RearrangementCode2
Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation AdaptationCode2
MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error PriorsCode2
DI-MaskDINO: A Joint Object Detection and Instance Segmentation ModelCode2
Mitigating Object Hallucination via Concentric Causal AttentionCode2
VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual GroundingCode2
Open World Object Detection: A SurveyCode2
Multiview Scene GraphCode2
High-Precision Dichotomous Image Segmentation via Probing Diffusion CapacityCode2
Towards Interpreting Visual Information Processing in Vision-Language ModelsCode2
HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy ScenesCode2
One Token to Seg Them All: Language Instructed Reasoning Segmentation in VideosCode2
A Novel Unified Architecture for Low-Shot Counting by Detection and SegmentationCode2
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image GenerationCode2
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D ReconstructionCode2
Source-Free Domain Adaptation for YOLO Object DetectionCode2
Progressive Representation Learning for Real-Time UAV TrackingCode2
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking FrameworkCode2
Improving Text-guided Object Inpainting with Semantic Pre-inpaintingCode2
UniDet3D: Multi-dataset Indoor 3D Object DetectionCode2
UTrack: Multi-Object Tracking with Uncertain DetectionsCode2
Unleashing the Temporal-Spatial Reasoning Capacity of GPT for Training-Free Audio and Language Referenced Video Object SegmentationCode2
GOReloc: Graph-based Object-Level Relocalization for Visual SLAMCode2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic SegmentationCode2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion ApproachCode2
ESOD: Efficient Small Object Detection on High-Resolution ImagesCode2
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object DetectionCode2
Show:102550
← PrevPage 3 of 214Next →

No leaderboard results yet.