SOTAVerified

Object

Replace the cat with a British Shorthair cat of the breed with bulging yellow eyes

Papers

Showing 451–500 of 10696 papers

Title | Status | Hype
SMamba: Sparse Mamba for Event-based Object Detection | Code | 1
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Code | 1
Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Code | 1
3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Code | 1
Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Code | 1
GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Code | 1
Prior-free 3D Object Tracking | Code | 1
Interacted Object Grounding in Spatio-Temporal Human-Object Interactions | Code | 1
DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Code | 1
Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection | Code | 1
Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Code | 1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1
PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation | Code | 1
CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics | Code | 1
Differential Alignment for Domain Adaptive Object Detection | Code | 1
Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos | Code | 1
Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Code | 1
EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Code | 1
Referring Video Object Segmentation via Language-aligned Track Selection | Code | 1
Multi-Granularity Video Object Segmentation | Code | 1
Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion Models | Code | 1
GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Code | 1
SpotLight: Shadow-Guided Object Relighting via Diffusion | Code | 1
From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Code | 1
InTraGen: Trajectory-controlled Video Generation for Object Interactions | Code | 1
Towards RAW Object Detection in Diverse Conditions | Code | 1
LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Code | 1
Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Code | 1
OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Code | 1
Teaching VLMs to Localize Specific Objects from In-context Examples | Code | 1
Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Code | 1
PickScan: Object discovery and reconstruction from handheld interactions | Code | 1
Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Code | 1
3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Code | 1
Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Code | 1
Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Code | 1
Not Just Object, But State: Compositional Incremental Learning without Forgetting | Code | 1
Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention | Code | 1
PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Code | 1
You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Code | 1
Optimizing Edge Offloading Decisions for Object Detection | Code | 1
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Code | 1
OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Code | 1
DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Code | 1
TrackMe: A Simple and Effective Multiple Object Tracking Annotation Tool | Code | 1
MagicEraser: Erasing Any Objects via Semantics-Aware Control | Code | 1
LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Code | 1
Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Code | 1
Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI | Code | 1
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Code | 1
Page 10 of 214

No leaderboard results yet.