| SMamba: Sparse Mamba for Event-based Object Detection | Jan 21, 2025 | MambaObject | CodeCode Available | 1 |
| Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Jan 14, 2025 | Objectobject-detection | CodeCode Available | 1 |
| Toward Realistic Camouflaged Object Detection: Benchmarks and Method | Jan 13, 2025 | Instance SegmentationObject | CodeCode Available | 1 |
| 3DCoMPaT200: Language-Grounded Compositional Understanding of Parts and Materials of 3D Shapes | Jan 12, 2025 | NavigateObject | CodeCode Available | 1 |
| Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Jan 5, 2025 | Few-Shot LearningFew-Shot Object Detection | CodeCode Available | 1 |
| GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Jan 1, 2025 | 3D Object DetectionNeRF | CodeCode Available | 1 |
| Prior-free 3D Object Tracking | Jan 1, 2025 | 3D Object TrackingObject | CodeCode Available | 1 |
| Interacted Object Grounding in Spatio-Temporal Human-Object Interactions | Dec 27, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 |
| Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection | Dec 22, 2024 | DecoderObject | CodeCode Available | 1 |
| Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Dec 19, 2024 | Object | CodeCode Available | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Dec 18, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation | Dec 18, 2024 | Object | CodeCode Available | 1 |
| CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics | Dec 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Differential Alignment for Domain Adaptive Object Detection | Dec 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Heterogeneous Graph Transformer for Multiple Tiny Object Tracking in RGB-T Videos | Dec 14, 2024 | Multi-Object TrackingMultiple Object Tracking | CodeCode Available | 1 |
| Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection | Dec 6, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| EvRT-DETR: Latent Space Adaptation of Image Detectors for Event-based Vision | Dec 3, 2024 | Event-based visionEvent Detection | CodeCode Available | 1 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | ObjectSegmentation | CodeCode Available | 1 |
| Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion Models | Dec 1, 2024 | 6D Pose Estimation using RGBObject | CodeCode Available | 1 |
| GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Nov 29, 2024 | Collaborative InferenceObject | CodeCode Available | 1 |
| SpotLight: Shadow-Guided Object Relighting via Diffusion | Nov 27, 2024 | Image RelightingNeural Rendering | CodeCode Available | 1 |
| From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Nov 27, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | ObjectVideo Generation | CodeCode Available | 1 |
| Towards RAW Object Detection in Diverse Conditions | Nov 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 24, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching | Nov 24, 2024 | ObjectPose Estimation | CodeCode Available | 1 |
| OCDet: Object Center Detection via Bounding Box-Aware Heatmap Prediction on Edge Devices with NPUs | Nov 23, 2024 | Keypoint DetectionObject | CodeCode Available | 1 |
| Teaching VLMs to Localize Specific Objects from In-context Examples | Nov 20, 2024 | ObjectObject Tracking | CodeCode Available | 1 |
| Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning | Nov 18, 2024 | AttributeCompositional Zero-Shot Learning | CodeCode Available | 1 |
| PickScan: Object discovery and reconstruction from handheld interactions | Nov 17, 2024 | ObjectObject Discovery | CodeCode Available | 1 |
| Local-Global Attention: An Adaptive Mechanism for Multi-Scale Feature Integration | Nov 14, 2024 | Computational EfficiencyObject | CodeCode Available | 1 |
| 3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration | Nov 12, 2024 | ObjectPoint Cloud Registration | CodeCode Available | 1 |
| Large-scale Remote Sensing Image Target Recognition and Automatic Annotation | Nov 12, 2024 | Ensemble LearningObject | CodeCode Available | 1 |
| Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models | Nov 11, 2024 | Object | CodeCode Available | 1 |
| Not Just Object, But State: Compositional Incremental Learning without Forgetting | Nov 4, 2024 | DiversityIncremental Learning | CodeCode Available | 1 |
| Multi-Object 3D Grounding with Dynamic Modules and Language-Informed Spatial Attention | Oct 29, 2024 | Object | CodeCode Available | 1 |
| PK-YOLO: Pretrained Knowledge Guided YOLO for Brain Tumor Detection in Multiplanar MRI Slices | Oct 29, 2024 | Objectobject-detection | CodeCode Available | 1 |
| You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Oct 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Optimizing Edge Offloading Decisions for Object Detection | Oct 24, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Oct 23, 2024 | ObjectVisual Navigation | CodeCode Available | 1 |
| OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Oct 23, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Oct 23, 2024 | Image RestorationObject | CodeCode Available | 1 |
| TrackMe:A Simple and Effective Multiple Object Tracking Annotation Tool | Oct 20, 2024 | Multiple Object TrackingObject | CodeCode Available | 1 |
| MagicEraser: Erasing Any Objects via Semantics-Aware Control | Oct 14, 2024 | Image InpaintingObject | CodeCode Available | 1 |
| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Oct 7, 2024 | Object | CodeCode Available | 1 |
| Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI | Oct 6, 2024 | 3D ReconstructionObject | CodeCode Available | 1 |
| Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Oct 2, 2024 | 3D Multi-Object TrackingAutonomous Driving | CodeCode Available | 1 |