| Power of Cooperative Supervision: Multiple Teachers Framework for Enhanced 3D Semi-Supervised Object Detection | May 31, 2024 | Objectobject-detection | CodeCode Available | 0 |
| IReNe: Instant Recoloring of Neural Radiance Fields | May 30, 2024 | NeRFNovel View Synthesis | —Unverified | 0 |
| RTGen: Generating Region-Text Pairs for Open-Vocabulary Object Detection | May 30, 2024 | Image CaptioningImage Inpainting | CodeCode Available | 1 |
| YotoR-You Only Transform One Representation | May 30, 2024 | Computational EfficiencyObject | —Unverified | 0 |
| Improving Object Detector Training on Synthetic Data by Starting With a Strong Baseline Methodology | May 30, 2024 | Data AugmentationObject | —Unverified | 0 |
| Hierarchical Object-Centric Learning with Capsule Networks | May 30, 2024 | Computational EfficiencyLung Nodule Segmentation | —Unverified | 0 |
| Vision-based Manipulation from Single Human Video with Open-World Object Graphs | May 30, 2024 | ObjectRobot Manipulation | —Unverified | 0 |
| Lifelong Learning Using a Dynamically Growing Tree of Sub-networks for Domain Generalization in Video Object Segmentation | May 29, 2024 | Domain GeneralizationLifelong learning | —Unverified | 0 |
| RGB-T Object Detection via Group Shuffled Multi-receptive Attention and Multi-modal Supervision | May 29, 2024 | Multispectral Object DetectionObject | —Unverified | 0 |
| Model Agnostic Defense against Adversarial Patch Attacks on Object Detection in Unmanned Aerial Vehicles | May 29, 2024 | Objectobject-detection | —Unverified | 0 |
| SSGA-Net: Stepwise Spatial Global-local Aggregation Networks for for Autonomous Driving | May 29, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| FocSAM: Delving Deeply into Focused Objects in Segmenting Anything | May 29, 2024 | DecoderInteractive Segmentation | CodeCode Available | 1 |
| Data-augmented phrase-level alignment for mitigating object hallucination | May 28, 2024 | Data AugmentationHallucination | —Unverified | 0 |
| Track Initialization and Re-Identification for~3D Multi-View Multi-Object Tracking | May 28, 2024 | 3D Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 1 |
| REPARO: Compositional 3D Assets Generation with Differentiable 3D Layout Alignment | May 28, 2024 | Image to 3DObject | CodeCode Available | 2 |
| Don't Miss the Forest for the Trees: Attentional Vision Calibration for Large Vision Language Models | May 28, 2024 | MMEObject | —Unverified | 0 |
| Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation | May 28, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Reliable Object Tracking by Multimodal Hybrid Feature Extraction and Transformer-Based Fusion | May 28, 2024 | ObjectObject Tracking | CodeCode Available | 0 |
| Intent3D: 3D Object Detection in RGB-D Scans Based on Human Intention | May 28, 2024 | 3D Object Detection3D visual grounding | —Unverified | 0 |
| Recurrent Complex-Weighted Autoencoders for Unsupervised Object Discovery | May 27, 2024 | ObjectObject Discovery | CodeCode Available | 0 |
| Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association | May 27, 2024 | Objectobject-detection | —Unverified | 0 |
| VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models | May 27, 2024 | Object | CodeCode Available | 2 |
| Competing for pixels: a self-play algorithm for weakly-supervised segmentation | May 26, 2024 | Binary ClassificationImage Segmentation | CodeCode Available | 0 |
| Planning Robot Placement for Object Grasping | May 26, 2024 | Object | —Unverified | 0 |
| DiffuBox: Refining 3D Object Detection with Point Diffusion | May 25, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| GreenCOD: A Green Camouflaged Object Detection Method | May 25, 2024 | Objectobject-detection | —Unverified | 0 |
| REACT: Real-time Efficiency and Accuracy Compromise for Tradeoffs in Scene Graph Generation | May 25, 2024 | Graph GenerationObject | CodeCode Available | 2 |
| Brain3D: Generating 3D Objects from fMRI | May 24, 2024 | 3D Generation3D Scene Reconstruction | CodeCode Available | 0 |
| Unbiased Faster R-CNN for Single-source Domain Generalized Object Detection | May 24, 2024 | AttributeData Augmentation | —Unverified | 0 |
| Transparent Object Depth Completion | May 24, 2024 | Depth CompletionDepth Estimation | —Unverified | 0 |
| Leveraging Unknown Objects to Construct Labeled-Unlabeled Meta-Relationships for Zero-Shot Object Navigation | May 24, 2024 | Object | —Unverified | 0 |
| ODGEN: Domain-specific Object Detection Data Generation with Diffusion Models | May 24, 2024 | Objectobject-detection | —Unverified | 0 |
| Multimodal Object Detection via Probabilistic a priori Information Integration | May 24, 2024 | Objectobject-detection | CodeCode Available | 0 |
| CoHD: A Counting-Aware Hierarchical Decoding Framework for Generalized Referring Expression Segmentation | May 24, 2024 | Generalized Referring Expression SegmentationObject | CodeCode Available | 1 |
| Balanced ID-OOD tradeoff transfer makes query based detectors good few shot learners | May 23, 2024 | Cross-Domain Few-Shot Object DetectionFew-Shot Object Detection | —Unverified | 0 |
| Variational Signal Separation for Automotive Radar Interference Mitigation | May 23, 2024 | Objectparameter estimation | —Unverified | 0 |
| YOLOv10: Real-Time End-to-End Object Detection | May 23, 2024 | 2D Object DetectionData Augmentation | CodeCode Available | 11 |
| PuTR: A Pure Transformer for Decoupled and Online Multi-Object Tracking | May 23, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Improving Single Domain-Generalized Object Detection: A Focus on Diversification and Alignment | May 23, 2024 | Decision MakingDomain Generalization | CodeCode Available | 1 |
| MOD-UV: Learning Mobile Object Detectors from Unlabeled Videos | May 23, 2024 | Motion SegmentationObject | CodeCode Available | 1 |
| Designing A Sustainable Marine Debris Clean-up Framework without Human Labels | May 23, 2024 | ClassificationLanguage Modelling | CodeCode Available | 0 |
| Awesome Multi-modal Object Tracking | May 23, 2024 | Autonomous DrivingKnowledge Distillation | CodeCode Available | 5 |
| EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views | May 22, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| One-shot Training for Video Object Segmentation | May 22, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| Collaboration of Teachers for Semi-supervised Object Detection | May 22, 2024 | Objectobject-detection | —Unverified | 0 |
| Anticipating Object State Changes in Long Procedural Videos | May 21, 2024 | ObjectObject State Change Classification | —Unverified | 0 |
| Active Object Detection with Knowledge Aggregation and Distillation from Large Models | May 21, 2024 | Active Object DetectionDecision Making | CodeCode Available | 0 |
| FFAM: Feature Factorization Activation Map for Explanation of 3D Detectors | May 21, 2024 | 3D Object DetectionObject | CodeCode Available | 0 |
| BiomedParse: a biomedical foundation model for image parsing of everything everywhere all at once | May 21, 2024 | AllImage Segmentation | —Unverified | 0 |
| Learning Causal Dynamics Models in Object-Oriented Environments | May 21, 2024 | Causal DiscoveryComputational Efficiency | CodeCode Available | 0 |