| DiffPop: Plausibility-Guided Object Placement Diffusion for Image Composition | Jun 12, 2024 | Data AugmentationDenoising | —Unverified | 0 |
| Dataset Enhancement with Instance-Level Augmentations | Jun 12, 2024 | Data AugmentationObject | CodeCode Available | 1 |
| OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding | Jun 12, 2024 | 3D Scene ReconstructionNeRF | CodeCode Available | 1 |
| FaithFill: Faithful Inpainting for Object Completion Using a Single Reference Image | Jun 12, 2024 | Object | —Unverified | 0 |
| RMem: Restricted Memory Banks Improve Video Object Segmentation | Jun 12, 2024 | ObjectSemantic Segmentation | —Unverified | 0 |
| CUPID: Contextual Understanding of Prompt-conditioned Image Distributions | Jun 11, 2024 | Object | —Unverified | 0 |
| HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness | Jun 11, 2024 | ObjectVideo Editing | —Unverified | 0 |
| Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos | Jun 11, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Object-level Scene Deocclusion | Jun 11, 2024 | 3D Scene ReconstructionObject | —Unverified | 0 |
| Unsupervised Object Detection with Theoretical Guarantees | Jun 11, 2024 | DecoderObject | —Unverified | 0 |
| Neural Gaffer: Relighting Any Object via Diffusion | Jun 11, 2024 | Image RelightingObject | —Unverified | 0 |
| Identifiable Object-Centric Representation Learning via Probabilistic Slot Attention | Jun 11, 2024 | ObjectRepresentation Learning | CodeCode Available | 0 |
| UEMM-Air: A Synthetic Multi-modal Dataset for Unmanned Aerial Vehicle Object Detection | Jun 10, 2024 | Objectobject-detection | CodeCode Available | 1 |
| I-MPN: Inductive Message Passing Network for Efficient Human-in-the-Loop Annotation of Mobile Eye Tracking Data | Jun 10, 2024 | NavigateObject | —Unverified | 0 |
| IllumiNeRF: 3D Relighting Without Inverse Rendering | Jun 10, 2024 | Inverse RenderingNeRF | —Unverified | 0 |
| ControlLoc: Physical-World Hijacking Attack on Visual Perception in Autonomous Driving | Jun 9, 2024 | Autonomous DrivingMultiple Object Tracking | —Unverified | 0 |
| Utilizing Grounded SAM for self-supervised frugal camouflaged human detection | Jun 9, 2024 | Human DetectionObject | —Unverified | 0 |
| Mamba YOLO: A Simple Baseline for Object Detection with State Space Model | Jun 9, 2024 | GPUMamba | CodeCode Available | 4 |
| Ctrl-V: Higher Fidelity Video Generation with Bounding-Box Controlled Object Motion | Jun 9, 2024 | Autonomous DrivingObject | CodeCode Available | 1 |
| Training-Free Robust Interactive Video Object Segmentation | Jun 8, 2024 | Interactive Video Object SegmentationObject | —Unverified | 0 |
| Real-time object detection and tracking using flash LiDAR imagery | Jun 7, 2024 | 3D Object ClassificationObject | —Unverified | 0 |
| 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation | Jun 7, 2024 | ObjectSegmentation | —Unverified | 0 |
| UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping | Jun 7, 2024 | 3D Object DetectionManagement | —Unverified | 0 |
| IOR: Inversed Objects Replay for Incremental Object Detection | Jun 7, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| Bootstrapping Referring Multi-Object Tracking | Jun 7, 2024 | DiversityMulti-Object Tracking | CodeCode Available | 1 |
| GenHeld: Generating and Editing Handheld Objects | Jun 7, 2024 | Object | —Unverified | 0 |
| Multi-Granularity Language-Guided Multi-Object Tracking | Jun 7, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation | Jun 7, 2024 | Multi-Task LearningObject | —Unverified | 0 |
| Cut-and-Paste with Precision: a Content and Perspective-aware Data Augmentation for Road Damage Detection | Jun 6, 2024 | Data AugmentationObject | —Unverified | 0 |
| Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking | Jun 6, 2024 | 6D Pose Estimation using RGBBenchmarking | —Unverified | 0 |
| DeepRacer on Physical Track: Parameters Exploration and Performance Evaluation | Jun 6, 2024 | Object | —Unverified | 0 |
| The syntax-semantics interface in a child's path: A study of 3- to 11-year-olds' elicited production of Mandarin recursive relative clauses | Jun 6, 2024 | Language AcquisitionObject | —Unverified | 0 |
| Matching Anything by Segmenting Anything | Jun 6, 2024 | Domain GeneralizationMultiple Object Tracking | CodeCode Available | 5 |
| 3rd Place Solution for MOSE Track in CVPR 2024 PVUW workshop: Complex Video Object Segmentation | Jun 6, 2024 | ObjectPosition | —Unverified | 0 |
| Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices | Jun 5, 2024 | 6D Pose Estimation using RGBObject | —Unverified | 0 |
| OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understanding | Jun 4, 2024 | 3DGSObject | —Unverified | 0 |
| The Crystal Ball Hypothesis in diffusion models: Anticipating object positions from initial noise | Jun 4, 2024 | Image GenerationObject | —Unverified | 0 |
| Object Aware Egocentric Online Action Detection | Jun 3, 2024 | Action DetectionObject | —Unverified | 0 |
| Augmented Commonsense Knowledge for Remote Object Grounding | Jun 3, 2024 | Decision MakingObject | CodeCode Available | 0 |
| Multi-Object Tracking based on Imaging Radar 3D Object Detection | Jun 3, 2024 | 3D Object DetectionMulti-Object Tracking | —Unverified | 0 |
| Reproducibility Study on Adversarial Attacks Against Robust Transformer Trackers | Jun 3, 2024 | Adversarial RobustnessObject | CodeCode Available | 0 |
| ParallelEdits: Efficient Multi-object Image Editing | Jun 3, 2024 | AttributeImage Generation | —Unverified | 0 |
| SAM-LAD: Segment Anything Model Meets Zero-Shot Logic Anomaly Detection | Jun 2, 2024 | Anomaly DetectionDefect Detection | —Unverified | 0 |
| OLIVE: Object Level In-Context Visual Embeddings | Jun 2, 2024 | ObjectZero-shot Generalization | CodeCode Available | 0 |
| Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Jun 2, 2024 | 3D Object Detectioncross-modal alignment | CodeCode Available | 3 |
| Adversarial 3D Virtual Patches using Integrated Gradients | Jun 1, 2024 | Autonomous VehiclesObject | —Unverified | 0 |
| Towards Generalizable Multi-Object Tracking | Jun 1, 2024 | Domain GeneralizationMulti-Object Tracking | CodeCode Available | 1 |
| CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation | Jun 1, 2024 | 2D Pose EstimationAnimal Pose Estimation | CodeCode Available | 1 |
| Precision and Adaptability of YOLOv5 and YOLOv8 in Dynamic Robotic Environments | Jun 1, 2024 | Objectobject-detection | —Unverified | 0 |
| Learning Background Prompts to Discover Implicit Knowledge for Open Vocabulary Object Detection | Jun 1, 2024 | Knowledge DistillationObject | —Unverified | 0 |