| BYE: Build Your Encoder with One Sequence of Exploration Data for Long-Term Dynamic Scene Understanding | Dec 3, 2024 | Motion Estimation, Object | Unverified | 0 |
| Object Agnostic 3D Lifting in Space and Time | Dec 2, 2024 | Object | Unverified | 0 |
| A2VIS: Amodal-Aware Approach to Video Instance Segmentation | Dec 2, 2024 | Instance Segmentation, Multiple Object Tracking | Unverified | 0 |
| 6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting | Dec 2, 2024 | 3D Object Reconstruction, 6D Pose Estimation using RGB | Unverified | 0 |
| Object Tracking in a 360° View: A Novel Perspective on Bridging the Gap to Biomedical Advancements | Dec 2, 2024 | Autonomous Vehicles, Object | Unverified | 0 |
| Referring Video Object Segmentation via Language-aligned Track Selection | Dec 2, 2024 | Object, Object Tracking | Code Available | 1 |
| MFTF: Mask-free Training-free Object Level Layout Control Diffusion Model | Dec 2, 2024 | Denoising, Image Generation | Code Available | 0 |
| Hierarchical Object-Oriented POMDP Planning for Object Rearrangement | Dec 2, 2024 | Object, Object Rearrangement | Unverified | 0 |
| Identifying Reliable Predictions in Detection Transformers | Dec 2, 2024 | Object, Uncertainty Quantification | Unverified | 0 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | Object, Segmentation | Code Available | 1 |
| Explaining Object Detectors via Collective Contribution of Pixels | Dec 1, 2024 | Object | Unverified | 0 |
| MCBLT: Multi-Camera Multi-Object 3D Tracking in Long Videos | Dec 1, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion Models | Dec 1, 2024 | 6D Pose Estimation using RGB, Object | Code Available | 1 |
| LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes | Nov 30, 2024 | Autonomous Driving, counterfactual | Unverified | 0 |
| Motion Modes: What Could Happen Next? | Nov 29, 2024 | Diversity, Object | Unverified | 0 |
| One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering | Nov 29, 2024 | Benchmarking, Object | Unverified | 0 |
| Feedback-driven object detection and iterative model improvement | Nov 29, 2024 | Object, object-detection | Code Available | 0 |
| QUOTA: Quantifying Objects with Text-to-Image Models for Any Domain | Nov 29, 2024 | Domain Generalization, Image Generation | Unverified | 0 |
| Robust Bayesian Scene Reconstruction by Leveraging Retrieval-Augmented Priors | Nov 29, 2024 | Object, Retrieval | Unverified | 0 |
| GREAT: Geometry-Intention Collaborative Inference for Open-Vocabulary 3D Object Affordance Grounding | Nov 29, 2024 | Collaborative Inference, Object | Code Available | 1 |
| Semi-Supervised Neural Processes for Articulated Object Interactions | Nov 28, 2024 | Object | Unverified | 0 |
| Efficient Track Anything | Nov 28, 2024 | Object, Segmentation | Code Available | 7 |
| Lost & Found: Tracking Changes from Egocentric Observations in 3D Dynamic Scene Graphs | Nov 28, 2024 | Object | Code Available | 2 |
| Detailed Object Description with Controllable Dimensions | Nov 28, 2024 | Object | Code Available | 0 |
| Structured Object Language Modeling (SoLM): Native Structured Objects Generation Conforming to Complex Schemas with Self-Supervised Denoising | Nov 28, 2024 | Denoising, Language Modeling | Unverified | 0 |
| ObjectRelator: Enabling Cross-View Object Relation Understanding in Ego-Centric and Exo-Centric Videos | Nov 28, 2024 | Object, Object Localization | Unverified | 0 |
| SADG: Segment Any Dynamic Gaussian Without Object Trackers | Nov 28, 2024 | 3D Reconstruction, Autonomous Driving | Code Available | 2 |
| OPCap:Object-aware Prompting Captioning | Nov 27, 2024 | Attribute, Decoder | Unverified | 0 |
| SpotLight: Shadow-Guided Object Relighting via Diffusion | Nov 27, 2024 | Image Relighting, Neural Rendering | Code Available | 1 |
| Optimizing Multispectral Object Detection: A Bag of Tricks and Comprehensive Benchmarks | Nov 27, 2024 | Multispectral Object Detection, Object | Unverified | 0 |
| A comparison of extended object tracking with multi-modal sensors in indoor environment | Nov 27, 2024 | Object, Object Tracking | Unverified | 0 |
| G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation | Nov 27, 2024 | Imitation Learning, Object | Code Available | 0 |
| VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis | Nov 27, 2024 | Human-Object Interaction Detection, Image-text matching | Unverified | 0 |
| From Open Vocabulary to Open World: Teaching Vision Language Models to Detect Novel Objects | Nov 27, 2024 | Autonomous Driving, Object | Code Available | 1 |
| Exploring Aleatoric Uncertainty in Object Detection via Vision Foundation Models | Nov 26, 2024 | Object, object-detection | Unverified | 0 |
| Adversarial Bounding Boxes Generation (ABBG) Attack against Visual Object Trackers | Nov 26, 2024 | Object | Code Available | 0 |
| Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning | Nov 26, 2024 | Object, object-detection | Code Available | 0 |
| DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting | Nov 26, 2024 | Attribute, Diversity | Code Available | 2 |
| On-Road Object Importance Estimation: A New Dataset and A Model with Multi-Fold Top-Down Guidance | Nov 26, 2024 | Object | Unverified | 0 |
| GMFlow: Global Motion-Guided Recurrent Flow for 6D Object Pose Estimation | Nov 26, 2024 | 6D Pose Estimation using RGB, Computational Efficiency | Unverified | 0 |
| AnchorCrafter: Animate CyberAnchors Saling Your Products via Human-Object Interacting Video Generation | Nov 26, 2024 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation | Nov 26, 2024 | Object, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Object-centric proto-symbolic behavioural reasoning from pixels | Nov 26, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Online Episodic Memory Visual Query Localization with Egocentric Streaming Object Memory | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Leveraging Foundation Models To learn the shape of semi-fluid deformable objects | Nov 25, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Open Vocabulary Monocular 3D Object Detection | Nov 25, 2024 | 3D Object Detection, Monocular 3D Object Detection | Code Available | 2 |
| InTraGen: Trajectory-controlled Video Generation for Object Interactions | Nov 25, 2024 | Object, Video Generation | Code Available | 1 |
| Hyperspectral Image Cross-Domain Object Detection Method based on Spectral-Spatial Feature Alignment | Nov 25, 2024 | Object, object-detection | Unverified | 0 |
| Diffusion Features for Zero-Shot 6DoF Object Pose Estimation | Nov 25, 2024 | Object, Pose Estimation | Code Available | 0 |
| Open-Vocabulary Octree-Graph for 3D Scene Understanding | Nov 25, 2024 | Object, Scene Understanding | Unverified | 0 |