| Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | Jan 6, 2025 | Image to Video GenerationObject | —Unverified | 0 |
| Generalization-Enhanced Few-Shot Object Detection in Remote Sensing | Jan 5, 2025 | Few-Shot LearningFew-Shot Object Detection | CodeCode Available | 1 |
| ONDA-Pose: Occlusion-Aware Neural Domain Adaptation for Self-Supervised 6D Object Pose Estimation | Jan 1, 2025 | 6D Pose Estimation using RGBDomain Adaptation | —Unverified | 0 |
| EntitySAM: Segment Everything in Video | Jan 1, 2025 | DecoderObject | —Unverified | 0 |
| SET: Spectral Enhancement for Tiny Object Detection | Jan 1, 2025 | Objectobject-detection | —Unverified | 0 |
| Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space | Jan 1, 2025 | Instance SegmentationObject | CodeCode Available | 0 |
| Composing Parts for Expressive Object Generation | Jan 1, 2025 | AttributeDenoising | —Unverified | 0 |
| One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency | Jan 1, 2025 | Object | CodeCode Available | 2 |
| HOIGPT: Learning Long-Sequence Hand-Object Interaction with Language Models | Jan 1, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hierarchical Compact Clustering Attention (COCA) for Unsupervised Object-Centric Learning | Jan 1, 2025 | ClusteringDecoder | —Unverified | 0 |
| HORP: Human-Object Relation Priors Guided HOI Detection | Jan 1, 2025 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Language-Guided Salient Object Ranking | Jan 1, 2025 | ObjectSaliency Ranking | —Unverified | 0 |
| Cross-Modal Distillation for 2D/3D Multi-Object Discovery from 2D Motion | Jan 1, 2025 | Multi-object discoveryObject | —Unverified | 0 |
| BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting | Jan 1, 2025 | 3D Hand Pose Estimation3D Object Reconstruction | —Unverified | 0 |
| GLASS: Guided Latent Slot Diffusion for Object-Centric Learning | Jan 1, 2025 | Conditional Image GenerationImage Generation | —Unverified | 0 |
| Dragin3D: Image Editing by Dragging in 3D Space | Jan 1, 2025 | 3D Object Reconstructioncontinuous-control | —Unverified | 0 |
| Perceptual Inductive Bias Is What You Need Before Contrastive Learning | Jan 1, 2025 | Contrastive LearningDepth Estimation | —Unverified | 0 |
| LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion. | Jan 1, 2025 | Motion GenerationObject | —Unverified | 0 |
| Hand-held Object Reconstruction from RGB Video with Dynamic Interaction | Jan 1, 2025 | 3D Generation3D geometry | —Unverified | 0 |
| Learning Endogenous Attention for Incremental Object Detection | Jan 1, 2025 | Objectobject-detection | —Unverified | 0 |
| PIAD: Pose and Illumination agnostic Anomaly Detection | Jan 1, 2025 | Anomaly DetectionObject | —Unverified | 0 |
| Rethinking Correspondence-based Category-Level Object Pose Estimation | Jan 1, 2025 | ObjectPose Estimation | —Unverified | 0 |
| Prior-free 3D Object Tracking | Jan 1, 2025 | 3D Object TrackingObject | CodeCode Available | 1 |
| GO-N3RDet: Geometry Optimized NeRF-enhanced 3D Object Detector | Jan 1, 2025 | 3D Object DetectionNeRF | CodeCode Available | 1 |
| ROD-MLLM: Towards More Reliable Object Detection in Multimodal Large Language Models | Jan 1, 2025 | Large Language ModelObject | —Unverified | 0 |
| Camouflage Anything: Learning to Hide using Controlled Out-painting and Representation Engineering | Jan 1, 2025 | Camouflaged Object SegmentationObject | —Unverified | 0 |
| Robust Multi-Object 4D Generation for In-the-wild Videos | Jan 1, 2025 | ObjectScene Generation | —Unverified | 0 |
| UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation | Jan 1, 2025 | hand-object poseHand Pose Estimation | CodeCode Available | 0 |
| CaMuViD: Calibration-Free Multi-View Detection | Jan 1, 2025 | Camera CalibrationManagement | —Unverified | 0 |
| Radio Frequency Ray Tracing with Neural Object Representation for Enhanced RF Modeling | Jan 1, 2025 | Object | —Unverified | 0 |
| InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation | Jan 1, 2025 | BenchmarkingHuman-Object Interaction Detection | —Unverified | 0 |
| VODiff: Controlling Object Visibility Order in Text-to-Image Generation | Jan 1, 2025 | DenoisingImage Generation | —Unverified | 0 |
| Be More Specific: Evaluating Object-centric Realism in Synthetic Images | Jan 1, 2025 | Object | —Unverified | 0 |
| DynaMoDe-NeRF: Motion-aware Deblurring Neural Radiance Field for Dynamic Scenes | Jan 1, 2025 | DeblurringNeRF | —Unverified | 0 |
| MAD: Memory-Augmented Detection of 3D Objects | Jan 1, 2025 | Object | —Unverified | 0 |
| Autoregressive Sequential Pretraining for Visual Tracking | Jan 1, 2025 | ObjectObject Tracking | —Unverified | 0 |
| DreamRelation: Bridging Customization and Relation Generation | Jan 1, 2025 | Image GenerationObject | —Unverified | 0 |
| Learning Class Prototypes for Unified Sparse-Supervised 3D Object Detection | Jan 1, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| Learning Partonomic 3D Reconstruction from Image Collections | Jan 1, 2025 | 3D ReconstructionImage Generation | CodeCode Available | 0 |
| Style-Editor: Text-driven Object-centric Style Editing | Jan 1, 2025 | Object | —Unverified | 0 |
| Generalizable Object Keypoint Localization from Generative Priors | Jan 1, 2025 | Cross-Domain Few-ShotImage Generation | —Unverified | 0 |
| PICO: Reconstructing 3D People In Contact with Objects | Jan 1, 2025 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| CorrBEV: Multi-View 3D Object Detection by Correlation Learning with Multi-modal Prototypes | Jan 1, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| FusionSORT: Fusion Methods for Online Multi-object Visual Tracking | Jan 1, 2025 | ObjectVisual Tracking | CodeCode Available | 0 |
| RORem: Training a Robust Object Remover with Human-in-the-Loop | Jan 1, 2025 | Object | CodeCode Available | 2 |
| VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM | Dec 31, 2024 | ObjectVideo Understanding | CodeCode Available | 3 |
| B2Net: Camouflaged Object Detection via Boundary Aware and Boundary Fusion | Dec 31, 2024 | Objectobject-detection | —Unverified | 0 |
| YOLO-UniOW: Efficient Universal Open-World Object Detection | Dec 30, 2024 | Incremental LearningObject | CodeCode Available | 2 |
| Solar Filaments Detection using Active Contours Without Edges | Dec 30, 2024 | Image SegmentationObject | —Unverified | 0 |
| Differential Evolution Integrated Hybrid Deep Learning Model for Object Detection in Pre-made Dishes | Dec 29, 2024 | Objectobject-detection | —Unverified | 0 |