| HALLUCINOGEN: A Benchmark for Evaluating Object Hallucination in Large Visual-Language Models | Dec 29, 2024 | HallucinationObject | CodeCode Available | 0 |
| DriveEditor: A Unified 3D Information-Guided Framework for Controllable Object Editing in Driving Scenes | Dec 27, 2024 | Autonomous DrivingNovel View Synthesis | CodeCode Available | 1 |
| Interacted Object Grounding in Spatio-Temporal Human-Object Interactions | Dec 27, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Future Success Prediction in Open-Vocabulary Object Manipulation Tasks Based on End-Effector Trajectories | Dec 26, 2024 | ObjectPrediction | —Unverified | 0 |
| Revisiting Monocular 3D Object Detection from Scene-Level Depth Retargeting to Instance-Level Spatial Refinement | Dec 26, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Symbolic Disentangled Representations for Images | Dec 25, 2024 | DisentanglementObject | —Unverified | 0 |
| EC-Diffuser: Multi-Object Manipulation via Entity-Centric Behavior Generation | Dec 25, 2024 | ObjectZero-shot Generalization | —Unverified | 0 |
| CGCOD: Class-Guided Camouflaged Object Detection | Dec 25, 2024 | Objectobject-detection | CodeCode Available | 2 |
| Evaluating the Adversarial Robustness of Detection Transformers | Dec 25, 2024 | Adversarial RobustnessAutonomous Driving | —Unverified | 0 |
| Distortion-Aware Adversarial Attacks on Bounding Boxes of Object Detectors | Dec 25, 2024 | Objectobject-detection | CodeCode Available | 0 |
| COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection | Dec 24, 2024 | MambaObject | —Unverified | 0 |
| PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models | Dec 24, 2024 | 3D Generation3D Reconstruction | —Unverified | 0 |
| Multi-Point Positional Insertion Tuning for Small Object Detection | Dec 24, 2024 | Objectobject-detection | —Unverified | 0 |
| S-INF: Towards Realistic Indoor Scene Synthesis via Scene Implicit Neural Field | Dec 23, 2024 | Indoor Scene SynthesisObject | CodeCode Available | 0 |
| Cross-View Referring Multi-Object Tracking | Dec 23, 2024 | Cross-view Referring Multi-Object TrackingMulti-Object Tracking | CodeCode Available | 2 |
| OLiDM: Object-aware LiDAR Diffusion Models for Autonomous Driving | Dec 23, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection | Dec 22, 2024 | DecoderObject | CodeCode Available | 1 |
| Concept Guided Co-saliency Objection Detection | Dec 21, 2024 | Objectobject-detection | —Unverified | 0 |
| Generalizable Articulated Object Perception with Superpoints | Dec 21, 2024 | DecoderObject | —Unverified | 0 |
| Improving Object Detection for Time-Lapse Imagery Using Temporal Features in Wildlife Monitoring | Dec 20, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Affordance-Aware Object Insertion via Mask-Aware Dual Diffusion | Dec 19, 2024 | Object | CodeCode Available | 1 |
| Multi-Sensor Object Anomaly Detection: Unifying Appearance, Geometry, and Internal Properties | Dec 19, 2024 | Anomaly DetectionObject | CodeCode Available | 2 |
| Arti-PG: A Toolbox for Procedurally Synthesizing Large-Scale and Diverse Articulated Objects with Rich Annotations | Dec 19, 2024 | Object | —Unverified | 0 |
| Leveraging Color Channel Independence for Improved Unsupervised Object Detection | Dec 19, 2024 | DisentanglementObject | —Unverified | 0 |
| LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis | Dec 19, 2024 | Object | CodeCode Available | 2 |
| ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping | Dec 18, 2024 | ObjectVideo Generation | —Unverified | 0 |
| Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception | Dec 18, 2024 | DescriptiveHuman-Object Interaction Detection | CodeCode Available | 0 |
| Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition | Dec 18, 2024 | AttributeDescriptive | CodeCode Available | 0 |
| MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing | Dec 18, 2024 | DenoisingImage Generation | —Unverified | 0 |
| Temporally Consistent Object-Centric Learning by Contrasting Slots | Dec 18, 2024 | Inductive BiasObject | —Unverified | 0 |
| RelationField: Relate Anything in Radiance Fields | Dec 18, 2024 | 3d scene graph generationGraph Generation | CodeCode Available | 2 |
| Object Style Diffusion for Generalized Object Detection in Urban Scene | Dec 18, 2024 | Autonomous DrivingDomain Generalization | —Unverified | 0 |
| PixelMan: Consistent Object Editing with Diffusion Models via Pixel Manipulation and Generation | Dec 18, 2024 | Object | CodeCode Available | 1 |
| M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Dec 18, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics | Dec 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Efficient Oriented Object Detection with Enhanced Small Object Recognition in Aerial Images | Dec 17, 2024 | Computational EfficiencyObject | —Unverified | 0 |
| Differential Alignment for Domain Adaptive Object Detection | Dec 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance | Dec 17, 2024 | Image GenerationObject | CodeCode Available | 3 |
| PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts | Dec 17, 2024 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| Efficient Object-centric Representation Learning with Pre-trained Geometric Prior | Dec 16, 2024 | Computational EfficiencyDecoder | —Unverified | 0 |
| Probabilistic GOSPA: A Metric for Performance Evaluation of Multi-Object Filters with Uncertainties | Dec 16, 2024 | Object | CodeCode Available | 0 |
| MeshArt: Generating Articulated Meshes with Structure-Guided Transformers | Dec 16, 2024 | Object | —Unverified | 0 |
| Leveraging Retrieval-Augmented Tags for Large Vision-Language Understanding in Complex Scenes | Dec 16, 2024 | Contrastive LearningMultimodal Reasoning | —Unverified | 0 |
| Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion | Dec 16, 2024 | 6D Pose Estimation using RGBObject | —Unverified | 0 |
| Oriented Tiny Object Detection: A Dataset, Benchmark, and Dynamic Unbiased Learning | Dec 16, 2024 | Objectobject-detection | —Unverified | 0 |
| MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes | Dec 16, 2024 | DenoisingNovel View Synthesis | —Unverified | 0 |
| Redefining Normal: A Novel Object-Level Approach for Multi-Object Novelty Detection | Dec 15, 2024 | Knowledge DistillationNovelty Detection | CodeCode Available | 0 |
| Exploring Enhanced Contextual Information for Video-Level Object Tracking | Dec 15, 2024 | ObjectObject Tracking | CodeCode Available | 2 |
| DeMo: Decoupled Feature-Based Mixture of Experts for Multi-Modal Object Re-Identification | Dec 14, 2024 | Mixture-of-ExpertsObject | CodeCode Available | 2 |
| MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt | Dec 14, 2024 | MambaObject | CodeCode Available | 2 |