| VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder | Dec 18, 2023 | 3D GenerationObject | —Unverified | 0 |
| CLIM: Contrastive Language-Image Mosaic for Region Representation | Dec 18, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Appearance-Based Refinement for Object-Centric Motion Segmentation | Dec 18, 2023 | Motion SegmentationObject | —Unverified | 0 |
| Diffusion-Based Particle-DETR for BEV Perception | Dec 18, 2023 | Autonomous VehiclesObject | —Unverified | 0 |
| Squeezed Edge YOLO: Onboard Object Detection on Edge Devices | Dec 18, 2023 | Autonomous NavigationObject | —Unverified | 0 |
| Primitive-based 3D Human-Object Interaction Modelling and Programming | Dec 17, 2023 | 3D ReconstructionHuman-Object Interaction Detection | —Unverified | 0 |
| PETDet: Proposal Enhancement for Two-Stage Fine-Grained Object Detection | Dec 16, 2023 | Multi-Task LearningObject | CodeCode Available | 1 |
| Simple Image-level Classification Improves Open-vocabulary Object Detection | Dec 16, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| End-to-End Training of Neural Networks for Automotive Radar Interference Mitigation | Dec 15, 2023 | Objectobject-detection | —Unverified | 0 |
| Painterly Image Harmonization by Learning from Painterly Objects | Dec 15, 2023 | Image HarmonizationObject | CodeCode Available | 1 |
| Implicit Modeling of Non-rigid Objects with Cross-Category Signals | Dec 15, 2023 | Object | —Unverified | 0 |
| Deep Active Perception for Object Detection using Navigation Proposals | Dec 15, 2023 | Objectobject-detection | —Unverified | 0 |
| Ins-HOI: Instance Aware Human-Object Interactions Recovery | Dec 15, 2023 | DescriptiveDisentanglement | CodeCode Available | 1 |
| CAGE: Controllable Articulation GEneration | Dec 15, 2023 | DenoisingObject | —Unverified | 0 |
| Multiscale Vision Transformer With Deep Clustering-Guided Refinement for Weakly Supervised Object Localization | Dec 15, 2023 | ClusteringDeep Clustering | —Unverified | 0 |
| ADA-YOLO: Dynamic Fusion of YOLOv8 and Adaptive Heads for Precise Image Detection and Diagnosis | Dec 14, 2023 | Blood Cell CountEdge-computing | —Unverified | 0 |
| UCMCTrack: Multi-Object Tracking with Uniform Camera Motion Compensation | Dec 14, 2023 | Motion CompensationMulti-Object Tracking | CodeCode Available | 2 |
| General Object Foundation Model for Images and Videos at Scale | Dec 14, 2023 | Instance SegmentationLong-tail Video Object Segmentation | CodeCode Available | 3 |
| SKDF: A Simple Knowledge Distillation Framework for Distilling Open-Vocabulary Knowledge to Open-world Object Detector | Dec 14, 2023 | Knowledge DistillationObject | CodeCode Available | 1 |
| Multi-Scene Generalized Trajectory Global Graph Solver with Composite Nodes for Multiple Object Tracking | Dec 14, 2023 | Multi-Object TrackingMultiple Object Tracking | —Unverified | 0 |
| Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy | Dec 14, 2023 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| Learned Fusion: 3D Object Detection using Calibration-Free Transformer Feature Fusion | Dec 14, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| UniTeam: Open Vocabulary Mobile Manipulation Challenge | Dec 14, 2023 | Object | —Unverified | 0 |
| LEMON: Learning 3D Human-Object Interaction Relation from 2D Images | Dec 14, 2023 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| PhyOT: Physics-informed object tracking in surveillance cameras | Dec 14, 2023 | Deep LearningObject | —Unverified | 0 |
| DualTeacher: Bridging Coexistence of Unlabelled Classes for Semi-supervised Incremental Object Detection | Dec 13, 2023 | Objectobject-detection | CodeCode Available | 1 |
| NViST: In the Wild New View Synthesis from a Single Image with Transformers | Dec 13, 2023 | Generalizable Novel View SynthesisNovel View Synthesis | —Unverified | 0 |
| Object-Centric Conformance Alignments with Synchronization (Extended Version) | Dec 13, 2023 | Object | —Unverified | 0 |
| Neural Radiance Fields for Transparent Object Using Visual Hull | Dec 13, 2023 | NeRFNovel View Synthesis | —Unverified | 0 |
| Mono3DVG: 3D Visual Grounding in Monocular Images | Dec 13, 2023 | 3D Object Detection3D visual grounding | CodeCode Available | 1 |
| Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation | Dec 13, 2023 | ObjectPose Estimation | —Unverified | 0 |
| Chat-Scene: Bridging 3D Scene and Large Language Models with Object Identifiers | Dec 13, 2023 | 3D Question Answering (3D-QA)Attribute | CodeCode Available | 2 |
| CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation | Dec 13, 2023 | DecoderObject | —Unverified | 0 |
| Unveiling Parts Beyond Objects:Towards Finer-Granularity Referring Expression Segmentation | Dec 13, 2023 | DescriptiveObject | CodeCode Available | 1 |
| MedYOLO: A Medical Image Object Detection Framework | Dec 12, 2023 | Computed Tomography (CT)Object | CodeCode Available | 1 |
| Mixed Pseudo Labels for Semi-Supervised Object Detection | Dec 12, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Template Free Reconstruction of Human-object Interaction with Procedural Interaction Generation | Dec 12, 2023 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos | Dec 12, 2023 | Object | CodeCode Available | 1 |
| Shifted Autoencoders for Point Annotation Restoration in Object Counting | Dec 12, 2023 | General KnowledgeObject | —Unverified | 0 |
| Efficient Object Detection in Autonomous Driving using Spiking Neural Networks: Performance, Energy Consumption Analysis, and Insights into Open-set Object Discovery | Dec 12, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Edge Wasserstein Distance Loss for Oriented Object Detection | Dec 12, 2023 | Objectobject-detection | —Unverified | 0 |
| ADOD: Adaptive Domain-Aware Object Detection with Residual Attention for Underwater Environments | Dec 11, 2023 | domain classificationDomain Generalization | CodeCode Available | 0 |
| DFGET: Displacement-Field Assisted Graph Energy Transmitter for Gland Instance Segmentation | Dec 11, 2023 | Instance SegmentationMulti-Task Learning | —Unverified | 0 |
| Learning Polynomial Representations of Physical Objects with Application to Certifying Correct Packing Configurations | Dec 11, 2023 | ObjectOne-Class Classification | —Unverified | 0 |
| SqueezeSAM: User friendly mobile interactive segmentation | Dec 11, 2023 | Data AugmentationInteractive Segmentation | —Unverified | 0 |
| SimMining-3D: Altitude-Aware 3D Object Detection in Complex Mining Environments: A Novel Dataset and ROS-Based Automatic Annotation Pipeline | Dec 11, 2023 | 3D Object DetectionObject | —Unverified | 0 |
| Mitigating Perspective Distortion-induced Shape Ambiguity in Image Crops | Dec 11, 2023 | 3D Object DetectionDepth Estimation | —Unverified | 0 |
| RCA-NOC: Relative Contrastive Alignment for Novel Object Captioning | Dec 11, 2023 | Contrastive LearningObject | —Unverified | 0 |
| HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models | Dec 11, 2023 | Human-Object Interaction DetectionMotion Generation | —Unverified | 0 |
| Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection | Dec 11, 2023 | Density EstimationObject | CodeCode Available | 0 |