| GiVE: Guiding Visual Encoder to Perceive Overlooked Information | Oct 26, 2024 | Object, Question Answering | Unverified | 0 |
| Semantics in Robotics: Environmental Data Can't Yield Conventions of Human Behaviour | Oct 25, 2024 | Object | Unverified | 0 |
| MonoDGP: Monocular 3D Object Detection with Decoupled-Query and Geometry-Error Priors | Oct 25, 2024 | 3D Object Detection, Depth Estimation | Code Available | 2 |
| IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Oct 25, 2024 | Common Sense Reasoning, Language Modeling | Unverified | 0 |
| Non-rigid Relative Placement through 3D Dense Diffusion | Oct 25, 2024 | Object, Robot Manipulation | Unverified | 0 |
| DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Oct 25, 2024 | 3D Object Detection, Collision Avoidance | Unverified | 0 |
| Radar and Camera Fusion for Object Detection and Tracking: A Comprehensive Survey | Oct 24, 2024 | Object, object-detection | Unverified | 0 |
| Learning Global Object-Centric Representations via Disentangled Slot Attention | Oct 24, 2024 | Object, Position | Unverified | 0 |
| You Only Look Around: Learning Illumination Invariant Feature for Low-light Object Detection | Oct 24, 2024 | Object, object-detection | Code Available | 1 |
| Optimizing Edge Offloading Decisions for Object Detection | Oct 24, 2024 | Object, object-detection | Code Available | 1 |
| Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling | Oct 24, 2024 | 3DGS, Object | Unverified | 0 |
| Zero-shot Object Navigation with Vision-Language Models Reasoning | Oct 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments | Oct 23, 2024 | Object, Visual Navigation | Code Available | 1 |
| DREB-Net: Dual-stream Restoration Embedding Blur-feature Fusion Network for High-mobility UAV Object Detection | Oct 23, 2024 | Image Restoration, Object | Code Available | 1 |
| YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Oct 23, 2024 | Autonomous Driving, Image Dehazing | Unverified | 0 |
| OVT-B: A New Large-Scale Benchmark for Open-Vocabulary Multi-Object Tracking | Oct 23, 2024 | Multi-Object Tracking, Object | Code Available | 1 |
| YOLOv11: An Overview of the Key Architectural Enhancements | Oct 23, 2024 | Computational Efficiency, Instance Segmentation | Code Available | 0 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | Decoder, Instance Segmentation | Code Available | 2 |
| Multi Kernel Estimation based Object Segmentation | Oct 22, 2024 | Object, Segmentation | Code Available | 0 |
| Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations | Oct 22, 2024 | Camouflaged Object Segmentation, Large Language Model | Code Available | 0 |
| SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects | Oct 21, 2024 | Object | Unverified | 0 |
| Mitigating Object Hallucination via Concentric Causal Attention | Oct 21, 2024 | Hallucination, Object | Code Available | 2 |
| Few-shot target-driven instance detection based on open-vocabulary object detection models | Oct 21, 2024 | Image Augmentation, Object | Unverified | 0 |
| Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding | Oct 21, 2024 | 3D visual grounding, Object | Unverified | 0 |
| SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree | Oct 21, 2024 | Heuristic Search, Object | Code Available | 4 |
| Online Pseudo-Label Unified Object Detection for Multiple Datasets Training | Oct 21, 2024 | Object, object-detection | Unverified | 0 |
| Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases | Oct 21, 2024 | Object, Question Answering | Unverified | 0 |
| Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications | Oct 21, 2024 | Deep Learning, Model Optimization | Unverified | 0 |
| SOCIAL MEDIA MANAGEMENT SYSTEM PROJECT REPORT | Oct 20, 2024 | Management, Object | Unverified | 0 |
| TrackMe: A Simple and Effective Multiple Object Tracking Annotation Tool | Oct 20, 2024 | Multiple Object Tracking, Object | Code Available | 1 |
| GRS: Generating Robotic Simulation Tasks from Real-World Images | Oct 20, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Oct 20, 2024 | Few-Shot Object Detection, image-classification | Code Available | 0 |
| 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Oct 19, 2024 | 3D Multi-Object Tracking, Autonomous Driving | Code Available | 0 |
| Skill Generalization with Verbs | Oct 18, 2024 | Object, Skill Generalization | Unverified | 0 |
| Interpretable end-to-end Neurosymbolic Reinforcement Learning agents | Oct 18, 2024 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Accelerating Object Detection with YOLOv4 for Real-Time Applications | Oct 17, 2024 | Object, object-detection | Unverified | 0 |
| GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction | Oct 17, 2024 | Human-Object Interaction Detection, Image Generation | Unverified | 0 |
| Generative Location Modeling for Spatially Aware Object Insertion | Oct 17, 2024 | Object | Unverified | 0 |
| Object Pose Estimation Using Implicit Representation For Transparent Objects | Oct 17, 2024 | NeRF, Object | Unverified | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | All, Language Modeling | Code Available | 0 |
| VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding | Oct 17, 2024 | 3D geometry, 3D visual grounding | Code Available | 2 |
| Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring | Oct 17, 2024 | Object, object-detection | Unverified | 0 |
| Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images | Oct 16, 2024 | Image Captioning, Object | Unverified | 0 |
| Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Oct 16, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Stable Object Placement Planning From Contact Point Robustness | Oct 16, 2024 | Object | Unverified | 0 |
| MambaBEV: An efficient 3D detection model with Mamba2 | Oct 16, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Oct 16, 2024 | 3D Object Detection, Object | Unverified | 0 |
| Multiview Scene Graph | Oct 15, 2024 | Decoder, Object | Code Available | 2 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental Learning, Object | Code Available | 2 |
| Jigsaw++: Imagining Complete Shape Priors for Object Reassembly | Oct 15, 2024 | Object | Unverified | 0 |